Modeling and Clustering Users with Evolving Profiles in Usage Streams

Chongsheng Zhang 1 Florent Masseglia 2 Xiangliang Zhang 3
1 AxIS - Usage-centered design, analysis and improvement of information systems
CRISAM - Inria Sophia Antipolis - Méditerranée, Inria Paris-Rocquencourt
2 ZENITH - Scientific Data Management
LIRMM - Laboratoire d'Informatique de Robotique et de Microélectronique de Montpellier, CRISAM - Inria Sophia Antipolis - Méditerranée
Abstract : Existing data stream models commonly assume that users' records or profiles in a data stream are not updated once they arrive. In many applications, such as web usage, however, users' records or profiles may evolve over time. Such streaming transactions are referred to as bi-streaming data: the data evolves temporally in two dimensions, namely the flow of transactions, as in traditional data streams, and the evolution of users' profiles inside the stream. This second dimension distinguishes bi-streaming data from traditional data streams, and the two-dimensional evolution makes it difficult to model and cluster the data in order to explore users' behaviours. This paper proposes three models to summarize bi-streaming data: the batch model, the Evolving Objects (EO) model and the Dynamic Data Stream (DDS) model. By creating, updating and deleting user profiles, the models summarize the behaviours of each user as an object. Based on these models, clustering algorithms are employed to identify user groups. The proposed models are tested on a real-world data set; the results show that the DDS model summarizes bi-streaming data efficiently and effectively, providing a better basis for clustering user profiles than the other two models.
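The create, update and delete operations on user profiles mentioned in the abstract can be illustrated with a minimal sketch. It is not the paper's batch, EO or DDS model: the class `ProfileStore`, its methods, the item-frequency representation of a profile and the `max_idle` expiry threshold are all assumptions made here for illustration only.

```python
from collections import Counter


class ProfileStore:
    """Hypothetical per-user profile summary (not the paper's EO or DDS model)."""

    def __init__(self, max_idle):
        self.max_idle = max_idle   # assumed expiry threshold, in time units
        self.profiles = {}         # user_id -> Counter of item frequencies
        self.last_seen = {}        # user_id -> timestamp of latest transaction

    def observe(self, user_id, items, timestamp):
        """Create the user's profile if absent, otherwise update it in place."""
        profile = self.profiles.setdefault(user_id, Counter())
        profile.update(items)
        self.last_seen[user_id] = timestamp

    def expire(self, now):
        """Delete profiles with no transaction in the last `max_idle` time units."""
        stale = [u for u, t in self.last_seen.items() if now - t > self.max_idle]
        for u in stale:
            del self.profiles[u]
            del self.last_seen[u]
        return stale


# Toy stream of (user, items, timestamp) transactions; the data is made up.
store = ProfileStore(max_idle=10)
store.observe("u1", ["home", "news"], timestamp=0)
store.observe("u2", ["home"], timestamp=3)
store.observe("u1", ["news", "sports"], timestamp=5)  # u1's profile evolves
removed = store.expire(now=14)                        # u2 has been idle too long
```

In this sketch each surviving profile is a single object per user, which a clustering algorithm could then take as input. How the actual models weight, age or discard profile content is described in the paper itself.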
Document type :
Conference papers
Cited literature : 14 references

https://hal-lirmm.ccsd.cnrs.fr/lirmm-00753791
Contributor : Florent Masseglia
Submitted on : Monday, November 19, 2012 - 4:12:03 PM
Last modification on : Saturday, February 23, 2019 - 7:06:02 PM
Long-term archiving on : Thursday, February 21, 2013 - 11:31:25 AM

File

time12.pdf
Files produced by the author(s)

Identifiers

  • HAL Id : lirmm-00753791, version 1

Citation

Chongsheng Zhang, Florent Masseglia, Xiangliang Zhang. Modeling and Clustering Users with Evolving Profiles in Usage Streams. TIME'2012: 19th International Symposium on Temporal Representation and Reasoning, Sep 2012, United Kingdom. pp.133-140. ⟨lirmm-00753791⟩
