Skip to Main content Skip to Navigation
Conference papers

Modeling and Clustering Users with Evolving Profiles in Usage Streams

Chongsheng Zhang 1 Florent Masseglia 2 Xiangliang Zhang 3 
1 AxIS - Usage-centered design, analysis and improvement of information systems
CRISAM - Inria Sophia Antipolis - Méditerranée , Inria Paris-Rocquencourt
2 ZENITH - Scientific Data Management
LIRMM - Laboratoire d'Informatique de Robotique et de Microélectronique de Montpellier, CRISAM - Inria Sophia Antipolis - Méditerranée
Abstract : Existing data stream models commonly assume that users' records or profiles in data streams will not be updated once they arrive. In many applications such as web usage, however, the users' records/profiles may evolve along time. This kind of streaming transactions are referred to as bi-streaming data - the data evolves temporally in two dimensions, the flowing of transactions as with the traditional data streams, and the evolving of users' profiles inside the streams, which makes bi-streaming data different from traditional data streams. The two-dimensional evolving of bi-streaming data brings difficulties on modeling and clustering for exploring the users' behaviours. This paper will propose three models to summarize bi-streaming data, which are the batch model, the Evolving Objects (EO) model and the Dynamic Data Stream (DDS) model. Through creating, updating and deleting user profiles, the models summarize the behaviours of each user as an object. Based on these models, clustering algorithms are employed to identify the user groups. The proposed models are tested on a real-world data set showing that the DDS model can summarize the bi-streaming data efficiently and effectively, providing better basis for clustering user profiles than the other two models.
Document type :
Conference papers
Complete list of metadata

Cited literature [14 references]  Display  Hide  Download
Contributor : Florent Masseglia Connect in order to contact the contributor
Submitted on : Monday, November 19, 2012 - 4:12:03 PM
Last modification on : Tuesday, September 6, 2022 - 4:56:13 PM
Long-term archiving on: : Thursday, February 21, 2013 - 11:31:25 AM


Files produced by the author(s)


  • HAL Id : lirmm-00753791, version 1


Chongsheng Zhang, Florent Masseglia, Xiangliang Zhang. Modeling and Clustering Users with Evolving Profiles in Usage Streams. TIME'2012: 19th International Symposium on Temporal Representation and Reasoning, Sep 2012, United Kingdom. pp.133-140. ⟨lirmm-00753791⟩



Record views


Files downloads