Need For Speed: Mining Sequential Patterns in Data Stream

Chedy Raïssi; Pascal Poncelet; Maguelonne Teisseire

Conference paper, Year: 2005

Chedy Raïssi

Role: Author
PersonId : 16730
IdHAL : chedy-raissi
IdRef : 125691750

Pascal Poncelet

Role: Author
PersonId : 6247
IdHAL : pascal-poncelet
ORCID : 0000-0002-8277-3490
IdRef : 069260613

Maguelonne Teisseire

Role: Author
PersonId : 8645
IdHAL : maguelonne-teisseire
ORCID : 0000-0001-9313-6414
IdRef : 117436593

TATOO - Environmental data mining

Abstract

Recently, the data mining community has focused on a challenging new model in which data arrive sequentially as continuous, rapid streams, often referred to as data streams or streaming data. Data from many real-world applications are handled more appropriately by the data stream model than by traditional static databases. Examples include stock tickers, network traffic measurements, transaction flows in retail chains, click streams, sensor networks, and telecommunications call records. In this paper we propose a new approach, called SPEED (Sequential Patterns Efficient Extraction in Data streams), to identify sequential patterns in a data stream. To the best of our knowledge, this is the first approach defined for mining sequential patterns in streaming data. The main originality of our mining method is that it uses a novel data structure to maintain frequent sequential patterns, coupled with a fast pruning strategy. At any time, users can issue requests for frequent sequences over an arbitrary time interval. Furthermore, our approach produces an approximate support answer with the assurance that the error will not exceed a user-defined frequency error threshold. Finally, the proposed method is evaluated through a series of experiments on different datasets.
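The abstract does not describe SPEED's data structure or pruning strategy, so the following is only an illustration of what an approximate support answer bounded by a user-defined error threshold can look like in a stream setting. It is a minimal sketch of the classic Lossy Counting scheme applied to single items rather than to sequences, and it is not the authors' method. The names `LossyCounter`, `add`, `frequent`, and `epsilon` are hypothetical and chosen for this example.

```python
import math


class LossyCounter:
    """Lossy Counting over a stream of hashable items (illustrative only).

    Each stored count underestimates the true count by at most
    epsilon * n, where n is the number of items seen so far.
    """

    def __init__(self, epsilon):
        if not 0 < epsilon < 1:
            raise ValueError("epsilon must be in (0, 1)")
        self.epsilon = epsilon
        self.width = math.ceil(1 / epsilon)  # items per bucket
        self.n = 0                           # items seen so far
        self.entries = {}                    # item -> [count, max_undercount]

    def add(self, item):
        self.n += 1
        bucket = math.ceil(self.n / self.width)
        if item in self.entries:
            self.entries[item][0] += 1
        else:
            # A new entry may have been missed in up to bucket - 1 earlier buckets.
            self.entries[item] = [1, bucket - 1]
        if self.n % self.width == 0:
            # Bucket boundary: prune entries that cannot be frequent.
            self.entries = {
                key: value
                for key, value in self.entries.items()
                if value[0] + value[1] > bucket
            }

    def frequent(self, support):
        """Return items whose stored count is at least (support - epsilon) * n."""
        threshold = (support - self.epsilon) * self.n
        return {
            key: value[0]
            for key, value in self.entries.items()
            if value[0] >= threshold
        }
```

Calling `add` once per stream element and then `frequent(s)` returns every item whose true frequency is at least `s * n`, possibly together with some items whose frequency lies between `(s - epsilon) * n` and `s * n`. Pruning at bucket boundaries keeps memory bounded, which is the same kind of trade-off between accuracy and space that the abstract alludes to.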

https://hal-lirmm.ccsd.cnrs.fr/lirmm-00106085

Submitted on: Friday, 13 October 2006, 10:23:08

Last modified on: Wednesday, 13 August 2025, 03:10:27

Dates and versions

lirmm-00106085 , version 1 (13-10-2006)

Identifiers

HAL Id : lirmm-00106085 , version 1

Cite

Chedy Raïssi, Pascal Poncelet, Maguelonne Teisseire. Need For Speed: Mining Sequential Patterns in Data Stream. BDA: Bases de Données Avancées, Oct 2005, Saint-Malo. ⟨lirmm-00106085⟩
