Need For Speed: Mining Sequential Patterns in Data Stream

Chedy Raïssi Pascal Poncelet Maguelonne Teisseire 1
1 TATOO - Fouille de données environnementales
LIRMM - Laboratoire d'Informatique de Robotique et de Microélectronique de Montpellier
Abstract : Recently, the data mining community has focused on a new challenging model where data arrives sequentially in the form of continuous rapid streams. It is often referred to as data streams or streaming data. Many real-world applications data are more appropriately handled by the data stream model than by traditional static databases. Such applications can be: stock tickers, network traffic measurements, transaction flows in retail chains, click streams, sensor networks and telecommunications call records. In this paper we propose a new approach, called SPEED (Sequential Patterns Efficient Extraction in Data streams), to identify sequential patterns in a data stream. To the best of our knowledge this is the first approach defined for mining sequential patterns in streaming data. The main originality of our mining method is that we use a novel data structure to maintain frequent sequential patterns coupled with a fast pruning strategy. At any time, users can issue requests for frequent sequences over an arbitrary time interval. Furthermore, our approach produces an approximate support answer with an assurance that it will not bypass a user-defined frequency error threshold. Finally the proposed method is analyzed by a series of experiments on different datasets.
Type de document :
Communication dans un congrès
BDA: Bases de Données Avancées, Oct 2005, Saint-Malo, 2005
Liste complète des métadonnées

https://hal-lirmm.ccsd.cnrs.fr/lirmm-00106085
Contributeur : Christine Carvalho de Matos <>
Soumis le : vendredi 13 octobre 2006 - 10:23:08
Dernière modification le : jeudi 11 janvier 2018 - 06:26:17

Identifiants

  • HAL Id : lirmm-00106085, version 1

Collections

Citation

Chedy Raïssi, Pascal Poncelet, Maguelonne Teisseire. Need For Speed: Mining Sequential Patterns in Data Stream. BDA: Bases de Données Avancées, Oct 2005, Saint-Malo, 2005. 〈lirmm-00106085〉

Partager

Métriques

Consultations de la notice

74