Mining Evolving data Streams for Frequent Patterns

Abstract : A data stream is a potentially uninterrupted flow of data. Mining this flow makes it necessary to cope with uncertainty, as only a part of the stream can be stored. In this paper, we evaluate a statistical technique which biases the estimation of the support of patterns, so as to maximize either the precision or the recall, as chosen by the user, and limit the degradation of the other criterion. Theoretical results show that the technique is not far from the optimum, from the statistical standpoint. Experiments performed tend to demonstrate its potential, as it remains robust even under significant distribution drifts.
Type de document :
Article dans une revue
Pattern Recognition, Elsevier, 2007, 40 (2), pp.492-503. 〈10.1016/j.patcog.2006.03.006〉
Liste complète des métadonnées

https://hal-lirmm.ccsd.cnrs.fr/lirmm-00365474
Contributeur : Pascal Poncelet <>
Soumis le : mardi 3 mars 2009 - 15:11:33
Dernière modification le : samedi 3 novembre 2018 - 22:20:23

Identifiants

Collections

Citation

Pierre-Alain Laur, Richard Nock, Jean-Émile Symphor, Pascal Poncelet. Mining Evolving data Streams for Frequent Patterns. Pattern Recognition, Elsevier, 2007, 40 (2), pp.492-503. 〈10.1016/j.patcog.2006.03.006〉. 〈lirmm-00365474〉

Partager

Métriques

Consultations de la notice

600