SPoID: Incomplete Sequence Mining for Sequential Patterns

Céline Fiot

Report. Year: 2007


Laboratoire d'Informatique de Robotique et de Microélectronique de Montpellier

Abstract

Industrial databases often contain a large proportion of missing values. During the knowledge discovery process, a preprocessing step is therefore often needed to handle these incomplete data, either by deleting them or by estimating them. When the data mining task is frequent sequence mining, incomplete data are usually deleted, which causes a substantial loss of information, and the extracted knowledge becomes less representative of the database. We therefore propose a method that uses the partial information contained in incomplete records, only temporarily ignoring the missing part of each record. Experiments on various synthetic datasets show the validity of our proposal, both in terms of quality and in terms of robustness to the rate of missing values.

Domains

Databases [cs.DB], Artificial Intelligence [cs.AI]

https://hal-lirmm.ccsd.cnrs.fr/lirmm-00136070

Submitted on: Monday, 12 March 2007, 10:38:01

Last modified on: Wednesday, 5 July 2023, 17:05:04

Dates and versions

lirmm-00136070, version 1 (12-03-2007)

Identifiers

HAL Id: lirmm-00136070, version 1

Cite

Céline Fiot. SPoID: Incomplete Sequence Mining for Sequential Patterns. RR-07006, 2007. ⟨lirmm-00136070⟩

Collections

CNRS LIRMM LARA MIPS UNIV-MONTPELLIER
