SPoID: Do Not Throw Meaningful Incomplete Sequences Away!

Céline Fiot; Anne Laurent; Maguelonne Teisseire

Communication Dans Un Congrès Année : 2007

SPoID: Do Not Throw Meaningful Incomplete Sequences Away!

(1) , (2) , (2)

1
2

Céline Fiot

Fonction : Auteur
PersonId : 835134

Laboratoire d'Informatique de Robotique et de Microélectronique de Montpellier

Anne Laurent

Fonction : Auteur
PersonId : 21743
IdHAL : anne-laurent
ORCID : 0000-0003-3708-6429
IdRef : 075173735

Fouille de données environnementales

Maguelonne Teisseire

Fonction : Auteur
PersonId : 8645
IdHAL : maguelonne-teisseire
ORCID : 0000-0001-9313-6414
IdRef : 117436593

Fouille de données environnementales

Résumé

Industrial databases often contain a large amount of unﬁlled information. During the knowledge discovery process one processing step is often necessary in order to remove these incomplete data either by deleting or assessing them. When the data mining task consists in mining for frequent sequences, incomplete data are, most of the time, deleted, which leads to an important loss of information. Extracted knowledge then becomes less representative of the database. Therefore we propose a method that uses the partial information contained in incomplete records, only temporary ignoring the missing part of the record. Experiments run on various synthetic datasets show the validity of our proposal as well in terms of quality as in terms of the robustness to the rate of missing values.

Mots clés

Data mining Sequential patterns missing values incomplete data

Domaines

Autre

Fichier principal

proceeding-eusflat-2007-I pages 329 - 336.pdf (618.28 Ko)

Origine	Fichiers éditeurs autorisés sur une archive ouverte

Martine Peridier : Connectez-vous pour contacter le contributeur

https://hal-lirmm.ccsd.cnrs.fr/lirmm-00173030

Soumis le : samedi 21 septembre 2019-16:46:51

Dernière modification le : mardi 12 mars 2024-10:46:30

Archivage à long terme le : dimanche 9 février 2020-03:09:35

Dates et versions

lirmm-00173030 , version 1 (21-09-2019)

Identifiants

HAL Id : lirmm-00173030 , version 1

Citer

Céline Fiot, Anne Laurent, Maguelonne Teisseire. SPoID: Do Not Throw Meaningful Incomplete Sequences Away!. EUSFLAT, European Society For Fuzzy Logic and Technologies, Sep 2007, Ostrava, Czech Republic. pp.329-336. ⟨lirmm-00173030⟩

Exporter

BibTeX XML-TEI Dublin Core DC Terms EndNote DataCite

Collections

CNRS LIRMM MIPS UNIV-MONTPELLIER

125 Consultations

48 Téléchargements

SPoID: Do Not Throw Meaningful Incomplete Sequences Away!

Résumé

Mots clés

Domaines

Dates et versions

Identifiants

Citer

Exporter

Collections

Partager