LIRMM - Laboratoire d’Informatique, de Robotique et de Microélectronique de Montpellier
Report, Year: 2007

Approximate Sequential Patterns for Incomplete Databases

Abstract

Industrial databases often contain a large amount of missing information. During the knowledge discovery process, a dedicated preprocessing step is often needed to handle these incomplete data, either by deleting them or by estimating them. When the data mining task consists of mining frequent sequences, incomplete data are most often deleted, which leads to a significant loss of information. The extracted knowledge thus becomes less representative of the whole database. We therefore propose a method that uses an estimation of the missing values contained in incomplete records when computing the frequency of sequences.
No file deposited

Dates and versions

lirmm-00129415, version 1 (07-02-2007)

Identifiers

  • HAL Id: lirmm-00129415, version 1

Cite

Céline Fiot. Approximate Sequential Patterns for Incomplete Databases. RR-07003, 2007. ⟨lirmm-00129415⟩