Approximate Sequential Patterns for Incomplete Databases - LIRMM - Laboratoire d’Informatique, de Robotique et de Microélectronique de Montpellier Access content directly
Reports Year : 2007

Approximate Sequential Patterns for Incomplete Databases

Abstract

Industrial databases often contains a large amount of unfilled information. During the knowledge discovery process one specific processing step is often necessary in order to remove these incomplete data either by deleting them or by assessing them. When the data mining task consists in mining frequent sequences, incomplete data are, most of the time, deleted, which leads to an important loss of information. Extracted knowledge becomes so less representative of the whole database. We thus propose a method that uses estimation of missing values contained into incomplete records, while computing the frequency of sequences.
No file

Dates and versions

lirmm-00129415 , version 1 (07-02-2007)

Identifiers

  • HAL Id : lirmm-00129415 , version 1

Cite

Céline Fiot. Approximate Sequential Patterns for Incomplete Databases. RR-07003, 2007. ⟨lirmm-00129415⟩
57 View
0 Download

Share

Gmail Facebook X LinkedIn More