PGLCM: efficient parallel mining of closed frequent gradual itemsets

Numerical data (e.g., DNA micro-array data, sensor data) pose a challenging problem for existing frequent pattern mining methods, which can hardly handle them. In this context, gradual patterns have recently been proposed to extract covariations of attributes, such as: “When X increases, Y decreases”. Some algorithms exist for mining frequent gradual patterns, but they cannot scale to real-world databases. In this paper we present GLCM, the first algorithm for mining closed frequent gradual patterns, which offers strong complexity guarantees: the mining time is linear in the number of closed frequent gradual itemsets. Our experimental study shows that GLCM is two orders of magnitude faster than the state of the art, with constant, low memory usage. We also present PGLCM, a parallelization of GLCM capable of exploiting multicore processors, with good scale-up properties on complex datasets. These are the first algorithms capable of mining large real-world datasets to discover gradual patterns.
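To make the notion of a gradual pattern such as “When X increases, Y decreases” concrete, the sketch below computes a support value for one pattern on a tiny table. It is a naive illustration, not GLCM or PGLCM. It assumes one definition of support used in the gradual pattern literature: the length of the longest sequence of rows that can be ordered so that every attribute varies strictly in the required direction, divided by the number of rows. The function name `gradual_support`, the `(attribute, direction)` encoding, and the sample data are all hypothetical.

```python
from functools import lru_cache


def gradual_support(rows, pattern):
    """Naive support of a gradual pattern (illustrative only, not GLCM).

    rows    : list of dicts mapping attribute name -> numeric value
    pattern : list of (attribute, direction) pairs, direction is "+" or "-"

    Assumed definition: longest chain of rows respecting every variation,
    divided by the total number of rows.
    """
    if not pattern:
        raise ValueError("pattern must contain at least one gradual item")
    n = len(rows)
    if n == 0:
        return 0.0

    def precedes(i, j):
        # Row i may come before row j only if each attribute moves
        # strictly in the direction the pattern requires.
        for attr, direction in pattern:
            a, b = rows[i][attr], rows[j][attr]
            if direction == "+" and not a < b:
                return False
            if direction == "-" and not a > b:
                return False
        return True

    @lru_cache(maxsize=None)
    def longest_from(i):
        # Length of the longest chain starting at row i. Strict comparisons
        # make the precedence relation acyclic, so the recursion terminates.
        best = 1
        for j in range(n):
            if j != i and precedes(i, j):
                best = max(best, 1 + longest_from(j))
        return best

    return max(longest_from(i) for i in range(n)) / n


# Hypothetical data: X mostly rises while Y mostly falls.
rows = [
    {"X": 1, "Y": 9},
    {"X": 2, "Y": 7},
    {"X": 3, "Y": 8},
    {"X": 4, "Y": 5},
]
print(gradual_support(rows, [("X", "+"), ("Y", "-")]))  # 0.75
```

Here three of the four rows can be chained (for example the first, second and fourth), so the support is 0.75. This brute-force check costs quadratic time per pattern and is only meant to illustrate what is being counted.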

Keywords

Parallelism; Data mining; Frequent pattern mining; Gradual itemsets

Domains

Databases [cs.DB]

Main file

2014_do_kais_draft.pdf (1.43 MB)

Origin: Files produced by the author(s)

Contributor: Anne Laurent

https://hal-lirmm.ccsd.cnrs.fr/lirmm-01381085

Submitted on: Monday, October 7, 2019, 11:25:46

Last modified on: Wednesday, December 18, 2024, 09:36:06

Dates and versions

lirmm-01381085, version 1 (7 October 2019)

Identifiers

HAL Id: lirmm-01381085, version 1
DOI: 10.1007/s10115-014-0749-8

Cite

Trong Dinh Thac Do, Alexandre Termier, Anne Laurent, Benjamin Negrevergne, Behrooz Omidvar Tehrani, et al. PGLCM: efficient parallel mining of closed frequent gradual itemsets. Knowledge and Information Systems (KAIS), 2015, 43 (3), pp.497-527. ⟨10.1007/s10115-014-0749-8⟩. ⟨lirmm-01381085⟩


Collections

UNIV-RENNES1 UGA CNRS INRIA INSA-RENNES IRISA LIG LIRMM CENTRALESUPELEC INRIA2 LIG-TDCGE-SLIDE UR1-MATH-STIC UR1-UFR-ISTIC MIPS UNIV-MONTPELLIER FADO UNIV-RENNES INSA-GROUPE UR1-MATH-NUM LIG_SIDCH WEB-CUBE

537 views

247 downloads