Aggregation-Aware Compression of Probabilistic Streaming Time Series

Reza Akbarinia; Florent Masseglia

doi:10.1007/978-3-319-21024-7_16

Conference paper, Year: 2015

Reza Akbarinia

Role: Author
PersonId : 172647
IdHAL : reza-akbarinia
ORCID : 0000-0002-7098-0361
IdRef : 119863421

Scientific Data Management

Florent Masseglia

Role: Author
PersonId : 172896
IdHAL : florent-masseglia
ORCID : 0000-0002-1149-585X
IdRef : 120528681

Scientific Data Management

Abstract

In recent years, there has been growing interest in probabilistic data management. We focus on probabilistic time series, whose main characteristic is high data volume, which calls for efficient compression techniques. To date, most work on probabilistic data reduction has produced synopses that minimize the representation error with respect to the original data. However, in most cases, the compressed data will be meaningless for common queries involving aggregation operators such as SUM or AVG. We propose PHA (Probabilistic Histogram Aggregation), a compression technique whose objective is to minimize the error of such queries over compressed probabilistic data. We incorporate the aggregation operator specified by the end user directly into the compression technique, and obtain much lower error in the long term. We also adopt a global, error-aware strategy to manage large sets of probabilistic time series, in which the available memory is carefully balanced among the series according to their individual variability.

Domains

Information Retrieval [cs.IR]

Main file

pha_mldm.pdf (974.71 KB)

Origin: Files produced by the author(s)

Contributor: Reza Akbarinia

https://hal-lirmm.ccsd.cnrs.fr/lirmm-01162366

Submitted on: Wednesday, June 10, 2015, 12:12:53

Last modified on: Thursday, February 15, 2024, 03:31:44

Long-term archiving on: Tuesday, April 25, 2017, 06:23:52

Dates and versions

lirmm-01162366 , version 1 (10-06-2015)

Identifiers

HAL Id : lirmm-01162366 , version 1
DOI : 10.1007/978-3-319-21024-7_16

Cite

Reza Akbarinia, Florent Masseglia. Aggregation-Aware Compression of Probabilistic Streaming Time Series. MLDM: Machine Learning and Data Mining, Jul 2015, Hamburg, Germany. pp.232-247, ⟨10.1007/978-3-319-21024-7_16⟩. ⟨lirmm-01162366⟩

Collections

UNIV-RENNES1 CNRS INRIA IRISA ZENITH LIRMM INRIA2 UR1-MATH-STIC UR1-UFR-ISTIC MIPS UNIV-MONTPELLIER UNIV-RENNES UR1-MATH-NUM

219 views

426 downloads
