Reduce, You Say: What NoSQL Can Do for Data Aggregation and BI in Large Repositories - LIRMM - Laboratoire d’Informatique, de Robotique et de Microélectronique de Montpellier
Conference paper, Year: 2011

Reduce, You Say: What NoSQL Can Do for Data Aggregation and BI in Large Repositories

Anne Laurent
Michel Sala
  • Role: Author
  • PersonId : 938397
Nicolas Sicard
  • Role: Author

Abstract

Data aggregation is one of the key features used in databases, especially for Business Intelligence (e.g., ETL, OLAP) and analytics/data mining. In SQL databases, aggregation is used to prepare and visualize data for deeper analyses. However, these operations are often infeasible on very large volumes of data because of their memory and time consumption. In this paper, we show how NoSQL databases such as MongoDB and its key-value stores, thanks to the native MapReduce algorithm, can provide an efficient framework to aggregate large volumes of data. We provide background on the MapReduce algorithm and on the different kinds of NoSQL databases (read-intensive vs. write-intensive). We investigate how to efficiently model the data framework for BI and analytics. For this purpose, we focus on read-intensive NoSQL databases using MongoDB, and we show how NoSQL and MapReduce can help handle large volumes of data.
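As a rough illustration of the kind of aggregation the abstract describes, the following minimal sketch runs a MapReduce-style "sum per group" in plain Python. It is not the authors' implementation and does not call MongoDB. The document fields (`region`, `amount`) and the function names (`map_sale`, `reduce_sum`, `map_reduce`) are hypothetical, chosen only for this example.

```python
from collections import defaultdict

def map_sale(doc):
    """Map step: emit one (key, value) pair per document (hypothetical fields)."""
    yield doc["region"], doc["amount"]

def reduce_sum(key, values):
    """Reduce step: collapse all values emitted for one key into a single total."""
    return sum(values)

def map_reduce(docs, mapper, reducer):
    """Group the mapped pairs by key, then reduce each group independently."""
    groups = defaultdict(list)
    for doc in docs:
        for key, value in mapper(doc):
            groups[key].append(value)
    return {key: reducer(key, values) for key, values in groups.items()}

# Toy collection standing in for a document store.
sales = [
    {"region": "north", "amount": 120.0},
    {"region": "south", "amount": 80.0},
    {"region": "north", "amount": 30.0},
]

totals = map_reduce(sales, map_sale, reduce_sum)
print(totals)
```

Because each key is reduced independently of the others, the grouping and reduction can in principle be distributed across machines, which is the property that makes this pattern attractive for large repositories.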
Main file
lirmm-00803917v1.pdf (219.45 KB)
Origin: Files produced by the author(s)

Dates and versions

lirmm-00803917, version 1 (01-11-2019)

Identifiers

Cite

Laurent Bonnet, Anne Laurent, Bénédicte Laurent, Michel Sala, Nicolas Sicard. Reduce, You Say: What NoSQL Can Do for Data Aggregation and BI in Large Repositories. DEXA 2011 - 22nd International Conference on Database and Expert Systems Applications, Aug 2011, Toulouse, France. pp.483-488, ⟨10.1109/DEXA.2011.71⟩. ⟨lirmm-00803917⟩
373 Views
976 Downloads
