Extending CloudMdsQL with MFR for Big Data Integration - LIRMM - Laboratoire d’Informatique, de Robotique et de Microélectronique de Montpellier Accéder directement au contenu
Communication Dans Un Congrès Année : 2016

Extending CloudMdsQL with MFR for Big Data Integration

Carlyna Bondiombouy
Boyan Kolev
Patrick Valduriez
Oleksandra Levchenko

Résumé

In this short paper (see [2] for the long version), we propose a functional SQL-like query language (based on Cloud-MdsQL) and query engine to retrieve data from two different kinds of data stores - an RDBMS and a distributed data processing framework such as Apache Spark or Hadoop MapReduce on top of HDFS - and combine them by applying data integration operators (mostly joins). However, users need to be aware of how data are organized across the data stores, so that they write valid queries. The query therefore contains embedded invocations to the underlying data stores, expressed as subqueries. As our query language is functional, it introduces a tight coupling between data and functions.
Fichier principal
Vignette du fichier
MFR-BDA2016.pdf (178.9 Ko) Télécharger le fichier
Origine : Fichiers produits par l'(les) auteur(s)
Loading...

Dates et versions

lirmm-01409104 , version 1 (05-12-2016)

Licence

Paternité - Pas d'utilisation commerciale - Pas de modification

Identifiants

  • HAL Id : lirmm-01409104 , version 1

Citer

Carlyna Bondiombouy, Boyan Kolev, Patrick Valduriez, Oleksandra Levchenko. Extending CloudMdsQL with MFR for Big Data Integration. BDA: Gestion de Données — Principes, Technologies et Applications, LIAS / ISAE-ENSMA, Poitiers, Nov 2016, Poitiers, France. ⟨lirmm-01409104⟩
413 Consultations
274 Téléchargements

Partager

Gmail Facebook X LinkedIn More