Integrating Big Data and Relational Data with a Functional SQL-like Query Language

Carlyna Bondiombouy 1 Boyan Kolev 1 Oleksandra Levchenko 1, 2 Patrick Valduriez 1
1 ZENITH - Scientific Data Management
LIRMM - Laboratoire d'Informatique de Robotique et de Microélectronique de Montpellier, CRISAM - Inria Sophia Antipolis - Méditerranée
Abstract : Multistore systems have been recently proposed to provide integrated access to multiple, heterogeneous data stores through a single query engine. In particular, much attention is being paid on the integration of unstructured big data typically stored in HDFS with relational data. One main solution is to use a relational query engine that allows SQL-like queries to retrieve data from HDFS, which requires the system to provide a relational view of the unstructured data and hence is not always feasible. In this paper, we introduce a functional SQL-like query language that can integrate data retrieved from different data stores and take full advantage of the functionality of the underlying data processing frameworks by allowing the ad-hoc usage of user defined map/filter/reduce operators in combination with traditional SQL statements. Furthermore, the query language allows for optimization by enabling subquery rewriting so that filter conditions can be pushed inside and executed at the data store as early as possible. Our approach is validated with two data stores and a representative query that demonstrates the usability of the query language and evaluates the benefits from query optimization.
Type de document :
Communication dans un congrès
Qiming Chen; Abdelkader Hameurlain; Farouk Toumani; Roland Wagner; Hendrik Decker. DEXA’2015: 26th International Conference on Database and Expert Systems Applications, Sep 2015, Valencia, Spain. Lecture Notes in Computer Science 9261, Springer 2015, ISBN 978-3-319-22848-8, 2015
Liste complète des métadonnées

https://hal-lirmm.ccsd.cnrs.fr/lirmm-01181242
Contributeur : Patrick Valduriez <>
Soumis le : mercredi 29 juillet 2015 - 15:46:34
Dernière modification le : jeudi 11 janvier 2018 - 17:01:53

Identifiants

  • HAL Id : lirmm-01181242, version 1

Collections

Citation

Carlyna Bondiombouy, Boyan Kolev, Oleksandra Levchenko, Patrick Valduriez. Integrating Big Data and Relational Data with a Functional SQL-like Query Language. Qiming Chen; Abdelkader Hameurlain; Farouk Toumani; Roland Wagner; Hendrik Decker. DEXA’2015: 26th International Conference on Database and Expert Systems Applications, Sep 2015, Valencia, Spain. Lecture Notes in Computer Science 9261, Springer 2015, ISBN 978-3-319-22848-8, 2015. 〈lirmm-01181242〉

Partager

Métriques

Consultations de la notice

242