Skip to Main content Skip to Navigation
Journal articles

CloudMdsQL: Querying Heterogeneous Cloud Data Stores with a Common Language

Boyan Kolev 1 Patrick Valduriez 1 Carlyna Bondiombouy 1 Ricardo Jiménez-Peris 2 Raquel Pau 3 José Orlando Pereira 4 
1 ZENITH - Scientific Data Management
LIRMM - Laboratoire d'Informatique de Robotique et de Microélectronique de Montpellier, CRISAM - Inria Sophia Antipolis - Méditerranée
Abstract : The blooming of different cloud data management infrastructures, special-ized for different kinds of data and tasks, has led to a wide diversification of DBMS interfaces and the loss of a common programming paradigm. In this paper, we present the design of a Cloud Multidatastore Query Language (CloudMdsQL), and its query engine. CloudMdsQL is a functional SQL-like language, capable of querying multiple heterogeneous data stores (relational and NoSQL) within a single query that may contain embedded invocations to each data store’s native query interface. The query engine has a fully distributed architecture, which provides important opportunities for optimization. The major innovation is that a CloudMdsQL query can exploit the full power of local data stores, by simply allowing some local data store native queries (e.g. a breadth-first search query against a graph database) to be called as functions, and at the same time be optimized, e.g. by pushing down select predicates, using bind join, performing join ordering, or planning intermediate data shipping. Our experi-mental validation, with three data stores (graph, document and relational) and repre-sentative queries, shows that CloudMdsQL satisfies the five important requirements for a cloud multidatastore query language.
Complete list of metadata

Cited literature [16 references]  Display  Hide  Download
Contributor : Patrick Valduriez Connect in order to contact the contributor
Submitted on : Wednesday, December 9, 2015 - 6:39:48 PM
Last modification on : Friday, August 5, 2022 - 3:03:28 PM
Long-term archiving on: : Friday, May 5, 2017 - 5:40:21 PM


Files produced by the author(s)



Boyan Kolev, Patrick Valduriez, Carlyna Bondiombouy, Ricardo Jiménez-Peris, Raquel Pau, et al.. CloudMdsQL: Querying Heterogeneous Cloud Data Stores with a Common Language. Distributed and Parallel Databases, Springer, 2016, 34 (4), pp.463-503. ⟨10.1007/s10619-015-7185-y⟩. ⟨lirmm-01184016⟩



Record views


Files downloads