Dynamic Workload-Based Partitioning Algorithms for Continuously Growing Databases

Miguel Liroz-Gistau 1 Reza Akbarinia 1 Esther Pacitti 1 Fabio Porto 2 Patrick Valduriez 1, 3
1 ZENITH - Scientific Data Management
LIRMM - Laboratoire d'Informatique de Robotique et de Microélectronique de Montpellier, CRISAM - Inria Sophia Antipolis - Méditerranée
Abstract : Applications with very large databases, where data items are continuously appended, are becoming more and more common. Thus, the development of efficient data partitioning is one of the main requirements to yield good performance. In the case of applications that have complex access patterns, e.g. scientific applications, workload-based partitioning could be exploited. However, existing workload-based approaches, which work in a static way, cannot be applied to very large databases. In this paper, we propose DynPart and DynPartGroup, two dynamic partitioning algorithms for continuously growing databases. These algorithms efficiently adapt the data partitioning to the arrival of new data elements by taking into account the affinity of new data with queries and fragments. In contrast to existing static approaches, our approach offers constant execution time, no matter the size of the database, while obtaining very good partitioning efficiency. We validated our solution through experimentation over real-world data; the results show its effectiveness.
Complete list of metadatas

Cited literature [17 references]  Display  Hide  Download

https://hal-lirmm.ccsd.cnrs.fr/lirmm-00906966
Contributor : Miguel Liroz-Gistau <>
Submitted on : Wednesday, November 20, 2013 - 3:30:56 PM
Last modification on : Monday, June 17, 2019 - 6:25:44 PM
Long-term archiving on: Friday, February 21, 2014 - 4:32:45 AM

File

tldks.pdf
Files produced by the author(s)

Identifiers

Collections

Citation

Miguel Liroz-Gistau, Reza Akbarinia, Esther Pacitti, Fabio Porto, Patrick Valduriez. Dynamic Workload-Based Partitioning Algorithms for Continuously Growing Databases. Transactions on Large-Scale Data- and Knowledge-Centered Systems, Springer Berlin / Heidelberg, 2013, LNCS (8320), pp.105-128. ⟨10.1007/978-3-642-45315-1_5⟩. ⟨lirmm-00906966⟩

Share

Metrics

Record views

953

Files downloads

1162