A Survey of Scheduling Frameworks in Big Data Systems

Ji Liu 1, 2 Esther Pacitti 1, 2 Patrick Valduriez 1, 2
1 ZENITH - Scientific Data Management
LIRMM - Laboratoire d'Informatique de Robotique et de Microélectronique de Montpellier, CRISAM - Inria Sophia Antipolis - Méditerranée
Abstract : Cloud and big data technologies are now converging to enable organizations to outsource data in the cloud and get value from data through big data analytics. Big data systems typically exploit computer clusters to gain scalability and obtain a good cost-performance ratio. However, scheduling a workload in a computer cluster remains a well-known open problem. Scheduling methods are typically implemented in a scheduling framework and may have different objectives. In this paper, we survey scheduling methods and frameworks for big data systems, propose a taxonomy and analyze the features of the different categories of scheduling frameworks. These frameworks have been designed initially for the cloud (MapReduce) to process Web data. We examine sixteen popular scheduling frameworks and discuss their features. Our study shows that different frameworks are proposed for different big data systems, different scales of computer clusters and different objectives. We propose the main dimensions for workloads and metrics for benchmarks to evaluate these scheduling frameworks. Finally, we analyze their limitations and propose new research directions.
Complete list of metadatas

Cited literature [62 references]  Display  Hide  Download

https://hal-lirmm.ccsd.cnrs.fr/lirmm-01692229
Contributor : Patrick Valduriez <>
Submitted on : Wednesday, January 24, 2018 - 7:10:28 PM
Last modification on : Friday, March 15, 2019 - 1:15:11 AM
Long-term archiving on : Thursday, May 24, 2018 - 10:18:38 PM

File

HALVersionSurvey.pdf
Files produced by the author(s)

Identifiers

Collections

Citation

Ji Liu, Esther Pacitti, Patrick Valduriez. A Survey of Scheduling Frameworks in Big Data Systems. International Journal of Cloud Computing, Inderscience Publishers, 2018, 7 (2), pp.103-128. ⟨10.1504/IJCC.2018.093765⟩. ⟨lirmm-01692229⟩

Share

Metrics

Record views

2512

Files downloads

1276