Scientific Workflow Scheduling with Provenance Data in a Multisite Cloud - LIRMM - Laboratoire d’Informatique, de Robotique et de Microélectronique de Montpellier
Journal Articles Transactions on Large-Scale Data- and Knowledge-Centered Systems Year : 2017

Scientific Workflow Scheduling with Provenance Data in a Multisite Cloud

Abstract

Recently, some Scientific Workflow Management Systems (SWfMSs) with provenance support (e.g. Chiron) have been deployed in the cloud. However, they typically use a single cloud site. In this paper, we consider a multisite cloud, where the data and computing resources are distributed at different sites (possibly in different regions). Based on a multisite architecture of SWfMS, i.e. multisite Chiron, and its provenance model, we propose a multisite task scheduling algorithm that considers the time to generate provenance data. We performed an extensive experimental evaluation of our algorithm using Microsoft Azure multisite cloud and two real-life scientific workflows (Buzz and Montage). The results show that our scheduling algorithm is up to 49.6% better than baseline algorithms in terms of total execution time.
Fichier principal
Vignette du fichier
TLDKS.pdf (1.93 Mo) Télécharger le fichier
Origin Files produced by the author(s)
Loading...

Dates and versions

lirmm-01620224 , version 1 (20-10-2017)

Identifiers

Cite

Ji Liu, Esther Pacitti, Patrick Valduriez, Marta Mattoso. Scientific Workflow Scheduling with Provenance Data in a Multisite Cloud. Transactions on Large-Scale Data- and Knowledge-Centered Systems, 2017, 33, pp.80-112. ⟨10.1109/IPDPS.2007.370305⟩. ⟨lirmm-01620224⟩
261 View
491 Download

Altmetric

Share

More