StreamCloud: An Elastic and Scalable Data Streaming System

Vincenzo Gulisano 1 Ricardo Jimenez-Peris 1 Marta Patino-Martínez 1 Claudio Soriente 1 Patrick Valduriez 2
2 ZENITH - Scientific Data Management
LIRMM - Laboratoire d'Informatique de Robotique et de Microélectronique de Montpellier, CRISAM - Inria Sophia Antipolis - Méditerranée
Abstract : Many applications in several domains such as telecommunications, network security, large-scale sensor networks, require online processing of continuous data flows. They produce very high loads that requires aggregating the processing capacity of many nodes. Current Stream Processing Engines do not scale with the input load due to single-node bottlenecks. Additionally, they are based on static configurations that lead to either under or overprovisioning. In this paper, we present StreamCloud, a scalable and elastic stream processing engine for processing large data stream volumes. StreamCloud uses a novel parallelization technique that splits queries into subqueries that are allocated to independent sets of nodes in a way that minimizes the distribution overhead. Its elastic protocols exhibit low intrusiveness, enabling effective adjustment of resources to the incoming load. Elasticity is combined with dynamic load balancing to minimize the computational resources used. The paper presents the system design, implementation, and a thorough evaluation of the scalability and elasticity of the fully implemented system.
Type de document :
Article dans une revue
IEEE Transactions on Parallel and Distributed Systems, Institute of Electrical and Electronics Engineers, 2012, 23 (12), pp.2351-2365. 〈https://www.computer.org/csdl/trans/td/2012/12/ttd2012122351-abs.html〉. 〈10.1109/TPDS.2012.24〉
Liste complète des métadonnées

https://hal-lirmm.ccsd.cnrs.fr/lirmm-00748992
Contributeur : Patrick Valduriez <>
Soumis le : mardi 6 novembre 2012 - 14:02:57
Dernière modification le : jeudi 24 mai 2018 - 15:59:21

Lien texte intégral

Identifiants

Collections

Citation

Vincenzo Gulisano, Ricardo Jimenez-Peris, Marta Patino-Martínez, Claudio Soriente, Patrick Valduriez. StreamCloud: An Elastic and Scalable Data Streaming System. IEEE Transactions on Parallel and Distributed Systems, Institute of Electrical and Electronics Engineers, 2012, 23 (12), pp.2351-2365. 〈https://www.computer.org/csdl/trans/td/2012/12/ttd2012122351-abs.html〉. 〈10.1109/TPDS.2012.24〉. 〈lirmm-00748992〉

Partager

Métriques

Consultations de la notice

397