Critical path based performance models for distributed queries, pp.2012-121, 2012. ,
Reining in the outliers in map-reduce clusters using mantri, USENIX Symp. on Operating Systems Design and Implementation (OSDI), pp.265-278, 2010. ,
Real-Time Big Data Analytics: Emerging Architecture. O'Reilly Media, 2013. ,
Apollo: Scalable and coordinated scheduling for cloudscale computing, USENIX Symp. on Operating Systems Design and Implementation (OSDI), pp.285-300, 2014. ,
Fault Tolerance and High Availability in Data Stream Management Systems, Encyclopedia of Database Systems, pp.1109-1115, 2009. ,
DOI : 10.1145/224056.224070
Operator Scheduling in a Data Stream Manager, Int. Conf. on Very Large Data Bases (VLDB), pp.838-849, 2003. ,
DOI : 10.1016/B978-012722442-8/50079-3
Reservation-based scheduling: If you're late don't blame us, In ACM Symp. on Cloud Computing (SOCC), vol.2, pp.1-2, 2014. ,
Streaming Queries over Streaming Data, Int. Conf. on Very Large Data Bases (VLDB), pp.203-214, 2002. ,
DOI : 10.1016/B978-155860869-6/50026-3
URL : http://www.cs.berkeley.edu/~franklin/Papers/psoupVLDB02.pdf
The Case for Evaluating MapReduce Performance Using Workload Suites, 2011 IEEE 19th Annual International Symposium on Modelling, Analysis, and Simulation of Computer and Telecommunication Systems, pp.390-399, 2011. ,
DOI : 10.1109/MASCOTS.2011.12
Scope: Easy and efficient parallel processing of massive data sets, Proceedings of the VLDB Endowment (PVLDB), pp.1265-1276, 2008. ,
MapReduce, Communications of the ACM, vol.51, issue.1, pp.107-113, 2008. ,
DOI : 10.14293/S2199-1006.1.SOR-UNCAT.AUNHT8.v1.RBZFIB
Tarcil, Proceedings of the Sixth ACM Symposium on Cloud Computing, SoCC '15, pp.97-110, 2015. ,
DOI : 10.1145/2485922.2485974
URL : http://dspace.mit.edu/bitstream/1721.1/111979/1/Sanchez.tarcil.pdf
Jockey: Guaranteed job latency in data parallel clusters, ACM European Conf. on Computer Systems (EuroSys), pp.99-112, 2012. ,
Locality aware task scheduling in parallel data stream processing, Int. Symp. on Intelligent Distributed Computing (IDC), pp.331-342, 2014. ,
SmartNet: a scheduling framework for heterogeneous computing, Proceedings Second International Symposium on Parallel Architectures, Algorithms, and Networks (I-SPAN'96), pp.514-521, 1996. ,
DOI : 10.1109/ISPAN.1996.509034
URL : https://calhoun.nps.edu/bitstream/10945/35380/1/SmartNet_Kidd_Moore.pdf
Metrics and benchmarking for parallel job scheduling, Job Scheduling Strategies for Parallel Processing, pp.1-24, 1998. ,
DOI : 10.1007/BFb0053978
URL : http://blog.esac.cnic.cn/uploadfiles/2007-9/910794483.pdf
Task scheduling in data stream processing, Dateso: Annual Int. Workshop on Databases, pp.85-96, 2011. ,
Multiresource packing for cluster schedulers, ACM Conf. on Special Interest Group on Data Communication (SIGCOMM), pp.455-466, 2014. ,
The google file system, ACM Symp. on Operating Systems Principles (SOSP), pp.29-43, 2003. ,
DOI : 10.1145/945445.945450
Do the hard stuff first: Scheduling dependent computations in data-analytics clusters, 2016. ,
A survey on job scheduling algorithms in Big data processing, 2015 IEEE International Conference on Electrical, Computer and Communication Technologies (ICECCT), pp.1-11, 2015. ,
DOI : 10.1109/ICECCT.2015.7226035
Mesos: A platform for fine-grained resource sharing in the data center, USENIX Conf. on Networked Systems Design and Implementation (NSDI), pp.295-308, 2011. ,
Matchmaking: A New MapReduce Scheduling Technique, 2011 IEEE Third International Conference on Cloud Computing Technology and Science, pp.40-47, 2011. ,
DOI : 10.1109/CloudCom.2011.16
Dryad: distributed data-parallel programs from sequential building blocks, ACM European Conf. on Computer Systems (EuroSys), pp.59-72, 2007. ,
Quincy, Proceedings of the ACM SIGOPS 22nd symposium on Operating systems principles, SOSP '09, pp.261-276, 2009. ,
DOI : 10.1145/1629575.1629601
Understanding Big Data: Analytics for Enterprise Class Hadoop and Streaming Data, 2011. ,
Network-aware scheduling for data-parallel jobs: Plan when you can, ACM Conf. on Special Interest Group on Data Communication (SIGCOMM), pp.407-420, 2015. ,
DOI : 10.1145/2785956.2787488
Morpheus: Towards automated SLOs for enterprise clusters, USENIX Symp. on Operating Systems Design and Implementation (OSDI), pp.117-134, 2016. ,
Near-optimal scheduling mechanisms for deadline-sensitive jobs in large computing clusters, ACM Transactions on Parallel Computing (TOPC), vol.2, issue.1, p.3, 2015. ,
DOI : 10.1145/2312005.2312051
Twitter heron: Stream processing at scale, ACM SIGMOD Int. Conf. on Management of Data (SIGMOD), pp.239-250, 2015. ,
Virtualizing traffic shapers for practical resource allocation, USENIX Workshop on Hot Topics in Cloud Computing (HotCloud), 2013. ,
Lambda architecture for cost-effective batch and speed big data processing, IEEE Int. Conf. on Big Data (Big Data), pp.2785-2792, 2015. ,
Mercury: Hybrid centralized and distributed scheduling in large shared clusters, USENIX Annual Technical Conf. (USENIX ATC 15), pp.485-497, 2015. ,
Heracles: Improving resource efficiency at scale, Annual Int. Symp. on Computer Architecture (ISCA), pp.450-462, 2015. ,
Multiobjective scheduling of scientific workflows in multisite clouds, Future Generation Computer System, vol.63, pp.76-95, 2016. ,
A Survey of Data-Intensive Scientific Workflow Management, Journal of Grid Computing, vol.1, issue.Webserver-Issue, pp.457-493, 2015. ,
DOI : 10.1109/SERVICES-1.2008.79
URL : https://hal.archives-ouvertes.fr/lirmm-01144760
Scientific Workflow Scheduling with Provenance Support in Multisite Cloud, Int. Meeting on High-Performance Computing for Computational Science (VECPAR), p.8, 2016. ,
DOI : 10.1145/1084805.1084816
URL : https://hal.archives-ouvertes.fr/lirmm-01342190
Not for the timid: On the impact of aggressive over-booking in the cloud, Int. Conf. on Very Large Data Bases (VLDB), pp.1-12, 2016. ,
Machine Learning, 1997. ,
On-demand, spot, or both: Dynamic resource allocation for executing batch jobs in the cloud, Int. Conf. on Autonomic Computing (ICAC), pp.177-187, 2014. ,
Towards workflow scheduling in cloud computing: A comprehensive analysis, Journal of Network and Computer Applications, vol.66, pp.64-82, 2016. ,
DOI : 10.1016/j.jnca.2016.01.018
The quantcast file system, Proceedings of the VLDB Endowment (PVLDB), pp.1092-1101, 2013. ,
Principles of Distributed Database Systems, 2011. ,
Sparrow, Proceedings of the Twenty-Fourth ACM Symposium on Operating Systems Principles, SOSP '13, pp.69-84, 2013. ,
DOI : 10.1145/2517349.2522716
Low latency geo-distributed data analytics, ACM Conf. on Special Interest Group on Data Communication (SIGCOMM), pp.421-434, 2015. ,
Scaling network verification using symmetry and surgery, Annual ACM SIGPLAN-SIGACT Symp. on Principles of Programming Languages (POPL), pp.69-83, 2016. ,
A comparison of approaches to large-scale data analysis, ACM SIGMOD Int. Conf. on Management of Data (SIGMOD), pp.165-178, 2009. ,
Kubernetes -scheduling the future at cloud scale, In O'Reilly Open Source CONvention, p.2015 ,
Efficient queue management for cluster scheduling, Proceedings of the Eleventh European Conference on Computer Systems, EuroSys '16, pp.1-15, 2016. ,
DOI : 10.1145/2741948.2741964
URL : http://dl.acm.org/ft_gateway.cfm?id=2901354&type=pdf
Article: Survey on improved scheduling in hadoop mapreduce in cloud environments, Int. Journal of Computer Applications, vol.34, issue.9, pp.29-33, 2011. ,
Heterogeneity and dynamicity of clouds at scale, Proceedings of the Third ACM Symposium on Cloud Computing, SoCC '12, pp.1-7, 2012. ,
DOI : 10.1145/2391229.2391236
Reevaluating Amdahl???s law in the multicore era, Journal of Parallel and Distributed Computing, vol.70, issue.2, pp.183-188, 2010. ,
DOI : 10.1016/j.jpdc.2009.05.002
Omega, Proceedings of the 8th ACM European Conference on Computer Systems, EuroSys '13, pp.351-364, 2013. ,
DOI : 10.1145/2465351.2465386
Article: A survey on big data management and job scheduling, Int. Journal of Computer Applications, vol.130, issue.13, pp.41-49, 2015. ,
A Survey of Large Scale Data Management Approaches in Cloud Environments, IEEE Communications Surveys & Tutorials, vol.13, issue.3, pp.311-336, 2011. ,
DOI : 10.1109/SURV.2011.032211.00087
URL : https://hal.archives-ouvertes.fr/inria-00623093
Shared-nothing architecture, Encyclopedia of Database Systems, pp.2638-2639, 2009. ,
Apache hadoop yarn: Yet another resource negotiator, Annual Symp. on Cloud Computing (SOCC), pp.1-516, 2013. ,
A Task Scheduling Algorithm for Hadoop Platform, Journal of Computers, vol.8, issue.4, pp.929-936, 2013. ,
DOI : 10.4304/jcp.8.4.929-936
A comparative review of job scheduling for MapReduce, 2011 IEEE International Conference on Cloud Computing and Intelligence Systems, pp.353-358, 2011. ,
DOI : 10.1109/CCIS.2011.6045089
Delay scheduling: A simple technique for achieving locality and fairness in cluster scheduling, ACM European Conf. on Computer Systems (EuroSys), pp.265-278, 2010. ,
Spark: Cluster computing with working sets, USENIX Conf. on Hot Topics in Cloud Computing (HotCloud), pp.10-10, 2010. ,
Fuxi: a faulttolerant resource management and job scheduling system at internet scale, Proceedings of the VLDB Endowment (PVLDB), pp.1393-1404, 2014. ,