,
,
,
, Azure Speed Test
, Chameleon
,
,
,
,
, USGS ANSS -Advanced National Seismic System
Parallel I/O and the metadata wall Characterization of scientific workflows, Workshop on Parallel Data Storage (PDSW) Workshop on WFs in Support of Large-Scale Science, pp.13-18, 2008. ,
Efficient metadata management in large distributed storage systems, 20th IEEE/11th NASA Goddard Conference on Mass Storage Systems and Technologies, 2003. (MSST 2003). Proceedings., pp.290-298, 2003. ,
DOI : 10.1109/MASS.2003.1194865
The Vesta parallel file system, ACM Transactions on Computer Systems, vol.14, issue.3, pp.225-264, 1996. ,
DOI : 10.1145/233557.233558
Managing Large-Scale Workflow Execution from Resource Provisioning to Provenance Tracking: The CyberShake Example, 2006 Second IEEE International Conference on e-Science and Grid Computing (e-Science'06), pp.14-14, 2006. ,
DOI : 10.1109/E-SCIENCE.2006.261098
The cost of doing science on the cloud: The Montage example, 2008 SC, International Conference for High Performance Computing, Networking, Storage and Analysis, pp.1-12, 2008. ,
DOI : 10.1109/SC.2008.5217932
Pegasus: A Framework for Mapping Complex Scientific Workflows onto Distributed Systems, Scientific Programming, pp.219-237, 2005. ,
DOI : 10.1155/2005/128026
Algebraic dataflows for big data analysis, 2013 IEEE International Conference on Big Data, pp.150-155, 2013. ,
DOI : 10.1109/BigData.2013.6691567
Efficient querying of distributed provenance stores, Proceedings of the 19th ACM International Symposium on High Performance Distributed Computing, HPDC '10, pp.613-621, 2010. ,
DOI : 10.1145/1851476.1851567
The Google file system, ACM SIGOPS Operating Systems Review, vol.37, issue.5, pp.29-43, 2003. ,
DOI : 10.1145/1165389.945450
Efficient identification of hot data for flash memory storage systems, ACM Transactions on Storage, vol.2, issue.1, pp.22-40, 2006. ,
DOI : 10.1145/1138041.1138043
GreedyDual??? Web caching algorithm: exploiting the two sources of temporal locality in Web request streams, Computer Communications, vol.24, issue.2, pp.174-183, 2001. ,
DOI : 10.1016/S0140-3664(00)00312-1
Characterizing and profiling scientific workflows, Future Generation Computer Systems, vol.29, issue.3, pp.682-692, 2013. ,
DOI : 10.1016/j.future.2012.08.015
Spyglass: Fast, scalable metadata search for large-scale storage systems, USENIX Conf. on File and Storage Technologies (FAST), pp.153-166, 2009. ,
Identifying hot and cold data in main-memory databases, 2013 IEEE 29th International Conference on Data Engineering (ICDE), pp.26-37, 2013. ,
DOI : 10.1109/ICDE.2013.6544811
A Survey of Data-Intensive Scientific Workflow Management, Journal of Grid Computing, vol.1, issue.Webserver-Issue, pp.457-493, 2015. ,
DOI : 10.1109/SERVICES-1.2008.79
URL : https://hal.archives-ouvertes.fr/lirmm-01144760
Mat- toso. Scientific workflow scheduling with provenance data in a multisite cloud, Transactions on Large-Scale Data-and Knowledge- Centered Systems (TLDKS), pp.80-112, 2016. ,
Multi-objective scheduling of Scientific Workflows in multisite clouds, Future Generation Computer Systems, vol.63, pp.76-95, 2016. ,
DOI : 10.1016/j.future.2016.04.014
URL : https://hal.archives-ouvertes.fr/lirmm-01342203
Tracking and sketching distributed data provenance ARC: A selftuning , low overhead replacement cache, Int. Conf. on e-Science USENIX Conf. on File and Storage Technologies (FAST), pp.190-197, 2003. ,
RAMA: An easy-to-use, high-performance parallel file system, Parallel Computing, vol.23, issue.4, pp.419-446 ,
An algebraic approach for data-centric scientific workflows, Proceedings of the VLDB Endowment (PVLDB), pp.1328-1339, 2011. ,
URL : https://hal.archives-ouvertes.fr/hal-00640431
Chiron: a parallel engine for algebraic scientific workflows, Concurrency and Computation: Practice and Experience, pp.252327-2341, 2013. ,
DOI : 10.1109/eScience.2008.62
URL : https://hal.archives-ouvertes.fr/lirmm-00806557
Principles of Distributed Database Systems, 2011. ,
URL : https://hal.archives-ouvertes.fr/hal-00483354
Towards Multi-site Metadata Management for Geographically Distributed Cloud Workflows, 2015 IEEE International Conference on Cluster Computing, pp.294-303, 2015. ,
DOI : 10.1109/CLUSTER.2015.49
URL : https://hal.archives-ouvertes.fr/hal-01239150
Managing hot metadata for scientific workflows on multisite clouds, 2016 IEEE International Conference on Big Data (Big Data), pp.390-397, 2016. ,
DOI : 10.1109/BigData.2016.7840628
URL : https://hal.archives-ouvertes.fr/hal-01395715
Theoretical Framework for Eliminating Redundancy in Workflows, 2009 IEEE International Conference on Services Computing, pp.41-48, 2009. ,
DOI : 10.1109/SCC.2009.19
GPFS: A shareddisk file system for large computing clusters, USENIX Conf. on File and Storage Technologies (FAST), pp.231-244, 2002. ,
"One Size Fits All": An Idea Whose Time Has Come and Gone, 21st International Conference on Data Engineering (ICDE'05), pp.2-11, 2005. ,
DOI : 10.1109/ICDE.2005.1
The end of an architectural era: Time for a complete rewrite, Int. Conf. on Very Large Data Bases (VLDB), pp.1150-1160 ,
CalvinFS: consistent wan replication and scalable metadata management for distributed file systems, USENIX Conf. on File and Storage Technologies (FAST), pp.1-14, 2015. ,
Indexing multi-dimensional data in a cloud system, Proceedings of the 2010 international conference on Management of data, SIGMOD '10, pp.591-602, 2010. ,
DOI : 10.1145/1807167.1807232
Swift/t: scalable data flow programming for many-task applications, ACM SIGPLAN Symp. on Principles and Practice of Parallel Programming, pp.309-310, 2013. ,
Efficient B-tree based indexing for cloud data processing, Proceedings of the VLDB Endowment (PVLDB), pp.1207-1218, 2010. ,
DOI : 10.14778/1920841.1920991
Spark: Cluster computing with working sets, USENIX Workshop on Hot Topics in Cloud Computing (Hot- Cloud), pp.10-10, 2010. ,
Spade: An efficient algorithm for mining frequent sequences, Machine Learning, pp.31-60 ,
Distributed data provenance for large-scale data-intensive computing, 2013 IEEE International Conference on Cluster Computing (CLUSTER), pp.1-8, 2013. ,
DOI : 10.1109/CLUSTER.2013.6702685
FusionFS: Toward supporting data-intensive scientific applications on extreme-scale high-performance computing systems, 2014 IEEE International Conference on Big Data (Big Data), pp.61-70, 2014. ,
DOI : 10.1109/BigData.2014.7004214