Scientific Data Analysis Using Data-Intensive Scalable Computing: the SciDISC Project

Abstract : Data-intensive science requires the integration of two fairly different paradigms: high-performance computing (HPC) and data-intensive scalable computing (DISC), as exemplified by frameworks such as Hadoop and Spark. In this context, the SciDISC project addresses the grand challenge of scientific data analysis using DISC, by developing architectures and methods to combine simulation and data analysis. SciDISC is an ongoing project between Inria, several research institutions in Rio de Janeiro and NYU. This paper introduces the motivations and objectives of the project, and reports on the first results achieved so far.
Complete list of metadatas

Cited literature [29 references]  Display  Hide  Download

https://hal-lirmm.ccsd.cnrs.fr/lirmm-01867804
Contributor : Patrick Valduriez <>
Submitted on : Tuesday, September 4, 2018 - 4:09:22 PM
Last modification on : Wednesday, August 14, 2019 - 10:46:03 AM
Long-term archiving on : Wednesday, December 5, 2018 - 5:48:45 PM

File

ldas 2018 - scidisc.pdf
Files produced by the author(s)

Identifiers

  • HAL Id : lirmm-01867804, version 1

Collections

Citation

Patrick Valduriez, Marta Mattoso, Reza Akbarinia, Heraldo Borges, José Camata, et al.. Scientific Data Analysis Using Data-Intensive Scalable Computing: the SciDISC Project. LADaS: Latin America Data Science Workshop, Aug 2018, Rio de Janeiro, Brazil. ⟨lirmm-01867804⟩

Share

Metrics

Record views

595

Files downloads

231