Conference Papers, Year: 2017

TARS: An Array Model with Rich Semantics for Multidimensional Data

Hermano Lustosa, Noel Lemus, Fabio Porto, Patrick Valduriez

Abstract

Relational DBMSs have been shown to be inefficient for scientific data management. One main reason is the difficulty of representing arrays, which are frequently adopted as a data model for scientific datasets. Array DBMSs, e.g. SciDB, were proposed to bridge this gap, building on a native array representation. Unfortunately, important scientific applications, such as numerical simulation, have additional requirements, in particular the need to deal with mesh topology and geometry. First, transforming simulation result datasets into a DBMS array format incurs huge latency, due to the fixed layouts of array DBMSs and the data transformations needed to adapt to mesh data characteristics. Second, simulation applications require data visualization or uncertainty quantification (UQ) computation, both of which require metadata beyond the simulation output array. To address these problems, we propose a novel data model called TARS (Typed ARray Schema), which extends the basic array data model with typed arrays. In TARS, support for application-dependent data characteristics, such as those needed for data visualization and UQ computation, is provided through the definition of TAR objects, ready to be manipulated by TAR operators. This approach provides considerable flexibility for capturing internal data layouts through mapping functions, which makes data ingestion independent of how the simulation data was produced, thus minimizing ingestion time. In this paper, we present the TARS data model and illustrate its use in the context of a numerical simulation application.
Main file
TARS-paper19.pdf (992.39 KB)

Dates and versions

lirmm-01620376, version 1 (20-10-2017)

Identifiers

  • HAL Id: lirmm-01620376, version 1

Cite

Hermano Lustosa, Noel Lemus, Fabio Porto, Patrick Valduriez. TARS: An Array Model with Rich Semantics for Multidimensional Data. Forum and Demos at ER, Nov 2017, Valencia, Spain. pp.114-127. ⟨lirmm-01620376⟩