Implementation and Efficiency of Reproducible Level 1 BLAS - LIRMM - Laboratoire d’Informatique, de Robotique et de Microélectronique de Montpellier Access content directly
Reports (Research Report) Year : 2015

Implementation and Efficiency of Reproducible Level 1 BLAS

Abstract

Numerical reproducibility failures appear in massively parallel floating-point computations. One way to guarantee this reproducibility is to extend the IEEE-754 correct rounding to larger computing sequences, e.g. to the BLAS. Is the extra cost for numerical reproducibility acceptable in practice? We present solutions and experiments for the level 1 BLAS. We detail optimized implementations and we conclude about their efficiency.
Fichier principal
Vignette du fichier
15 pages.pdf (453.11 Ko) Télécharger le fichier
Origin Files produced by the author(s)
Loading...

Dates and versions

lirmm-01179986 , version 1 (23-07-2015)

Identifiers

  • HAL Id : lirmm-01179986 , version 1

Cite

Chemseddine Chohra, Philippe Langlois, David Parello. Implementation and Efficiency of Reproducible Level 1 BLAS. [Research Report] DALI - UPVD/LIRMM, UCD. 2015. ⟨lirmm-01179986⟩
239 View
678 Download

Share

Gmail Mastodon Facebook X LinkedIn More