Implementation and Efficiency of Reproducible Level 1 BLAS - LIRMM - Laboratoire d’Informatique, de Robotique et de Microélectronique de Montpellier
Reports (Research Report) Year : 2015

Implementation and Efficiency of Reproducible Level 1 BLAS

Abstract

Numerical reproducibility failures appear in massively parallel floating-point computations. One way to guarantee this reproducibility is to extend the IEEE-754 correct rounding to larger computing sequences, e.g. to the BLAS. Is the extra cost for numerical reproducibility acceptable in practice? We present solutions and experiments for the level 1 BLAS. We detail optimized implementations and we conclude about their efficiency.
Fichier principal
Vignette du fichier
15 pages.pdf (453.11 Ko) Télécharger le fichier
Origin Files produced by the author(s)
Loading...

Dates and versions

lirmm-01179986 , version 1 (23-07-2015)

Identifiers

  • HAL Id : lirmm-01179986 , version 1

Cite

Chemseddine Chohra, Philippe Langlois, David Parello. Implementation and Efficiency of Reproducible Level 1 BLAS. [Research Report] DALI - UPVD/LIRMM, UCD. 2015. ⟨lirmm-01179986⟩
258 View
692 Download

Share

More