Implementation and Efficiency of Reproducible Level 1 BLAS - Archive ouverte HAL Access content directly
Reports (Research Report) Year : 2015

Implementation and Efficiency of Reproducible Level 1 BLAS

(1) , (1) , (1)
1

Abstract

Numerical reproducibility failures appear in massively parallel floating-point computations. One way to guarantee this reproducibility is to extend the IEEE-754 correct rounding to larger computing sequences, e.g. to the BLAS. Is the extra cost for numerical reproducibility acceptable in practice? We present solutions and experiments for the level 1 BLAS. We detail optimized implementations and we conclude about their efficiency.
Fichier principal
Vignette du fichier
15 pages.pdf (453.11 Ko) Télécharger le fichier
Origin : Files produced by the author(s)
Loading...

Dates and versions

lirmm-01179986 , version 1 (23-07-2015)

Identifiers

  • HAL Id : lirmm-01179986 , version 1

Cite

Chemseddine Chohra, Philippe Langlois, David Parello. Implementation and Efficiency of Reproducible Level 1 BLAS. [Research Report] DALI - UPVD/LIRMM, UCD. 2015. ⟨lirmm-01179986⟩
230 View
640 Download

Share

Gmail Facebook Twitter LinkedIn More