
Implementation and Efficiency of Reproducible Level 1 BLAS

Chemseddine Chohra 1 Philippe Langlois 1 David Parello 1
1 DALI - Digits, Architectures et Logiciels Informatiques
LIRMM - Laboratoire d'Informatique de Robotique et de Microélectronique de Montpellier, UPVD - Université de Perpignan Via Domitia
Abstract : Numerical reproducibility failures appear in massively parallel floating-point computations. One way to guarantee reproducibility is to extend IEEE-754 correct rounding to longer computing sequences, e.g. to the BLAS routines. Is the extra cost of numerical reproducibility acceptable in practice? We present solutions and experiments for the level 1 BLAS. We detail optimized implementations and draw conclusions about their efficiency.
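As an illustration of the failure the abstract refers to, the following minimal Python sketch sums the same four values in two orders, as a parallel reduction might. It is not the authors' implementation: `naive_sum`, `order_a` and `order_b` are hypothetical names chosen for this example, and the standard-library `math.fsum` stands in for a correctly rounded summation only to show why correct rounding makes the result independent of the order.

```python
import math


def naive_sum(values):
    """Left-to-right floating-point accumulation, one rounding per addition."""
    total = 0.0
    for v in values:
        total += v
    return total


# The same four values in two different orders (illustrative data only).
order_a = [1e16, 1.0, -1e16, 1.0]
order_b = [1.0, 1.0, 1e16, -1e16]

# Naive accumulation depends on the order of the operands.
naive_a = naive_sum(order_a)
naive_b = naive_sum(order_b)

# math.fsum returns the correctly rounded sum, so the order does not matter.
exact_a = math.fsum(order_a)
exact_b = math.fsum(order_b)

print(naive_a, naive_b)
print(exact_a, exact_b)
```

With IEEE-754 binary64 arithmetic, the two naive results differ (1.0 and 2.0), while the correctly rounded sum is 2.0 for both orders. The question the report raises is what such a guarantee costs at the scale of the level 1 BLAS.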

Cited literature [13 references]

Contributor : Philippe Langlois
Submitted on : Thursday, July 23, 2015 - 4:43:08 PM
Last modification on : Friday, October 22, 2021 - 3:07:35 PM
Long-term archiving on : Saturday, October 24, 2015 - 11:44:20 AM


15 pages.pdf
Files produced by the author(s)


  • HAL Id : lirmm-01179986, version 1



Chemseddine Chohra, Philippe Langlois, David Parello. Implementation and Efficiency of Reproducible Level 1 BLAS. [Research Report] DALI - UPVD/LIRMM, UCD. 2015. ⟨lirmm-01179986⟩


