Implementation and Efficiency of Reproducible Level 1, BLAS, 2015. ,
URL : https://hal.archives-ouvertes.fr/lirmm-01179986
Reproducible and Accurate Matrix Multiplication in ExBLAS for High-Performance Computing, 2014. ,
A floating-point technique for extending the available precision, Numerische Mathematik, vol.5, issue.3, pp.224-242, 1971. ,
DOI : 10.1007/BF01397083
Fast Reproducible Floating-Point Summation, 2013 IEEE 21st Symposium on Computer Arithmetic, 2013. ,
DOI : 10.1109/ARITH.2013.9
Toward hardware support for Reproducible Floating-Point Computation, 2014. ,
-Vectors, ACM Transactions on Mathematical Software, vol.41, issue.4, pp.1-2420, 2015. ,
DOI : 10.1145/2699469
URL : https://hal.archives-ouvertes.fr/hal-00309608
Handbook of Floating-Point Arithmetic, 2010. ,
DOI : 10.1007/978-0-8176-4705-6
URL : https://hal.archives-ouvertes.fr/ensl-00379167
Fast exact summation using small and large superaccumulators, 2015. ,
ReproBLAS: Reproducible BLAS ,
Accurate Sum and Dot Product, SIAM Journal on Scientific Computing, vol.26, issue.6, pp.1955-1988, 2005. ,
DOI : 10.1137/030601818
Ultimately Fast Accurate Summation, SIAM Journal on Scientific Computing, vol.31, issue.5, pp.3466-3502, 2009. ,
DOI : 10.1137/080738490
Run-to-Run Numerical Reproducibility with the Intel Math Kernel Library and Intel Composer XE 2013, Tech. rep., Intel Corporation, 2013. ,
A parallel algorithm for accurate dot product, Parallel Computing, vol.34, issue.6-8, pp.392-410, 2008. ,
DOI : 10.1016/j.parco.2008.02.002
Correct Rounding and a Hybrid Approach to Exact Floating-Point Summation, SIAM Journal on Scientific Computing, vol.31, issue.4, pp.2981-3001, 2009. ,
DOI : 10.1137/070710020
Algorithm 908, ACM Transactions on Mathematical Software, vol.37, issue.3, pp.1-3713, 2010. ,
DOI : 10.1145/1824801.1824815