S. Collange, D. Defour, S. Graillat, and R. Iakimchuk, Reproducible and Accurate Matrix Multiplication in ExBLAS for High-Performance Computing, 2014.

T. J. Dekker, A floating-point technique for extending the available precision, Numerische Mathematik, vol.5, issue.3, pp.224-242, 1971.
DOI : 10.1007/BF01397083

J. W. Demmel and H. D. Nguyen, Fast Reproducible Floating-Point Summation, 2013 IEEE 21st Symposium on Computer Arithmetic, 2013.
DOI : 10.1109/ARITH.2013.9
URL : http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.309.6586

F. Jézéquel, P. Langlois, and N. Revol, First steps towards more numerical reproducibility, ESAIM: Proceedings and Surveys, vol.45, pp.229-238, 2013.
DOI : 10.1051/proc/201445023

J. M. Muller, N. Brisebarre, F. De-dinechin, C. P. Jeannerod, V. Lefèvre et al., Handbook of Floating-Point Arithmetic, 2010.
DOI : 10.1007/978-0-8176-4705-6
URL : https://hal.archives-ouvertes.fr/ensl-00379167

T. Ogita, S. M. Rump, and S. Oishi, Accurate Sum and Dot Product, SIAM Journal on Scientific Computing, vol.26, issue.6, pp.1955-1988, 2005.
DOI : 10.1137/030601818

J. Reinders, Intel Threading Building Blocks, 2007.

S. M. Rump, T. Ogita, and S. Oishi, Accurate Floating-Point Summation Part I: Faithful Rounding, SIAM Journal on Scientific Computing, vol.31, issue.1, pp.189-224, 2008.
DOI : 10.1137/050645671
URL : http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.519.3738

S. Story, Numerical reproducibility in the Intel Math Kernel Library, 2012.

F. G. Van-zee and R. A. Van-de-geijn, BLIS: A framework for rapidly instantiating BLAS functionality, ACM Transactions on Mathematical Software, vol.41, issue.3

N. Yamanaka, T. Ogita, S. Rump, and S. Oishi, A parallel algorithm for accurate dot product, Parallel Computing, vol.34, issue.6-8, pp.6-8, 2008.
DOI : 10.1016/j.parco.2008.02.002

Y. K. Zhu and W. B. Hayes, Correct Rounding and a Hybrid Approach to Exact Floating-Point Summation, SIAM Journal on Scientific Computing, vol.31, issue.4, pp.2981-3001, 2009.
DOI : 10.1137/070710020

Y. K. Zhu and W. B. Hayes, Algorithm 908, ACM Transactions on Mathematical Software, vol.37, issue.3, pp.1-3713, 2010.
DOI : 10.1145/1824801.1824815