Multiple-base Logarithmic Quantization and Application in Reduced Precision AI Computations

Vassil Dimitrov; Richard Ford; Laurent Imbert; Arjuna Madanayake; Nilan Udayanga; Will Wray

doi:10.1109/ARITH61463.2024.00017

Communication Dans Un Congrès Année : 2024

Multiple-base Logarithmic Quantization and Application in Reduced Precision AI Computations

(1, 2) , (2) , (3, 2) , (2) , (2) , (2)

1
2
3

Vassil Dimitrov

Fonction : Auteur

University of Calgary

Lemurian Labs

Richard Ford

Fonction : Auteur

Lemurian Labs

Laurent Imbert

Fonction : Auteur correspondant
PersonId : 6246
IdHAL : laurent-imbert
ORCID : 0000-0001-9362-2869
IdRef : 157640620

Connectez-vous pour contacter l'auteur

Exact Computing

Lemurian Labs

Arjuna Madanayake

Fonction : Auteur

Lemurian Labs

Nilan Udayanga

Fonction : Auteur

Lemurian Labs

Will Wray

Fonction : Auteur

Lemurian Labs

Résumé

The power of logarithmic quantizations and computations has been recognized as a useful tool in optimizing the performance of large ML models. In this article, we provide results that demonstrate significantly better quantization signal-to-noise ratio performance thanks to multiple-base logarithmic number systems (MDLNS) in comparison with the floating-point quantizations that use the same number of bits. On a hardware level, we present details about our Xilinx VCU-128 FPGA design for dot product and matrix-vector computations. The MDLNS matrix-vector design significantly outperforms equivalent fixed-point binary designs in terms of area (A) and time (T) complexity and power consumption as evidenced by a 4x scaling of AT2 metric for VLSI performance, and 57% increase in computational throughput per watt compared to fixed-point arithmetic.

Mots clés

Measurement Quantization (signal) Power demand Very large scale integration Throughput Digital arithmetic Hardware

Domaines

Arithmétique des ordinateurs Intelligence artificielle [cs.AI]

Fichier principal

main.pdf (755.55 Ko)

Origine	Fichiers produits par l'(les) auteur(s)

Laurent Imbert : Connectez-vous pour contacter le contributeur

https://hal-lirmm.ccsd.cnrs.fr/lirmm-04638183

Soumis le : lundi 8 juillet 2024-10:55:13

Dernière modification le : jeudi 7 novembre 2024-16:14:03

Dates et versions

lirmm-04638183 , version 1 (08-07-2024)

Identifiants

HAL Id : lirmm-04638183 , version 1
DOI : 10.1109/ARITH61463.2024.00017

Citer

Vassil Dimitrov, Richard Ford, Laurent Imbert, Arjuna Madanayake, Nilan Udayanga, et al.. Multiple-base Logarithmic Quantization and Application in Reduced Precision AI Computations. ARITH 2024 - 31st IEEE International Symposium on Computer Arithmetic, Jun 2024, Málaga, Spain. pp.48-51, ⟨10.1109/ARITH61463.2024.00017⟩. ⟨lirmm-04638183⟩

Exporter

BibTeX XML-TEI Dublin Core DC Terms EndNote DataCite

Collections

CNRS ECO LIRMM UNIV-MONTPELLIER ANR CYBERSCURITE

41 Consultations

35 Téléchargements

Multiple-base Logarithmic Quantization and Application in Reduced Precision AI Computations

Résumé

Mots clés

Domaines

Dates et versions

Identifiants

Citer

Exporter

Collections

Altmetric

Partager