On the Automatic Exploration of Weight Sharing for Deep Neural Network Compression
Abstract
Deep neural networks demonstrate impressive levels of performance, particularly in computer vision and speech recognition. However, their computational workload and storage requirements inhibit their deployment in resource-limited embedded systems. The approximate computing paradigm, which improves performance and energy efficiency by relaxing the need for fully accurate operations, has been widely explored in the literature. There is a large number of implementation options with very different approximation strategies (such as pruning, quantization, low-rank factorization, and knowledge distillation). To the best of our knowledge, no automated approach exists to explore, select, and generate the best approximate versions of a given convolutional neural network (CNN) according to the design objectives. The goal of this work in progress is to demonstrate that a design space exploration phase can enable significant network compression without noticeable accuracy loss. We demonstrate this via an example based on weight sharing and show that our method can obtain a 4x compression rate on an int-16 version of LeNet-5 (a five-layer CNN with 1,720 kbit of weights) without retraining and without any accuracy loss.
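The abstract does not spell out the weight-sharing mechanics; the sketch below illustrates the general idea under common assumptions (a k-means codebook per layer, as in Deep Compression-style weight sharing). The function `share_weights` and the layer shape are hypothetical, not taken from the paper; it only shows how a small shared codebook can yield the kind of 4x storage reduction the abstract reports for int-16 weights.

```python
# Minimal sketch of weight sharing via k-means clustering (illustrative only;
# the paper's exact exploration and selection method is not described here).
import numpy as np
from sklearn.cluster import KMeans

def share_weights(weights: np.ndarray, n_clusters: int):
    """Cluster a layer's weights and replace each weight by its centroid.

    Returns the reconstructed tensor, the codebook, and the per-weight indices.
    """
    flat = weights.reshape(-1, 1).astype(np.float32)
    km = KMeans(n_clusters=n_clusters, n_init=10, random_state=0).fit(flat)
    codebook = km.cluster_centers_.ravel()        # shared weight values
    indices = km.labels_.astype(np.uint8)         # index per original weight
    shared = codebook[indices].reshape(weights.shape)
    return shared, codebook, indices

# With 16 shared values, each weight is stored as a 4-bit index instead of a
# 16-bit value: roughly a 4x compression of weight storage (codebook aside).
w = np.random.randn(120, 84).astype(np.float32)   # a LeNet-5-like FC layer
shared, codebook, idx = share_weights(w, n_clusters=16)
orig_bits = w.size * 16                           # int-16 baseline storage
comp_bits = idx.size * 4 + codebook.size * 16     # 4-bit indices + codebook
print(f"compression rate: {orig_bits / comp_bits:.2f}x")
```

Note that replacing weights by their nearest centroid, without retraining, is exactly the setting the abstract evaluates; the exploration phase would then search over choices such as the number of clusters per layer.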