Sensitivity Analysis and Compression Opportunities in DNNs Using Weight Sharing
Abstract
Deep artificial neural networks (DNNs) are currently among the most intensively and widely used predictive models in machine learning. However, the computational workload involved in DNNs is typically out of reach for low-power embedded devices. The approximate computing paradigm can be exploited to reduce DNN complexity: it improves performance and energy efficiency by relaxing the need for fully accurate operations. There is a large number of implementation options leveraging many approximation techniques (e.g., pruning, quantization, weight sharing, low-rank factorization, knowledge distillation). However, to the best of our knowledge, few or no automated approaches exist to explore, select, and generate the best approximate version of a given DNN according to design objectives. The goal of this paper is to demonstrate that the design space exploration phase can enable significant network compression without noticeable accuracy loss. We demonstrate this via an example based on weight sharing and show that our direct conversion method achieves a 4.85x compression rate with 0.14% accuracy loss on ResNet18 and a 4.91x compression rate with 0.44% accuracy loss on SqueezeNet, without requiring any retraining steps.
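To illustrate the weight-sharing idea referred to in the abstract, the following is a minimal sketch of one common implementation (k-means clustering of a layer's weights), not a description of the paper's exact direct conversion method; the function name share_weights and its parameters are hypothetical. Each weight is replaced by its cluster centroid, so only a small codebook plus per-weight indices need to be stored.

```python
import numpy as np
from sklearn.cluster import KMeans

def share_weights(weights, n_clusters=16):
    """Replace each weight by the centroid of its k-means cluster.

    weights: numpy array of layer weights (any shape).
    Returns (shared_weights, codebook, assignments).
    """
    flat = weights.reshape(-1, 1)
    km = KMeans(n_clusters=n_clusters, n_init=10, random_state=0).fit(flat)
    codebook = km.cluster_centers_.ravel()   # the shared weight values
    assignments = km.labels_                 # per-weight index into the codebook
    shared = codebook[assignments]           # reconstructed (approximated) weights
    return shared.reshape(weights.shape), codebook, assignments

# Usage example: with 16 shared values, each weight can be stored as a
# 4-bit index instead of a 32-bit float (plus the small codebook).
rng = np.random.default_rng(0)
layer = rng.normal(size=(64, 64)).astype(np.float32)
shared_layer, codebook, idx = share_weights(layer, n_clusters=16)
```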