Conference paper, 2022

A Heuristic Exploration of Retraining-free Weight-Sharing for CNN Compression

Abstract

The computational workload involved in Convolutional Neural Networks (CNNs) is typically out of reach for low-power embedded devices. The scientific literature provides a large number of approximation techniques to address this problem. Among them, the Weight-Sharing (WS) technique gives promising results, but it requires carefully determining the shared values for each layer of a given CNN. As the number of possible solutions grows exponentially with the number of layers, the WS Design Space Exploration (DSE) time can easily explode for state-of-the-art CNNs. In this paper, we propose a new heuristic approach to drastically reduce the exploration time without sacrificing the quality of the output. Experiments on recent CNNs (GoogleNet [1], ResNet50V2 [2], MobileNetV2 [3], InceptionV3 [4], and EfficientNet [5]), trained on the ImageNet [6] dataset, show over 5× memory compression at an acceptable accuracy loss (complying with the MLPerf [7] quality target), without any retraining step and in less than 10 hours. Our code is publicly available on GitHub [8].
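To make the technique concrete: weight sharing replaces a layer's full-precision weights with a small codebook of shared values plus per-weight indices. If each of the L layers can choose among K candidate codebook sizes, the design space contains K^L configurations, which is why the DSE time grows exponentially with depth. The sketch below is a minimal, hypothetical illustration of per-layer weight sharing via k-means clustering; the function name and the use of scikit-learn are assumptions for illustration, not the paper's actual heuristic.

```python
import numpy as np
from sklearn.cluster import KMeans

def share_weights(weights: np.ndarray, n_shared: int) -> np.ndarray:
    """Hypothetical helper: replace each weight with the nearest of
    n_shared k-means centroids (the layer's codebook), so the layer
    can be stored as a small codebook plus low-bit indices."""
    flat = weights.reshape(-1, 1)
    km = KMeans(n_clusters=n_shared, n_init=10, random_state=0).fit(flat)
    # Map every weight to its centroid and restore the original shape.
    return km.cluster_centers_[km.labels_].reshape(weights.shape)

# Toy usage: quantize a 3x3x64x64 conv kernel to 16 shared values,
# so each weight index fits in 4 bits.
rng = np.random.default_rng(0)
kernel = rng.normal(size=(3, 3, 64, 64)).astype(np.float32)
compressed = share_weights(kernel, n_shared=16)
```

Storing the kernel's 36,864 weights as 4-bit indices plus a 16-entry codebook, instead of 32-bit floats, already yields roughly 8× memory compression for that layer; the paper's contribution is a heuristic for choosing such per-layer configurations quickly, without retraining.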
Main file: ASP_DAC_2022.pdf (433.53 KB)
Origin: Files produced by the author(s)

Dates and versions

lirmm-03767100, version 1 (01-09-2022)

Identifiers

Cite

Etienne Dupuis, David Novo, Ian O'Connor, Alberto Bosio. A Heuristic Exploration of Retraining-free Weight-Sharing for CNN Compression. ASP-DAC 2022 - 27th Asia and South Pacific Design Automation Conference, Jan 2022, Taipei, Taiwan. pp. 134-139, ⟨10.1109/ASP-DAC52403.2022.9712487⟩. ⟨lirmm-03767100⟩