Conference paper, 2023

The Locality and Symmetry of Positional Encodings

Abstract

Positional Encodings (PEs) are used to inject word-order information into transformer-based language models. While they can significantly enhance the quality of sentence representations, their specific contribution to language models is not fully understood, especially given recent findings that various positional encodings are insensitive to word order. In this work, we conduct a systematic study of positional encodings in Bidirectional Masked Language Models (BERT-style), which complements existing work in three aspects: (1) We uncover the core function of PEs by identifying two common properties, Locality and Symmetry; (2) We show that the two properties are closely correlated with the performance of downstream tasks; (3) We quantify the weakness of current PEs by introducing two new probing tasks, on which current PEs perform poorly. We believe that these results are the basis for developing better PEs for transformer-based language models. The code is available at https://github.
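For readers new to the topic, the sketch below illustrates the two properties named in the abstract on the classic fixed sinusoidal positional encoding of Vaswani et al. (2017): the dot-product similarity between two position embeddings is a function of the distance between the positions alone, so it peaks at zero offset (locality) and is mirror-symmetric around each position (symmetry). This is generic background with arbitrary illustrative parameters (max_len, d_model), not the paper's own code; the paper's analysis of learned PEs in BERT-style models is in the PDF below.

```python
import numpy as np

def sinusoidal_positional_encoding(max_len: int, d_model: int) -> np.ndarray:
    """Fixed sinusoidal PE from 'Attention Is All You Need' (Vaswani et al., 2017).

    Returns an array of shape (max_len, d_model): even dimensions use sine,
    odd dimensions use cosine, at geometrically spaced frequencies.
    """
    positions = np.arange(max_len)[:, np.newaxis]            # (max_len, 1)
    dims = np.arange(0, d_model, 2)[np.newaxis, :]           # (1, d_model/2)
    angles = positions / np.power(10000.0, dims / d_model)   # (max_len, d_model/2)
    pe = np.zeros((max_len, d_model))
    pe[:, 0::2] = np.sin(angles)
    pe[:, 1::2] = np.cos(angles)
    return pe

# Illustrative parameters, not taken from the paper.
pe = sinusoidal_positional_encoding(max_len=128, d_model=64)

# Similarity between positions i and j: pe_i . pe_j = sum_k cos(w_k * (i - j)),
# a function of (i - j) alone. It is therefore exactly symmetric around each
# position and largest at i = j, decaying for nearby offsets: the locality and
# symmetry properties discussed above.
sims = pe @ pe.T
print(np.round(sims[64, 60:69], 2))  # mirror-symmetric around index 64
```

For fixed sinusoidal encodings these two properties hold by construction; the abstract's claim is that analogous locality and symmetry emerge in the learned PEs of BERT-style models and correlate with downstream performance.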
Main file
The_Locality_and_Symmetry_of_Positional_Encodings.pdf (2.92 MB)
Origin: Files produced by the author(s)

Dates and versions

hal-04330367, version 1 (08-12-2023)

Identifiers

  • HAL Id: hal-04330367, version 1

Cite

Lihu Chen, Gaël Varoquaux, Fabian M. Suchanek. The Locality and Symmetry of Positional Encodings. EMNLP 2023 - Conference on Empirical Methods in Natural Language Processing, Dec 2023, Singapore, Singapore. ⟨hal-04330367⟩
