Increasing Long Tail in Weighted Lexical Networks

Mathieu Lafourcade 1 Alain Joubert 1
1 TEXTE - Exploration et exploitation de données textuelles
LIRMM - Laboratoire d'Informatique de Robotique et de Microélectronique de Montpellier
Abstract : Lexical networks can be used with benefit for semantic analysis of texts, word sense disambiguation (WSD) and in general for graph-based Natural Language Processing. Usually strong relations between terms (e.g.: cat --> animal) are sufficient to help for the task, but quite often, weak relations (e.g.: cat --> ball of wool) are necessary. Our purpose here is to acquire such relations by means of online serious games as other classical approaches seems impractical. Indeed, it is difficult to ask the users (non experts) to define a proper weighting for the relations they propose, and then we decided to relate weights with the frequency of their propositions. It allows us to acquire first the strongest relations, but also to populate the long tail of an already existing network. Furthermore, trying to get an estimation of our network by the very users thanks to a tip of the tongue (TOT) software, we realized that they rather tend to favor the relations of the long tail and thus promote their emergence. Developing the long tail of a lexical network with standard and non-standard relations of low weight can be of advantage for tasks such that words retrieval from clues or WSD in texts.
Type de document :
Communication dans un congrès
Cognitive Aspects of the Lexicon (CogAlex-III), COLING, France. pp.16, 2012, 〈http://pageperso.lif.univ-mrs.fr/~michael.zock/cogalex-3.html〉
Liste complète des métadonnées

Littérature citée [26 références]  Voir  Masquer  Télécharger

https://hal-lirmm.ccsd.cnrs.fr/lirmm-00816236
Contributeur : Mathieu Lafourcade <>
Soumis le : samedi 20 avril 2013 - 22:44:51
Dernière modification le : jeudi 11 janvier 2018 - 06:26:53
Document(s) archivé(s) le : dimanche 21 juillet 2013 - 04:07:55

Fichier

COGALEX2012-ML-v4.pdf
Fichiers produits par l'(les) auteur(s)

Identifiants

  • HAL Id : lirmm-00816236, version 1

Collections

Citation

Mathieu Lafourcade, Alain Joubert. Increasing Long Tail in Weighted Lexical Networks. Cognitive Aspects of the Lexicon (CogAlex-III), COLING, France. pp.16, 2012, 〈http://pageperso.lif.univ-mrs.fr/~michael.zock/cogalex-3.html〉. 〈lirmm-00816236〉

Partager

Métriques

Consultations de la notice

164

Téléchargements de fichiers

198