FIN3E Approach: Identification of Named Entities from Extracted Terms

Mathieu Roche 1
1 TEXTE - Exploration et exploitation de données textuelles
LIRMM - Laboratoire d'Informatique de Robotique et de Microélectronique de Montpellier
Abstract : The Named Entities (NE) are classically defined as the names of People, Places, and Organizations. Moreover other NE classes as Documents (e.g. software, hardware), and Sciences (e.g. illness, medications) exist. In order to identify NE, a lot of systems rely on the presence of uppercases. This technique can be inefficient to treat non-standard documents (e.g. emails, blogs, fora, texts or fragments of texts totally written in uppercase or lowercase). In this work, we do not use this kind of information to identify the NE. Formally, to characterize the NE, there exists two important criteria: (1) Referential uniqueness (i.e. a proper noun refers to one referential entity), (2) Denominative stability (i.e. little possible variations). Our work is based on this last criterion to identify the NE from Noun-Noun terms obtained by terminology extraction methods. Our method deals with a cognitive process that simulates a human reasoning: (1) Expressing differently one term by a reformulation technique, (2) Judging the relevance of this reformulation to identify NE.
Type de document :
Poster
ICCS'2010: International Conference on Cognitive Science, Beijing, China. pp.356-358, 2010
Liste complète des métadonnées

https://hal-lirmm.ccsd.cnrs.fr/lirmm-00588527
Contributeur : Mathieu Roche <>
Soumis le : dimanche 24 avril 2011 - 16:39:35
Dernière modification le : jeudi 24 mai 2018 - 15:59:23

Identifiants

  • HAL Id : lirmm-00588527, version 1

Collections

Citation

Mathieu Roche. FIN3E Approach: Identification of Named Entities from Extracted Terms. ICCS'2010: International Conference on Cognitive Science, Beijing, China. pp.356-358, 2010. 〈lirmm-00588527〉

Partager

Métriques

Consultations de la notice

33