Automatic Titling of Electronic Documents with Noun Phrase Extraction

Abstract : Automatic titling (i.e. providing titles) is one of key domains of Web site accessibility. This paper provides an approach allowing the automatic titling of texts (e.g. emails, fora, etc.) relying on the morphosyntactic study of human written titles in a corpus of various texts. The method is developed in four stages: Corpus acquisition, candidate sentences determination for titling, noun phrase extraction in the candidate sentences, and finally, selecting a particular noun phrase to play the role of the text title (ChTITRES approach). The method has been evaluated by ten users, and the satisfaction enquiry shows that the titles selected through this process are relevant.
Type de document :
Communication dans un congrès
SOCPAR'10: SOft Computing and PAttern Recognition, France. pp.168-171, 2010, 〈http://www.socpar.org/〉
Liste complète des métadonnées

https://hal-lirmm.ccsd.cnrs.fr/lirmm-00563903
Contributeur : Cédric Lopez <>
Soumis le : lundi 7 février 2011 - 15:31:41
Dernière modification le : jeudi 11 janvier 2018 - 06:26:53
Document(s) archivé(s) le : dimanche 8 mai 2011 - 03:37:24

Fichier

4_pages.pdf
Fichiers produits par l'(les) auteur(s)

Identifiants

  • HAL Id : lirmm-00563903, version 1

Collections

Citation

Cédric Lopez, Violaine Prince, Mathieu Roche. Automatic Titling of Electronic Documents with Noun Phrase Extraction. SOCPAR'10: SOft Computing and PAttern Recognition, France. pp.168-171, 2010, 〈http://www.socpar.org/〉. 〈lirmm-00563903〉

Partager

Métriques

Consultations de la notice

167

Téléchargements de fichiers

158