How to Title Electronic Documents Using Text Mining Techniques - LIRMM - Laboratoire d’Informatique, de Robotique et de Microélectronique de Montpellier Accéder directement au contenu
Article Dans Une Revue International Journal of Computer Information Systems and Industrial Management Applications Année : 2012

How to Title Electronic Documents Using Text Mining Techniques

Résumé

Automatic titling of text is a task allowing to determine a well formed word group able to represent the text in a relevant way. The main difficulty of this task is to determine a title having morpho-syntactic characteristics close to titles written by concerned people. Our approach has to be relevant for all type of text (e.g. news, emails, fora, and so forth). Our automatic titling method is developed in four stages: Corpus acquisition, candidate sentences determination for titling, noun phrase extraction in the candidate sentences, and finally, selecting a particular noun phrase to play the role of the text title (ChTITRES approach). Evaluation shows that titles determined by our methods are relevant.
Fichier principal
Vignette du fichier
Paper62.pdf (654.14 Ko) Télécharger le fichier
Origine : Fichiers produits par l'(les) auteur(s)
Loading...

Dates et versions

lirmm-00687096 , version 1 (12-04-2012)

Identifiants

  • HAL Id : lirmm-00687096 , version 1

Citer

Cédric Lopez, Violaine Prince, Mathieu Roche. How to Title Electronic Documents Using Text Mining Techniques. International Journal of Computer Information Systems and Industrial Management Applications, 2012, 4, pp.562-569. ⟨lirmm-00687096⟩
184 Consultations
309 Téléchargements

Partager

Gmail Facebook X LinkedIn More