How to Title Electronic Documents Using Text Mining Techniques - LIRMM - Laboratoire d’Informatique, de Robotique et de Microélectronique de Montpellier
Journal Articles International Journal of Computer Information Systems and Industrial Management Applications Year : 2012

How to Title Electronic Documents Using Text Mining Techniques

Abstract

Automatic titling of text is a task allowing to determine a well formed word group able to represent the text in a relevant way. The main difficulty of this task is to determine a title having morpho-syntactic characteristics close to titles written by concerned people. Our approach has to be relevant for all type of text (e.g. news, emails, fora, and so forth). Our automatic titling method is developed in four stages: Corpus acquisition, candidate sentences determination for titling, noun phrase extraction in the candidate sentences, and finally, selecting a particular noun phrase to play the role of the text title (ChTITRES approach). Evaluation shows that titles determined by our methods are relevant.
Fichier principal
Vignette du fichier
Paper62.pdf (654.14 Ko) Télécharger le fichier
Origin Files produced by the author(s)
Loading...

Dates and versions

lirmm-00687096 , version 1 (12-04-2012)

Identifiers

  • HAL Id : lirmm-00687096 , version 1

Cite

Cédric Lopez, Violaine Prince, Mathieu Roche. How to Title Electronic Documents Using Text Mining Techniques. International Journal of Computer Information Systems and Industrial Management Applications, 2012, 4, pp.562-569. ⟨lirmm-00687096⟩
192 View
318 Download

Share

More