How to Title Electronic Documents Using Text Mining Techniques

Cédric Lopez 1 Violaine Prince 1 Mathieu Roche 1, *
* Corresponding author
1 TEXTE - Exploration et exploitation de données textuelles
LIRMM - Laboratoire d'Informatique de Robotique et de Microélectronique de Montpellier
Abstract : Automatic titling is the task of determining a well-formed word group that represents a text in a relevant way. The main difficulty is to produce a title whose morpho-syntactic characteristics are close to those of titles written by humans. Our approach has to be relevant for all types of text (e.g. news, emails, forums, and so forth). Our automatic titling method is developed in four stages: corpus acquisition, determination of candidate sentences for titling, noun phrase extraction from the candidate sentences, and finally, selection of one particular noun phrase to play the role of the text title (the ChTITRES approach). Evaluation shows that the titles determined by our method are relevant.
Document type :
Journal articles

Cited literature : 21 references

https://hal-lirmm.ccsd.cnrs.fr/lirmm-00687096
Contributor : Cédric Lopez
Submitted on : Thursday, April 12, 2012 - 11:39:33 AM
Last modification on : Monday, July 22, 2019 - 4:34:05 PM
Long-term archiving on : Friday, July 13, 2012 - 9:18:21 AM

File

Paper62.pdf
Files produced by the author(s)

Identifiers

  • HAL Id : lirmm-00687096, version 1

Citation

Cédric Lopez, Violaine Prince, Mathieu Roche. How to Title Electronic Documents Using Text Mining Techniques. International Journal of Computer Information Systems and Industrial Management Applications, Machine Intelligence Research Labs (MIR Labs), 2012, 4, pp.562-569. ⟨lirmm-00687096⟩
