LIRMM - Laboratoire d’Informatique, de Robotique et de Microélectronique de Montpellier
Conference paper, Year: 2008

Finding text boundaries and finding topic boundaries: two different tasks?

Abstract

The goal of this paper is to demonstrate that the usual evaluation methods for text segmentation are not suited to every task linked to text segmentation. To do so, we differentiated the task of finding text boundaries in a corpus of concatenated texts from the task of finding transitions between topics inside the same text. We worked on a corpus of twenty-two French political discourses, trying to find the boundaries between them when they are concatenated and to find topic boundaries inside them when they are not. We compared the results of our distance-based method to the well-known c99 algorithm.
Main file
gotal.pdf (197.6 KB)
Origin: Files produced by the author(s)

Dates and versions

lirmm-00336164, version 1 (03-11-2008)

Identifiers

  • HAL Id: lirmm-00336164, version 1

Cite

Alexandre Labadié, Violaine Prince. Finding text boundaries and finding topic boundaries: two different tasks? GoTAL'2008: 6th International Conference on Natural Language Processing, Aug 2008, Gothenburg, Sweden. pp. 260-271. ⟨lirmm-00336164⟩