Conference paper, Year: 2008

Finding text boundaries and finding topic boundaries: two different tasks ?

Abstract

The goal of this paper is to demonstrate that the usual evaluation methods for text segmentation are not suited to every task linked to text segmentation. To do so, we differentiated the task of finding text boundaries in a corpus of concatenated texts from the task of finding transitions between topics inside the same text. We worked on a corpus of twenty-two French political discourses, trying to find the boundaries between them when they are concatenated, and to find topic boundaries inside them when they are not. We compared the results of our distance-based method to the well-known c99 algorithm.
Main file
gotal.pdf (197.6 KB)
Origin: Files produced by the author(s)

Dates and versions

lirmm-00336164, version 1 (03-11-2008)

Identifiers

  • HAL Id: lirmm-00336164, version 1

Cite

Alexandre Labadié, Violaine Prince. Finding text boundaries and finding topic boundaries: two different tasks ?. GoTAL'2008: 6th International Conference on Natural Language Processing, Aug 2008, Gothenburg, Sweden. pp.260-271. ⟨lirmm-00336164⟩
