Finding text boundaries and finding topic boundaries: two different tasks ?

Abstract: The goal of this paper is to demonstrate that the usual evaluation methods for text segmentation are not suited to every task related to text segmentation. To do so, we differentiate the task of finding text boundaries in a corpus of concatenated texts from the task of finding transitions between topics inside a single text. We worked on a corpus of twenty-two French political speeches, trying to find the boundaries between them when they are concatenated, and to find the topic boundaries inside them when they are not. We compared the results of our distance-based method to those of the well-known C99 algorithm.
Document type:
Conference paper
GoTAL'2008: 6th International Conference on Natural Language Processing, Aug 2008, Gothenburg, Sweden. pp.260-271, 2008

https://hal-lirmm.ccsd.cnrs.fr/lirmm-00336164
Contributor: Alexandre Labadié
Submitted on: Monday, 3 November 2008 - 09:14:36
Last modified on: Thursday, 24 May 2018 - 15:59:23
Document(s) archived on: Monday, 7 June 2010 - 22:37:14

File

gotal.pdf
Files produced by the author(s)

Identifiers

  • HAL Id: lirmm-00336164, version 1

Citation

Alexandre Labadié, Violaine Prince. Finding text boundaries and finding topic boundaries: two different tasks ?. GoTAL'2008: 6th International Conference on Natural Language Processing, Aug 2008, Gothenburg, Sweden. pp.260-271, 2008. 〈lirmm-00336164〉

Metrics

Record views

159

File downloads

259