Two Memory-Based Methods for Phrase Alignment

Johan Segura 1 Violaine Prince 1
1 TEXTE - Exploration et exploitation de données textuelles
LIRMM - Laboratoire d'Informatique de Robotique et de Microélectronique de Montpellier
Abstract : This document presents two bilingual phrase-based alignment methods handling syntactic constituents (sub-sentential components) of parallel sentences. The methods relie on an asymmetrical parsing of both languages: Light part-of-speech tagging for the target language, syntactic tree building for the 'source' language and the complexity of each is studied. One of their benefits is that they do not require lexical knowledge for granting alignment. Another is that they align constituents of variable length and structure, thus providing information about divergent translations. Their originality rely on the fact that parsing of the supposed source language is reused both in resource building and alignment process. The models and methods can be seen as a subclass of Example Based Machine Translation.
Type de document :
Communication dans un congrès
University of Poznan. 5th Language and Technology Conference, Nov 2011, Poland. 1, pp.101-110, 2011, 〈http://www.ltc.amu.edu.pl/〉
Liste complète des métadonnées

Littérature citée [20 références]  Voir  Masquer  Télécharger

https://hal-lirmm.ccsd.cnrs.fr/lirmm-00838090
Contributeur : Violaine Prince-Barbier <>
Soumis le : lundi 24 juin 2013 - 16:55:21
Dernière modification le : jeudi 11 janvier 2018 - 02:06:42
Document(s) archivé(s) le : mercredi 25 septembre 2013 - 04:11:34

Fichier

SNLP2011.pdf
Fichiers produits par l'(les) auteur(s)

Identifiants

  • HAL Id : lirmm-00838090, version 1

Collections

Citation

Johan Segura, Violaine Prince. Two Memory-Based Methods for Phrase Alignment. University of Poznan. 5th Language and Technology Conference, Nov 2011, Poland. 1, pp.101-110, 2011, 〈http://www.ltc.amu.edu.pl/〉. 〈lirmm-00838090〉

Partager

Métriques

Consultations de la notice

138

Téléchargements de fichiers

136