Two Memory-Based Methods for Phrase Alignment

Johan Segura; Violaine Prince

Communication Dans Un Congrès Année : 2011

Two Memory-Based Methods for Phrase Alignment

(1) , (1)

Johan Segura

Fonction : Auteur

Exploration et exploitation de données textuelles

Violaine Prince

Fonction : Auteur
PersonId : 942907
ORCID : 0000-0002-5997-9677

Exploration et exploitation de données textuelles

Résumé

This document presents two bilingual phrase-based alignment methods handling syntactic constituents (sub-sentential components) of parallel sentences. The methods relie on an asymmetrical parsing of both languages: Light part-of-speech tagging for the target language, syntactic tree building for the 'source' language and the complexity of each is studied. One of their benefits is that they do not require lexical knowledge for granting alignment. Another is that they align constituents of variable length and structure, thus providing information about divergent translations. Their originality rely on the fact that parsing of the supposed source language is reused both in resource building and alignment process. The models and methods can be seen as a subclass of Example Based Machine Translation.

Mots clés

Natural Language Processing Machine Translation Alignment

Domaines

Traitement du texte et du document

Fichier principal

SNLP2011.pdf (119.88 Ko)

Origine	Fichiers produits par l'(les) auteur(s)

Violaine Prince-Barbier : Connectez-vous pour contacter le contributeur

https://hal-lirmm.ccsd.cnrs.fr/lirmm-00838090

Soumis le : lundi 24 juin 2013-16:55:21

Dernière modification le : samedi 15 juillet 2023-04:09:53

Archivage à long terme le : mercredi 25 septembre 2013-04:11:34

Dates et versions

lirmm-00838090 , version 1 (24-06-2013)

Identifiants

HAL Id : lirmm-00838090 , version 1

Citer

Johan Segura, Violaine Prince. Two Memory-Based Methods for Phrase Alignment. 5th Language and Technology Conference, Nov 2011, Poland. pp.101-110. ⟨lirmm-00838090⟩

Exporter

BibTeX XML-TEI Dublin Core DC Terms EndNote DataCite

Collections

CNRS TEXTE LIRMM MIPS UNIV-MONTPELLIER

133 Consultations

163 Téléchargements

Two Memory-Based Methods for Phrase Alignment

Résumé

Mots clés

Domaines

Dates et versions

Identifiants

Citer

Exporter

Collections

Partager