Alignment Memories: A Useful Tool to Handle Phrase Alignment Bottleneck

Johan Segura 1 Violaine Prince 1
1 TEXTE - Exploration et exploitation de données textuelles
LIRMM - Laboratoire d'Informatique de Robotique et de Microélectronique de Montpellier
Abstract : This document presents two bilingual phrase-based alignment methods that handle syntactic constituents (sub-sentential components) of parallel sentences. The methods rely on an asymmetrical parsing of the two languages: light part-of-speech tagging for the target language and syntactic tree building for the 'source' language. The complexity of each is studied. One of their benefits is that they do not require lexical knowledge to produce an alignment. Another is that they align constituents of variable length and structure, thus providing information about divergent translations. Their originality lies in the fact that the parse of the supposed source language is reused in both resource building and the alignment process. The models and methods can be seen as a subclass of Example-Based Machine Translation. They build what can be called an Alignment Memory, the alignment counterpart of Translation Memories. The information acquisition process, partly supervised, is embedded in an online graphical human interface that accelerates the construction of gold-standard corpora by one or many users.
Document type :
Conference papers
Cited literature [20 references]

https://hal-lirmm.ccsd.cnrs.fr/lirmm-00764092
Contributor : Violaine Prince-Barbier
Submitted on : Wednesday, December 12, 2012 - 12:28:42 PM
Last modification on : Tuesday, December 4, 2018 - 3:12:02 PM
Document(s) archived on : Wednesday, March 13, 2013 - 3:52:57 AM

File

CLA2011.pdf
Files produced by the author(s)

Identifiers

  • HAL Id : lirmm-00764092, version 1

Citation

Johan Segura, Violaine Prince. Alignment Memories: A Useful Tool to Handle Phrase Alignment Bottleneck. CLA'2011: Computational Linguistics-Applications Conference, Oct 2011, Jachranka, Poland. pp.61-67. ⟨lirmm-00764092⟩
