Word-based alignment, phrase-based translation: What's the link?

Adam Lopez, Philip Resnik

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Abstract

State-of-the-art statistical machine translation is based on alignments between phrases – sequences of words in the source and target sentences. The learning step in these systems often relies on alignments between words. It is often assumed that the quality of this word alignment is critical for translation. However, recent results suggest that the relationship between alignment quality and translation quality is weaker than previously thought. We investigate this question directly, comparing the impact of high-quality alignments with a carefully constructed set of degraded alignments. In order to tease apart various interactions, we report experiments investigating the impact of alignments on different aspects of the system. Our results confirm a weak correlation, but they also illustrate that more data and better feature engineering may be more beneficial than better alignment.
Original languageEnglish
Title of host publicationProceedings of AMTA 2006
Publication statusPublished - 2006

Fingerprint

Dive into the research topics of 'Word-based alignment, phrase-based translation: What's the link?'. Together they form a unique fingerprint.

Cite this