A quantitative analysis of reordering phenomena

Alexandra Birch-Mayne, Phil Blunsom, Miles Osborne

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Abstract

Reordering is a serious challenge in statistical machine translation. We propose a method for analysing syntactic reordering in parallel corpora and apply it to understanding the differences in the performance of SMT systems. Results at recent large-scale evaluation campaigns show that synchronous grammar-based statistical machine translation models produce superior results for language pairs such as Chinese to English. However, for language pairs such as Arabic to English, phrasebased approaches continue to be competitive. Until now, our understanding of these results has been limited to differences in BLEU scores. Our analysis shows that current state-of-the-art systems fail to capture the majority of reorderings found in real data.
Original languageEnglish
Title of host publicationProceedings of the Fourth Workshop on Statistical Machine Translation 2009
PublisherAssociation for Computational Linguistics
Pages197-205
Number of pages9
Publication statusPublished - 2009

Fingerprint

Dive into the research topics of 'A quantitative analysis of reordering phenomena'. Together they form a unique fingerprint.

Cite this