Findings of the 2011 Workshop on Statistical Machine Translation

Chris Callison-Burch, Philipp Koehn, Christof Monz, Omar Zaidan

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Abstract

This paper presents the results of the WMT11 shared tasks, which included a translation task, a system combination task, and a task for machine translation evaluation metrics. We conducted a large-scale manual evaluation of 148 machine translation systems and 41 system combination entries. We used the ranking of these systems to measure how strongly automatic metrics correlate with human judgments of translation quality for 21 evaluation metrics. This year featured a Haitian Creole to English task translating SMS messages sent to an emergency response service in the aftermath of the Haitian earthquake. We also conducted a pilot ‘tunable metrics’ task to test whether optimizing a fixed system to different metrics would result in perceptibly different translation quality.
Original languageEnglish
Title of host publicationProceedings of the Sixth Workshop on Statistical Machine Translation
Place of PublicationEdinburgh, Scotland
PublisherAssociation for Computational Linguistics
Pages22-64
Number of pages43
ISBN (Print)978-1-937284-12-1
Publication statusPublished - 1 Jul 2011

Fingerprint

Dive into the research topics of 'Findings of the 2011 Workshop on Statistical Machine Translation'. Together they form a unique fingerprint.

Cite this