Edinburgh’s Syntax-Based Systems at WMT 2014

Philip Williams, Rico Sennrich, Maria Nadejde, Matthias Huck, Eva Hasler, Philipp Koehn

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Abstract / Description of output

This paper describes the string-to-tree systems built at the University of Edinburgh for the WMT 2014 shared translation task. We developed systems for English-German, Czech-English, French-English, German-English, Hindi-English, and Russian-English. This year we improved our English-German system through target-side compound splitting, morphosyntactic constraints, and refinements to parse tree annotation; we addressed the out-of-vocabulary problem using transliteration for Hindi and Russian and using morphological reduction for Russian; we improved our German-English system through tree binarization; and we reduced system development time by filtering the tuning sets.
Original languageEnglish
Title of host publicationProceedings of the Ninth Workshop on Statistical Machine Translation
Place of PublicationBaltimore, Maryland, USA
PublisherAssociation for Computational Linguistics
Pages207-214
Number of pages8
Publication statusPublished - 1 Jun 2014

Fingerprint

Dive into the research topics of 'Edinburgh’s Syntax-Based Systems at WMT 2014'. Together they form a unique fingerprint.

Cite this