Document-Level Adaptation for Neural Machine Translation

Sachith Sri Ram Kothur, Rebecca Knowles, Philipp Koehn

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Abstract / Description of output

It is common practice to adapt machine translation systems to novel domains, but even a well-adapted system may be able to perform better on a particular document if it were to learn from a translator's corrections within the document itself. We focus on adaptation within a single document -- appropriate for an interactive translation scenario where a model adapts to a human translator's input over the course of a document. We propose two methods: single-sentence adaptation (which performs online adaptation one sentence at a time) and dictionary adaptation (which specifically addresses the issue of translating novel words). Combining the two models results in improvements over both approaches individually, and over baseline systems, even on short documents. On WMT news test data, we observe an improvement of +1.8 BLEU points and +23.3% novel word translation accuracy and on EMEA data (descriptions of medications) we observe an improvement of +2.7 BLEU points and +49.2% novel word translation accuracy.
Original languageEnglish
Title of host publicationProceedings of the 2nd Workshop on Neural Machine Translation and Generation
Place of PublicationMelbourne, Australia
PublisherAssociation for Computational Linguistics
Pages64-73
Number of pages10
Publication statusPublished - 20 Jul 2018
Event2nd Workshop on Neural Machine Translation and Generation - Melbourne, Australia
Duration: 15 Jul 201820 Jul 2018
https://sites.google.com/site/wnmt18/home
https://sites.google.com/site/wnmt18/

Conference

Conference2nd Workshop on Neural Machine Translation and Generation
Abbreviated titleWNMT 2018
Country/TerritoryAustralia
CityMelbourne
Period15/07/1820/07/18
Internet address

Fingerprint

Dive into the research topics of 'Document-Level Adaptation for Neural Machine Translation'. Together they form a unique fingerprint.

Cite this