Abstract
We present a task to measure an MT system’s capability to translate ambiguous words with their correct sense according to the given context. The task is based on the German–English Word Sense Disambiguation (WSD) test set ContraWSD (Rios Gonzales et al., 2017), but it has been filtered to reduce noise, and the evaluation has been adapted to assess MT output directly rather than scoring existing translations. We evaluate all German–English submissions to the WMT’18 shared translation task, plus a number of submissions from previous years, and find that performance on the task has markedly improved compared to the 2016 WMT submissions (81% → 93% accuracy on the WSD task). We also find that the unsupervised submissions to the task have a low WSD capability, and predominantly translate ambiguous source words with the same sense.
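As an illustration only, an accuracy figure of the kind reported above could be computed along the following lines. This is a minimal sketch and not the authors' scoring script: the function names, the token-matching rule, the three-way labelling, and the toy data are all assumptions made for this example.

```python
import re


def classify(output, correct, wrong):
    """Label one MT output as 'correct', 'wrong', or 'unknown'.

    `correct` and `wrong` are hypothetical sets of single-word English
    translations for the intended sense and for the other senses.
    """
    tokens = set(re.findall(r"\w+", output.lower()))
    has_correct = bool(tokens & {t.lower() for t in correct})
    has_wrong = bool(tokens & {t.lower() for t in wrong})
    if has_correct and not has_wrong:
        return "correct"
    if has_wrong and not has_correct:
        return "wrong"
    # Neither list matched, or both did: no decision can be made.
    return "unknown"


def wsd_accuracy(examples):
    """Share of examples labelled 'correct' (assumed definition)."""
    labels = [classify(out, cor, wrg) for out, cor, wrg in examples]
    return labels.count("correct") / len(labels) if labels else 0.0


# Toy data for German "Schloss" (castle vs. lock); invented for this sketch.
examples = [
    ("The lock on the door was broken.", {"lock"}, {"castle"}),
    ("They visited the lock on the hill.", {"castle"}, {"lock"}),
    ("They visited the palace on the hill.", {"castle"}, {"lock"}),
    ("The castle has a tower.", {"castle"}, {"lock"}),
]

print(wsd_accuracy(examples))
```

Matching on whole tokens rather than substrings avoids counting "lock" inside a word such as "clock"; a real evaluation would also need to handle inflected forms and multi-word translations.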
Original language | English |
---|---|
Title of host publication | EMNLP 2018 THIRD CONFERENCE ON MACHINE TRANSLATION (WMT18) |
Place of Publication | Brussels, Belgium |
Publisher | Association for Computational Linguistics |
Pages | 588-596 |
Number of pages | 9 |
DOIs | |
Publication status | Published - Oct 2018 |
Event | EMNLP 2018 Third Conference on Machine Translation (WMT18), Brussels, Belgium. Duration: 31 Oct 2018 → 1 Nov 2018. http://www.statmt.org/wmt18/ |
Workshop
Workshop | EMNLP 2018 Third Conference on Machine Translation (WMT18) |
---|---|
Abbreviated title | WMT18 |
Country/Territory | Belgium |
City | Brussels |
Period | 31/10/18 → 1/11/18 |
Internet address |