Abstract
We posed the shared task of assigning sentence-level quality scores for a very noisy corpus of sentence pairs crawled from the web, with the goal of sub-selecting 1% and 10% of high-quality data to be used to train machine translation systems. Seventeen participants from companies, national research labs, and universities took part in this task.
| Original language | English |
|---|---|
| Title of host publication | Proceedings of the Third Conference on Machine Translation: Shared Task Papers |
| Place of Publication | Brussels, Belgium |
| Publisher | Association for Computational Linguistics |
| Pages | 726-739 |
| Number of pages | 14 |
| Publication status | Published - 31 Oct 2018 |
| Event | EMNLP 2018 Third Conference on Machine Translation (WMT18), Brussels, Belgium. Duration: 31 Oct 2018 → 1 Nov 2018. http://www.statmt.org/wmt18/ |
Workshop
| Workshop | EMNLP 2018 Third Conference on Machine Translation (WMT18) |
|---|---|
| Abbreviated title | WMT18 |
| Country/Territory | Belgium |
| City | Brussels |
| Period | 31/10/18 → 1/11/18 |
| Internet address | http://www.statmt.org/wmt18/ |