Abstract
We posed the shared task of assigning sentence-level quality scores for a very noisy corpus of sentence pairs crawled from the web, with the goal of sub-selecting 1% and 10% of high-quality data to be used to train machine translation systems. Seventeen participants from companies, national research labs, and universities took part in this task.
Original language | English |
---|---|
Title of host publication | Proceedings of the Third Conference on Machine Translation: Shared Task Papers |
Place of Publication | Belgium, Brussels |
Publisher | Association for Computational Linguistics |
Pages | 726-739 |
Number of pages | 14 |
Publication status | Published - 31 Oct 2018 |
Event | EMNLP 2018 Third Conference on Machine Translation (WMT18), Brussels, Belgium. Duration: 31 Oct 2018 → 1 Nov 2018. http://www.statmt.org/wmt18/ |
Workshop
Workshop | EMNLP 2018 Third Conference on Machine Translation (WMT18) |
---|---|
Abbreviated title | WMT18 |
Country/Territory | Belgium |
City | Brussels |
Period | 31/10/18 → 1/11/18 |