Abstract / Description of output
Existing approaches for evaluating word order in machine translation work with metrics computed directly over a permutation of word positions in system output relative to a reference translation. However, every permutation factorizes into a permutation tree (PET) built of primal permutations, i.e., atomic units that do not factorize any further. In this paper we explore the idea that permutations factorizing into (on average) shorter primal permutations should represent simpler ordering as well. Consequently, we contribute Permutation Complexity, a class of metrics over PETs and their extension to forests, and define tight metrics, a sub-class of metrics implementing this idea. Subsequently we define example tight metrics and empirically test them in word order evaluation. Experiments on the WMT13 data sets for ten language pairs show that a tight metric is more often than not better than the baselines.
Original language | English |
---|---|
Title of host publication | Proceedings of COLING 2016, the 26th International Conference on Computational Linguistics: Technical Papers |
Place of Publication | Osaka, Japan |
Publisher | The COLING 2016 Organizing Committee |
Pages | 2164-2173 |
Number of pages | 10 |
Publication status | Published - 30 Nov 2016 |
Event | 26th International Conference on Computational Linguistics - Osaka, Japan Duration: 11 Dec 2016 → 16 Dec 2016 http://coling2016.anlp.jp/ |
Conference
Conference | 26th International Conference on Computational Linguistics |
---|---|
Abbreviated title | COLING 2016 |
Country/Territory | Japan |
City | Osaka |
Period | 11/12/16 → 16/12/16 |
Internet address |