Abstract
We present our submission for the 1st Translation Memory Cleaning Shared Task. We treat the task as a 3-class classification problem and extract features that indicate (i) source sentence complexity, (ii) misalignments between source and target, and (iii) target sentence complexity. Our results show that focusing on the target side and finding ways to estimate the alignment quality between source and target yields expressive features which, together with a reliable classifier, produces competitive results. Our submission is ranked on 2nd place among 6 for the EN-DE language pair
Original language | English |
---|---|
Title of host publication | Proceedings of 2nd Workshop on Natural Language Processing for Translation Memories (NLP4TM 2016) |
Place of Publication | Portorož, Slovenia |
Number of pages | 4 |
Publication status | Published - May 2016 |
Event | 2nd Workshop on Natural Language Processing for Translation Memories - Portorož, Slovenia Duration: 28 May 2016 → 28 May 2016 http://rgcl.wlv.ac.uk/nlp4tm2016/ |
Conference
Conference | 2nd Workshop on Natural Language Processing for Translation Memories |
---|---|
Abbreviated title | NLP4TM 2016 |
Country/Territory | Slovenia |
City | Portorož |
Period | 28/05/16 → 28/05/16 |
Internet address |