Projects per year
Abstract
In this paper we improve Urdu→Hindi-English machine translation through triangulation and transliteration. First we built an Urdu→Hindi SMT system by inducing triangulated and transliterated phrase-tables from Urdu–English and Hindi–English phrase translation models. We then use it to translate the Urdu part of the Urdu-English parallel data into Hindi, thus creating an artificial Hindi-English parallel data. Our phrase-translation strategies give an improvement of up to +3.35 BLEU points over a baseline Urdu→Hindi system. The synthesized data improve Hindi→English system by +0.35 and English→Hindi system by +1.0 BLEU points.
| Original language | English |
|---|---|
| Title of host publication | Proceedings of 17th Annual conference of the European Association for Machine Translation |
| Pages | 71-78 |
| Number of pages | 8 |
| Publication status | Published - 2014 |
Fingerprint
Dive into the research topics of 'Improving machine translation via triangulation and transliteration'. Together they form a unique fingerprint.Projects
- 2 Finished
-
MateCat - Machine Translation Enhanced Computer Assited Translation
Koehn, P. (Principal Investigator)
1/11/11 → 31/10/14
Project: Research