IITP-MT System for Gujarati-English News Translation Task at WMT 2019

Sukanta Sen, Kamal Kumar Gupta, Asif Ekbal, Pushpak Bhattacharyya

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

Abstract

We describe our submission to the WMT 2019 News Translation shared task for the Gujarati-English language pair. We submit constrained systems, i.e., we rely on the data provided for this language pair and do not use any external data. We train a Transformer-based subword-level neural machine translation (NMT) system using the original parallel corpus along with a synthetic parallel corpus obtained through back-translation of monolingual data. Our primary systems achieve BLEU scores of 10.4 and 8.1 for Gujarati→English and English→Gujarati, respectively. We observe that incorporating monolingual data through back-translation improves the BLEU score significantly over baseline NMT and SMT systems for this language pair.

Original language: English
Title of host publication: Proceedings of the Fourth Conference on Machine Translation (Volume 2: Shared Task Papers, Day 1)
Place of Publication: Florence, Italy
Publisher: Association for Computational Linguistics
Pages: 407-411
Number of pages: 5
ISBN (Electronic): 978-1-950737-27-7
Publication status: Published - 1 Aug 2019
Event: ACL 2019 Fourth Conference on Machine Translation - Florence, Italy
Duration: 1 Aug 2019 to 2 Aug 2019
http://www.statmt.org/wmt19/

Conference

Conference: ACL 2019 Fourth Conference on Machine Translation
Abbreviated title: WMT19
Country/Territory: Italy
City: Florence
Period: 1/08/19 to 2/08/19
