Samsung and University of Edinburgh's System for the IWSLT 2018 Low Resource MT Task

Philip Williams, Marcin Chochowski, Pawel Przybysz, Rico Sennrich, Barry Haddow, Alexandra Birch

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

Abstract

This paper describes the joint submission to the IWSLT 2018 Low Resource MT task by Samsung R&D Institute, Poland, and the University of Edinburgh. We focused on supplementing the very limited in-domain Basque-English training data with out-of-domain data, with synthetic data, and with data for other language pairs. We also experimented with a variety of model architectures and features, which included the development of extensions to the Nematus toolkit. Our submission was ultimately produced by a system combination in which we reranked translations from our strongest individual system using multiple weaker systems.
Original language: English
Title of host publication: Proceedings of the 15th International Workshop on Spoken Language Translation
Place of Publication: Bruges, Belgium
Pages: 118-123
Number of pages: 6
Publication status: Published - 2018
Event: 15th International Workshop on Spoken Language Translation 2018 - Bruges, Belgium
Duration: 29 Oct 2018 to 30 Oct 2018
https://workshop2018.iwslt.org/index.php

Conference

Conference: 15th International Workshop on Spoken Language Translation 2018
Abbreviated title: IWSLT 2018
Country/Territory: Belgium
City: Bruges
Period: 29/10/18 to 30/10/18