Combining Domain and Topic Adaptation for SMT

Eva Hasler, Barry Haddow, Philipp Koehn

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Abstract / Description of output

Recent years have seen increased interest in adapting translation models to test domains that are known in advance as well as using latent topic representations to adapt to unknown test domains. However, the relationship between domains and latent topics is still somewhat unclear and topic adaptation approaches typically do not make use of domain knowledge in the training data. We show empirically that combining domain and topic adaptation approaches can be beneficial and that topic representations can be used to predict the domain of a test document. Our best combined model yields gains of up to 0.82 BLEU over a domain-adapted translation system and up to 1.67 BLEU over an unadapted system, measured on the stronger of two training conditions.
Original languageEnglish
Title of host publicationProceedings of AMTA 2014
Pages139-151
Number of pages13
Publication statusPublished - 2014

Fingerprint

Dive into the research topics of 'Combining Domain and Topic Adaptation for SMT'. Together they form a unique fingerprint.

Cite this