Abstract
While domain adaptation techniques for SMT have proven to be effective at improving translation quality, their practicality for a multi-domain environment is often limited because of the computational and human costs of developing and maintaining multiple systems adapted to different domains. We present an architecture that delays the computation of translation model features until decoding, allowing for the application of mixture-modeling techniques at decoding time. We also describe a method for unsupervised adaptation with development and test data from multiple domains. Experimental results on two language pairs demonstrate the effectiveness of both our translation model architecture and automatic clustering, with gains of up to 1 BLEU over unadapted systems and single-domain adaptation.
Original language | English |
---|---|
Title of host publication | Proceedings of the 51st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers) |
Place of Publication | Sofia, Bulgaria |
Publisher | Association for Computational Linguistics |
Pages | 832-840 |
Number of pages | 9 |
Publication status | Published - 1 Aug 2013 |
Event | 51st Annual Meeting of the Association for Computational Linguistics, ACL 2013 - Sofia, Bulgaria Duration: 4 Aug 2013 → 9 Aug 2013 |
Conference
Conference | 51st Annual Meeting of the Association for Computational Linguistics, ACL 2013 |
---|---|
Country/Territory | Bulgaria |
City | Sofia |
Period | 4/08/13 → 9/08/13 |