Projects per year
Abstract / Description of output
Despite its potential to improve lexical selection, most state-of-the-art machine translation systems take only minimal contextual information into account. We capture context with a topic model over distributional profiles built from the context words of each translation unit. Topic distributions are inferred for each translation unit and used to adapt the translation model dynamically to a given test context by measuring their similarity. We show that combining information from both local and global test contexts helps to improve lexical selection and outperforms a baseline system by up to 1.15 B LEU. We test our topic-adapted model on a diverse data set containing documents from three different domains and achieve competitive performance in comparison with two supervised domain-adapted systems.
Original language | English |
---|---|
Title of host publication | Proceedings of the Ninth Workshop on Statistical Machine Translation |
Place of Publication | Baltimore, Maryland, USA |
Publisher | Association for Computational Linguistics |
Pages | 445-456 |
Number of pages | 12 |
DOIs | |
Publication status | Published - 1 Jun 2014 |
Event | Ninth Workshop on Statistical Machine Translation - Baltimore, United States Duration: 26 Jun 2014 → 27 Jun 2014 http://www.statmt.org/wmt14/ |
Conference
Conference | Ninth Workshop on Statistical Machine Translation |
---|---|
Country/Territory | United States |
City | Baltimore |
Period | 26/06/14 → 27/06/14 |
Internet address |
Fingerprint
Dive into the research topics of 'Dynamic Topic Adaptation for SMT using Distributional Profiles'. Together they form a unique fingerprint.Projects
- 2 Finished
Profiles
-
Barry Haddow
- School of Informatics - Senior Research Fellow
- Institute of Language, Cognition and Computation
- Language, Interaction, and Robotics
Person: Academic: Research Active (Research Assistant)