Abstract
Statistical machine translation (SMT) treats the translation of natural language as a machine learning problem. By examining many samples of human-produced translation, SMT algorithms automatically learn how to translate. SMT has made tremendous strides in less than two decades, and new ideas are constantly introduced. This survey presents a tutorial overview of the state of the art. We describe the context of the current research and then move to a formal problem description and an overview of the main subproblems: translation modeling, parameter estimation, and decoding. Along the way, we present a taxonomy of some different approaches within these areas. We conclude with an overview of evaluation and a discussion of future directions.
| Original language | English |
|---|---|
| Article number | 8 |
| Pages (from-to) | 1-49 |
| Number of pages | 49 |
| Journal | ACM Computing Surveys |
| Volume | 40 |
| Issue number | 3 |
| DOIs | |
| Publication status | Published - 1 Aug 2008 |
Keywords / Materials (for Non-textual outputs)
- Natural language processing, machine translation
Fingerprint
Dive into the research topics of 'Statistical Machine Translation'. Together they form a unique fingerprint.Cite this
- APA
- Author
- BIBTEX
- Harvard
- Standard
- RIS
- Vancouver