Constraining the Phrase-based, Joint Probability Statistical Translation Model

Alexandra Birch, Chris Callison-Burch, Miles Osborne, Philipp Koehn

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Abstract

The joint probability model proposed by Marcu and Wong (2002) provides a strong probabilistic framework for phrase-based statistical machine translation (SMT). The model's usefulness is, however, limited by the computational complexity of estimating parameters at the phrase level. We present the first model to use word alignments for constraining the space of phrasal alignments searched during Expectation Maximization (EM) training. Constraining the joint model improves performance, showing results that are very close to state-of-the-art phrase-based models. It also allows it to scale up to larger corpora and therefore be more widely applicable.
Original languageEnglish
Title of host publicationProceedings of the Workshop on Statistical Machine Translation
Place of PublicationStroudsburg, PA, USA
PublisherAssociation for Computational Linguistics
Pages154-157
Number of pages4
Publication statusPublished - 2006

Fingerprint Dive into the research topics of 'Constraining the Phrase-based, Joint Probability Statistical Translation Model'. Together they form a unique fingerprint.

Cite this