Joshua 3.0: Syntax-based Machine Translation with the Thrax Grammar Extractor

Jonathan Weese, Juri Ganitkevitch, Chris Callison-Burch, Matt Post, Adam Lopez

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Abstract

We present progress on Joshua, an open-source decoder for hierarchical and syntax-based machine translation. The main focus is describing Thrax, a flexible, open source synchronous context-free grammar extractor. Thrax extracts both hierarchical (Chiang, 2007) and syntax-augmented machine translation (Zollmann and Venugopal, 2006) grammars. It is built on Apache Hadoop for efficient distributed performance, and can easily be extended with support for new grammars, feature functions, and output formats.
Original languageEnglish
Title of host publicationProceedings of the Sixth Workshop on Statistical Machine Translation
Place of PublicationEdinburgh, Scotland
PublisherAssociation for Computational Linguistics
Pages478-484
Number of pages7
Publication statusPublished - 1 Jul 2011

Fingerprint

Dive into the research topics of 'Joshua 3.0: Syntax-based Machine Translation with the Thrax Grammar Extractor'. Together they form a unique fingerprint.

Cite this