Docent: A Document-Level Decoder for Phrase-Based Statistical Machine Translation

Christian Hardmeier, Sara Stymne, Jörg Tiedemann, Joakim Nivre

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Abstract / Description of output

We describe Docent, an open-source decoder for statistical machine translation that breaks with the usual sentence-by-sentence paradigm and translates complete documents as units. By taking translation to the document level, our decoder can handle feature models with arbitrary discourse-wide dependencies and constitutes an essential infrastructure component in the quest for discourse-aware SMT models.
Original languageEnglish
Title of host publicationProceedings of the 51st Annual Meeting of the Association for Computational Linguistics: System Demonstrations
Place of PublicationSofia, Bulgaria
PublisherAssociation for Computational Linguistics
Pages193-198
Number of pages6
Publication statusPublished - 9 Aug 2013
Event51st Annual Meeting of the Association for Computational Linguistics, ACL 2013 - Sofia, Bulgaria
Duration: 4 Aug 20139 Aug 2013

Conference

Conference51st Annual Meeting of the Association for Computational Linguistics, ACL 2013
Country/TerritoryBulgaria
CitySofia
Period4/08/139/08/13

Fingerprint

Dive into the research topics of 'Docent: A Document-Level Decoder for Phrase-Based Statistical Machine Translation'. Together they form a unique fingerprint.

Cite this