Abstract / Description of output
Adding syntactic labels to synchronous context-free translation rules can improve performance, but labeling with phrase structure constituents, as in GHKM (Galley et al., 2004), excludes potentially useful translation rules. SAMT (Zollmann and Venugopal, 2006) introduces heuristics to create new non-constituent labels, but these heuristics introduce many complex labels and tend to add rarely-applicable rules to the translation grammar. We introduce a labeling scheme based on categorial grammar, which allows syntactic labeling of many rules with a minimal, well-motivated label set. We show that our labeling scheme performs comparably to SAMT on an Urdu–English translation task, yet the label set is an order of magnitude smaller, and translation is twice as fast.
Original language | English |
---|---|
Title of host publication | Proceedings of the Seventh Workshop on Statistical Machine Translation |
Place of Publication | Montréal, Canada |
Publisher | Association for Computational Linguistics |
Pages | 222-231 |
Number of pages | 10 |
Publication status | Published - 1 Jun 2012 |