Using foreign inclusion detection to improve parsing performance

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Abstract

Inclusions from other languages can be a significant source of errors for monolin-
gual parsers. We show this for English inclusions, which are sufficiently frequent to
present a problem when parsing German. We describe an annotation-free approach for accurately detecting such inclusions, and develop two methods for interfacing this approach with a state-of-the-art parser for German. An evaluation on the TIGER corpus shows that our inclusion entity model achieves a performance gain of 4.3 points in F-score over a baseline of no inclusion detection, and even outperforms a parser with access to gold standard part-of-speech tags.
Original languageEnglish
Title of host publicationProceedings of EMNLP-CoNLL 2007
Number of pages10
Publication statusPublished - 2007

Keywords

  • seer,

Cite this