Abstract / Description of output
Inclusions from other languages can be a significant source of errors for monolin-
gual parsers. We show this for English inclusions, which are sufficiently frequent to
present a problem when parsing German. We describe an annotation-free approach for accurately detecting such inclusions, and develop two methods for interfacing this approach with a state-of-the-art parser for German. An evaluation on the TIGER corpus shows that our inclusion entity model achieves a performance gain of 4.3 points in F-score over a baseline of no inclusion detection, and even outperforms a parser with access to gold standard part-of-speech tags.
gual parsers. We show this for English inclusions, which are sufficiently frequent to
present a problem when parsing German. We describe an annotation-free approach for accurately detecting such inclusions, and develop two methods for interfacing this approach with a state-of-the-art parser for German. An evaluation on the TIGER corpus shows that our inclusion entity model achieves a performance gain of 4.3 points in F-score over a baseline of no inclusion detection, and even outperforms a parser with access to gold standard part-of-speech tags.
Original language | English |
---|---|
Title of host publication | Proceedings of EMNLP-CoNLL 2007 |
Number of pages | 10 |
Publication status | Published - 2007 |
Keywords / Materials (for Non-textual outputs)
- seer,