Projects per year
Abstract
This paper introduces Chart Inference (CI), an algorithm for deriving a CCG category for an unknown word from a partial parse chart. It is shown to be faster and more precise than a baseline brute-force method, and to achieve wider coverage than a rule-based system. In addition, we show the application of CI to a domain adaptation task for question words, which are largely missing in the
Penn Treebank. When used in combination with self-training, CI increases the precision of the baseline StatCCG parser over subject-extraction questions by 50%. An error analysis shows that CI contributes to the increase by expanding the number of category types available to the parser, while self-training adjusts the counts.
Original language | English |
---|---|
Title of host publication | Proceedings of the 2011 Conference on Empirical Methods in Natural Language Processing, EMNLP 2011, 27-31 July 2011, John McIntyre Conference Centre, Edinburgh, UK, A meeting of SIGDAT, a Special Interest Group of the ACL |
Publisher | Association for Computational Linguistics |
Pages | 1246-1256 |
Number of pages | 11 |
Publication status | Published - 2011 |
Fingerprint
Dive into the research topics of 'Semi-supervised CCG Lexicon Extension'. Together they form a unique fingerprint.Projects
- 2 Finished
-
Xperience - 'Robotes Bootstrapped through Learning from Experience'
Steedman, M., Geib, C. & Petrick, R.
1/01/10 → 31/12/15
Project: Research