“Lightweight” semantic annotation of text calls for a simple representation, ideally without requiring a semantic lexicon to achieve good coverage in the language and domain. In this paper, we repurpose WordNet’s supersense tags for annotation, developing specific guidelines for nominal expressions and applying them to Arabic Wikipedia articles in four topical domains. The resulting corpus has high coverage and was completed quickly with reasonable inter-annotator agreement.
|Title of host publication|Proceedings of the 50th Annual Meeting of the Association for Computational Linguistics|
|Publisher|Association for Computational Linguistics|
|Number of pages|6|
|Publication status|Published - 2012|