A Corpus and Model Integrating Multiword Expressions and Supersenses

Nathan Schneider, Noah A Smith

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

Abstract

This paper introduces a task of identifying and semantically classifying lexical expressions in running text. We investigate the online reviews genre, adding semantic supersense annotations to a 55,000-word English corpus that was previously annotated for multiword expressions. The noun and verb supersenses apply to full lexical expressions, whether single- or multiword. We then present a sequence tagging model that jointly infers lexical expressions and their supersenses. Results show that even with our relatively small training corpus in a noisy domain, the joint task can be performed to attain 70% class labeling

Original language: English
Title of host publication: A corpus and model integrating multiword expressions and supersenses
Pages: 1537-1547
Number of pages: 11
Publication status: Published - 2015
