NegPar: a parallel corpus annotated for negation

Qianchu Liu, Federico Fancellu, Bonnie Webber

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Abstract

Although the existence of English corpora annotated for negation has allowed for extensive work on monolingual negation detection, little is understood on how negation-related phenomena translate across languages. The current study fills this gap by presenting NegPar, the first English-Chinese parallel corpus annotated for negation in the narrative domain (a collection of stories from Conan Doyle's Sherlock Holmes). While we followed the annotation guidelines in the ConanDoyleNeg corpus (Morante and Daelemans, 2012), we reannotated certain scope-related phenomena to ensure more consistent and interpretable semantic representation. To both ease the annotation process and analyze how similar negation is signaled in the two languages, we experimented with first projecting the annotations from English and then manually correcting the projection output in Chinese. Results show that projecting negation via word-alignment offers limited help to the annotation process, as negation can be rendered in different ways across languages.
Original languageEnglish
Title of host publicationProceedings of the Eleventh International Conference on Language Resources and Evaluation (LREC 2018)
Place of PublicationMiyazaki, Japan
PublisherEuropean Language Resources Association (ELRA)
Pages3464-3472
Number of pages9
ISBN (Electronic)979-10-95546-00-9
Publication statusE-pub ahead of print - 12 May 2018
Event11th Edition of the Language Resources and Evaluation Conference - Miyazaki, Japan
Duration: 7 May 201812 May 2018
http://lrec2018.lrec-conf.org/en/

Conference

Conference11th Edition of the Language Resources and Evaluation Conference
Abbreviated titleLREC 2018
CountryJapan
CityMiyazaki
Period7/05/1812/05/18
Internet address

Fingerprint Dive into the research topics of 'NegPar: a parallel corpus annotated for negation'. Together they form a unique fingerprint.

Cite this