Evaluating Informal-Domain Word Representations With UrbanDictionary

Naomi Saphra, Adam Lopez

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Abstract / Description of output

Existing corpora for intrinsic evaluation are not targeted towards tasks in informal domains such as Twitter or news comment forums. We want to test whether a representation of informal words fulfills the promise of eliding explicit text normalization as a preprocessing step. One possible evaluation metric for such domains is the proximity of spelling variants. We propose how such a metric might be computed and how a spelling variant dataset can be collected using UrbanDictionary.
Original languageEnglish
Title of host publicationProceedings of the 1st Workshop on Evaluating Vector Space Representations for NLP
PublisherAssociation for Computational Linguistics
Pages94-98
Number of pages5
ISBN (Electronic)978-1-945626-14-2
DOIs
Publication statusPublished - 12 Aug 2016
Event1st Workshop on Evaluating Vector Space Representations for NLP - Berlin, Germany
Duration: 12 Aug 201612 Aug 2016
https://sites.google.com/site/repevalacl16/home
https://sites.google.com/site/repevalacl16/home

Conference

Conference1st Workshop on Evaluating Vector Space Representations for NLP
Abbreviated titleRepEval 2016
Country/TerritoryGermany
CityBerlin
Period12/08/1612/08/16
Internet address

Fingerprint

Dive into the research topics of 'Evaluating Informal-Domain Word Representations With UrbanDictionary'. Together they form a unique fingerprint.

Cite this