Part-of-Speech Tagging for Twitter: Annotation, Features, and Experiments

Kevin Gimpel, Nathan Schneider, Brendan O'Connor, Dipanjan Das, Daniel Mills, Jacob Eisenstein, Michael Heilman, Dani Yogatama, Jeffrey Flanigan, Noah A. Smith

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Abstract

We address the problem of part-of-speech tagging for English data from the popular microblogging service Twitter. We develop a tagset, annotate data, develop features, and report tagging results nearing 90% accuracy. The data and tools have been made available to the research community with the goal of enabling richer text analysis of Twitter and related social media data sets.
Original languageEnglish
Title of host publicationProceedings of the 49th Annual Meeting of the Association for Computational Linguistics
PublisherAssociation for Computational Linguistics
Pages42-47
Number of pages6
Publication statusPublished - 2011

Fingerprint Dive into the research topics of 'Part-of-Speech Tagging for Twitter: Annotation, Features, and Experiments'. Together they form a unique fingerprint.

Cite this