Language processing for arabic microblog retrieval

Kareem Darwish, Walid Magdy, Ahmed Mourad

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Abstract

The use of social media has profoundly affected social and political dynamics in the Arab world. In this paper, we explore the Arabic microblogs retrieval. We illustrate some of the challenges associated with Arabic microblog retrieval, which mainly stem from the use of different Arabic dialects that vary in lexical selection, morphology, and phonetics and lack orthographic and spelling conventions. We present some of the required processing for effective retrieval such as improved letter normalization, elongated word handling, stopword removal, and stemming.
Original languageEnglish
Title of host publication21st ACM International Conference on Information and Knowledge Management, CIKM'12, Maui, HI, USA, October 29 - November 02, 2012
PublisherACM
Pages2427-2430
Number of pages4
ISBN (Print)978-1-4503-1156-4
DOIs
Publication statusPublished - Nov 2012

Fingerprint Dive into the research topics of 'Language processing for arabic microblog retrieval'. Together they form a unique fingerprint.

Cite this