Abstract
The use of social media has profoundly affected social and political dynamics in the Arab world. In this paper, we explore the Arabic microblogs retrieval. We illustrate some of the challenges associated with Arabic microblog retrieval, which mainly stem from the use of different Arabic dialects that vary in lexical selection, morphology, and phonetics and lack orthographic and spelling conventions. We present some of the required processing for effective retrieval such as improved letter normalization, elongated word handling, stopword removal, and stemming.
| Original language | English |
|---|---|
| Title of host publication | 21st ACM International Conference on Information and Knowledge Management, CIKM'12, Maui, HI, USA, October 29 - November 02, 2012 |
| Publisher | ACM |
| Pages | 2427-2430 |
| Number of pages | 4 |
| ISBN (Print) | 978-1-4503-1156-4 |
| DOIs | |
| Publication status | Published - Nov 2012 |
Fingerprint
Dive into the research topics of 'Language processing for arabic microblog retrieval'. Together they form a unique fingerprint.Cite this
- APA
- Author
- BIBTEX
- Harvard
- Standard
- RIS
- Vancouver