Multitask Learning for Arabic Offensive Language and Hate-Speech Detection

Ibrahim Abu Farha, Walid Magdy

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Abstract / Description of output

Offensive language and hate-speech are phenomena that spread with the rising popularity of social media. Detecting such content is crucial for understanding and predicting conflicts, understanding polarisation among communities and providing means and tools to filter or block inappropriate content. This paper describes the SMASH team submission to OSACT4’s shared task on hate-speech and offensive language detection, where we explore different approaches to perform these tasks. The experiments cover a variety of approaches that include deep learning, transfer learning and multitask learning. We also explore the utilisation of sentiment information to perform the previous task. Our best model is a multitask learning architecture, based on CNN-BiLSTM, that was trained to detect hate-speech and offensive language and predict sentiment.
Original languageEnglish
Title of host publicationProceedings of the 4th Workshop on Open-Source Arabic Corpora and Processing Tools
PublisherEuropean Language Resources Association (ELRA)
Chapter90
Pages86
Number of pages5
ISBN (Electronic)979-10-95546-51-1
Publication statusPublished - 12 May 2020
EventThe 4th Workshop on Open-Source Arabic Corpora and Processing Tools - Marseille, France
Duration: 12 May 202012 May 2020
http://edinburghnlp.inf.ed.ac.uk/workshops/OSACT4/

Workshop

WorkshopThe 4th Workshop on Open-Source Arabic Corpora and Processing Tools
Abbreviated titleOSACT4
Country/TerritoryFrance
CityMarseille
Period12/05/2012/05/20
Internet address

Keywords / Materials (for Non-textual outputs)

  • Arabic
  • hate-speech
  • offensive language

Fingerprint

Dive into the research topics of 'Multitask Learning for Arabic Offensive Language and Hate-Speech Detection'. Together they form a unique fingerprint.

Cite this