Abstract / Description of output
Offensive language and hate-speech are phenomena that spread with the rising popularity of social media. Detecting such content is crucial for understanding and predicting conflicts, understanding polarisation among communities and providing means and tools to filter or block inappropriate content. This paper describes the SMASH team submission to OSACT4’s shared task on hate-speech and offensive language detection, where we explore different approaches to perform these tasks. The experiments cover a variety of approaches that include deep learning, transfer learning and multitask learning. We also explore the utilisation of sentiment information to perform the previous task. Our best model is a multitask learning architecture, based on CNN-BiLSTM, that was trained to detect hate-speech and offensive language and predict sentiment.
Original language | English |
---|---|
Title of host publication | Proceedings of the 4th Workshop on Open-Source Arabic Corpora and Processing Tools |
Publisher | European Language Resources Association (ELRA) |
Chapter | 90 |
Pages | 86 |
Number of pages | 5 |
ISBN (Electronic) | 979-10-95546-51-1 |
Publication status | Published - 12 May 2020 |
Event | The 4th Workshop on Open-Source Arabic Corpora and Processing Tools - Marseille, France Duration: 12 May 2020 → 12 May 2020 http://edinburghnlp.inf.ed.ac.uk/workshops/OSACT4/ |
Workshop
Workshop | The 4th Workshop on Open-Source Arabic Corpora and Processing Tools |
---|---|
Abbreviated title | OSACT4 |
Country/Territory | France |
City | Marseille |
Period | 12/05/20 → 12/05/20 |
Internet address |
Keywords / Materials (for Non-textual outputs)
- Arabic
- hate-speech
- offensive language