Abstract / Description of output
We describe the findings of the fifth Nuanced Arabic Dialect Identification Shared Task (NADI 2024). NADI's objective is to help advance SoTA Arabic NLP by providing guidance, datasets, modeling opportunities, and standardized evaluation conditions that allow researchers to collaboratively compete on pre-specified tasks. NADI 2024 targeted both dialect identification cast as a multi-label task (Subtask~1), identification of the Arabic level of dialectness (Subtask~2), and dialect-to-MSA machine translation (Subtask~3). A total of 51 unique teams registered for the shared task, of whom 12 teams have participated (with 76 valid submissions during the test phase). Among these, three teams participated in Subtask~1, three in Subtask~2, and eight in Subtask~3. The winning teams achieved 50.57 F\textsubscript{1} on Subtask~1, 0.1403 RMSE for Subtask~2, and 20.44 BLEU in Subtask~3, respectively. Results show that Arabic dialect processing tasks such as dialect identification and machine translation remain challenging. We describe the methods employed by the participating teams and briefly offer an outlook for NADI.
Original language | English |
---|---|
Title of host publication | Proceedings of The Second Arabic Natural Language Processing Conference |
Editors | Nizar Habash, Houda Bouamor, Ramy Eskander, Nadi Tomeh, Ibrahim Abu Farha, Ahmed Abdelali, Samia Touileb, Injy Hamed, Yaser Onaizan, Bashar Alhafni, Wissam Antoun, Salam Khalifa, Hatem Haddad, Imed Zitouni, Badr AlKhamissi, Rawan Almatham, Khalil Mrini |
Publisher | Association for Computational Linguistics (ACL) |
Pages | 709-728 |
Number of pages | 20 |
Edition | 2 |
ISBN (Electronic) | 9798891761322 |
DOIs | |
Publication status | Published - 16 Aug 2024 |
Event | The Second Arabic Natural Language Processing Conference - Hybrid, Bangkok, Thailand Duration: 16 Aug 2024 → 16 Aug 2024 Conference number: 2 https://arabicnlp2024.sigarab.org/ |
Conference
Conference | The Second Arabic Natural Language Processing Conference |
---|---|
Abbreviated title | ArabicNLP 2024 |
Country/Territory | Thailand |
City | Bangkok |
Period | 16/08/24 → 16/08/24 |
Internet address |
Keywords / Materials (for Non-textual outputs)
- computation and language
- artificial intelligence