Abstract / Description of output
Algorithmic oppression is an urgent and persistent problem in speech and language technologies. Considering power relations embedded in datasets before compiling or using them to train or test speech and language technologies is essential to designing less harmful, more just technologies. This paper presents a reflective exercise to recognise and challenge gaps and the power relations they reveal in speech and language datasets by applying principles of Data Feminism and Design Justice, and building on work on dataset documentation and sociolinguistics.
Original language | English |
---|---|
Title of host publication | Proceedings of the Second Workshop on Language Technology for Equality, Diversity and Inclusion |
Editors | Bharathi Raja Chakravarthi, B Bharathi, John P. McCrae, Manel Zarrouk, Kalika Bali, Paul Buitelaar |
Place of Publication | Dublin, Ireland |
Publisher | Association for Computational Linguistics |
Pages | 1-12 |
Number of pages | 12 |
ISBN (Electronic) | 978-1-955917-43-8 |
DOIs | |
Publication status | Published - 3 Jun 2022 |
Event | 2nd Workshop on Language Technology for Equality, Diversity, Inclusion 2022 - Dublin, Ireland Duration: 27 May 2022 → 27 May 2022 Conference number: 2 https://sites.google.com/view/lt-edi-2022/home |
Workshop
Workshop | 2nd Workshop on Language Technology for Equality, Diversity, Inclusion 2022 |
---|---|
Abbreviated title | LT-EDI 2022 |
Country/Territory | Ireland |
City | Dublin |
Period | 27/05/22 → 27/05/22 |
Internet address |