Abstract
All language is characterised by variation which language users employ to construct complex social identities and express social meaning. Like other machine learning technologies, speech and language technologies (re)produce structural oppression when they perform worse for marginalised language communities. Using knowledge and theories from sociolinguistics, I explore why commercial automatic speech recognition systems and other language technologies perform significantly worse for already marginalised populations, such as second-language speakers and speakers of stigmatised varieties of English in the British Isles. Situating language technologies within the broader scholarship around algorithmic bias, consider the allocative and representational harms they can cause even (and perhaps especially) in systems which do not exhibit predictive bias, narrowly defined as differential performance between groups. This raises the question whether addressing or “fixing” this “bias” is actually always equivalent to mitigating the harms algorithmic systems can cause, in particular to marginalised communities.
Original language | English |
---|---|
Title of host publication | Proceedings of 2022 5th ACM Conference on Fairness, Accountability, and Transparency (FAccT 2022) |
Publisher | ACM Association for Computing Machinery |
Pages | 521-534 |
Number of pages | 14 |
ISBN (Electronic) | 978-1-4503-7252-7 |
DOIs | |
Publication status | Published - 20 Jun 2022 |
Event | 5th Annual ACM Conference on Fairness, Accountability, and Transparency - Seoul, Korea, Democratic People's Republic of Duration: 21 Jun 2022 → 24 Jun 2022 Conference number: 5 https://facctconference.org/2022/index.html |
Conference
Conference | 5th Annual ACM Conference on Fairness, Accountability, and Transparency |
---|---|
Abbreviated title | FAccT 2022 |
Country/Territory | Korea, Democratic People's Republic of |
City | Seoul |
Period | 21/06/22 → 24/06/22 |
Internet address |
Keywords
- algorithmic bias
- speech and language technologies
- language variation
- speech recognition