Abstract / Description of output
An increasingly prevalent problem for intelligent technologies is text safety, as uncontrolled systems may generate recommendations to their users that lead to injury or life-threatening consequences. However, the degree of explicitness of a generated statement that can cause physical harm varies. In this paper, we distinguish types of text that can lead to physical harm and establish one particularly underexplored category: covertly unsafe text. Then, we further break down this category with respect to the system’s information and discuss solutions to mitigate the generation of text in each of these subcategories. Ultimately, our work defines the problem of covertly unsafe language that causes physical harm and argues that this subtle yet dangerous issue needs to be prioritized by stakeholders and regulators. We highlight mitigation strategies to inspire future researchers to tackle this challenging problem and help improve safety within smart systems.
Original language | English |
---|---|
Title of host publication | Findings of the Association for Computational Linguistics: EMNLP 2022 |
Editors | Yoav Goldberg, Zornitsa Kozareva, Yue Zhang |
Place of Publication | Abu Dhabi, United Arab Emirates |
Publisher | Association for Computational Linguistics |
Pages | 2914–2926 |
Number of pages | 13 |
Edition | 3 |
ISBN (Electronic) | 9781959429432 |
DOIs | |
Publication status | Published - 11 Dec 2022 |
Event | The 2022 Conference on Empirical Methods in Natural Language Processing - Abu Dhabi National Exhibition Centre, Abu Dhabi, United Arab Emirates Duration: 7 Dec 2022 → 11 Dec 2022 Conference number: 27 https://2022.emnlp.org/ |
Publication series
Name | Findings of the Association for Computational Linguistics |
---|---|
Publisher | ACL |
ISSN (Print) | 0891-2017 |
ISSN (Electronic) | 1530-9312 |
Conference
Conference | The 2022 Conference on Empirical Methods in Natural Language Processing |
---|---|
Abbreviated title | EMNLP 2022 |
Country/Territory | United Arab Emirates |
City | Abu Dhabi |
Period | 7/12/22 → 11/12/22 |
Internet address |