
Detoxifying Language Models Risks Marginalizing Minority Voices

Abstract

Language models (LMs) must be both safe and equitable to be responsibly deployed in practice. With safety in mind, numerous detoxification techniques (e.g., Dathathri et al. 2020; Krause et al. 2020) have been proposed to mitigate toxic LM generations. In this work, we show that these detoxification techniques hurt equity: they decrease the utility of LMs on language used by marginalized groups (e.g., African-American English and minority identity mentions). In particular, we perform automatic and human evaluations of text generation quality when LMs are conditioned on inputs with different dialects and group identifiers. We find that detoxification makes LMs more brittle to distribution shift, especially on language used by marginalized groups. We identify that these failures stem from detoxification methods exploiting spurious correlations in toxicity datasets. Overall, our results highlight the tension between the controllability and distributional robustness of LMs.
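To give intuition for the automatic evaluation described in the abstract, here is a minimal sketch (not the authors' exact pipeline) that compares a language model's perplexity on prompts written in different dialects; the base model, the example prompts, and the perplexity helper are illustrative assumptions, and a real study would use the detoxified models and dialect corpora (e.g., African-American English text) considered in the paper.

```python
# Minimal sketch: compare an LM's perplexity on prompts from different dialects,
# as a proxy for how detoxification affects utility on marginalized-group language.
# Assumptions: base GPT-2 stands in for a (detoxified) model; prompts are hypothetical.
import torch
from transformers import GPT2LMHeadModel, GPT2TokenizerFast

model_name = "gpt2"  # stand-in; the paper evaluates detoxified LM variants
tokenizer = GPT2TokenizerFast.from_pretrained(model_name)
model = GPT2LMHeadModel.from_pretrained(model_name)
model.eval()

def perplexity(text: str) -> float:
    """Token-level perplexity of `text` under the model."""
    enc = tokenizer(text, return_tensors="pt")
    with torch.no_grad():
        out = model(**enc, labels=enc["input_ids"])
    return torch.exp(out.loss).item()

# Hypothetical prompts; a real evaluation would draw from dialect-labeled corpora.
prompts = {
    "dialect_A": "He woke up real early this mornin and got to work",
    "dialect_B": "He woke up very early this morning and began working",
}

for dialect, text in prompts.items():
    print(f"{dialect}: perplexity = {perplexity(text):.2f}")
```

Under this kind of comparison, a detoxified model that assigns systematically higher perplexity to one dialect than to another signals the loss of utility on that dialect that the abstract describes.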
