Venue: Conference of the European Chapter of the Association for Computational Linguistics

Stereotype and Skew: Quantifying Gender Bias in Pre-trained and Fine-tuned Language Models



Abstract

This paper proposes two intuitive metrics, skew and stereotype, that quantify and analyse the gender bias present in contextual language models when tackling the WinoBias pronoun resolution task. We find evidence that gender stereotype correlates approximately negatively with gender skew in out-of-the-box models, suggesting that there is a trade-off between these two forms of bias. We investigate two methods to mitigate bias. The first approach is an online method which is effective at removing skew at the expense of stereotype. The second, inspired by previous work on ELMo, involves the fine-tuning of BERT using an augmented gender-balanced dataset. We show that this reduces both skew and stereotype relative to its unaugmented fine-tuned counterpart. However, we find that existing gender bias benchmarks do not fully probe professional bias as pronoun resolution may be obfuscated by cross-correlations from other manifestations of gender prejudice. Our code is available online.
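To make the two metrics concrete, here is a minimal illustrative sketch. The abstract does not give the formulas, so the definitions below are assumptions: skew is taken as the deviation from parity in how often the model resolves pronouns to the male versus the female entity, and stereotype as the accuracy gap between pro-stereotypical and anti-stereotypical WinoBias-style examples.

```python
# Hedged sketch of skew and stereotype scores for a WinoBias-style
# pronoun-resolution evaluation. The paper's exact definitions may differ.

from dataclasses import dataclass


@dataclass
class Example:
    pro_stereotypical: bool  # gold resolution matches the occupational stereotype
    predicted_male: bool     # model resolved the pronoun to the male entity
    correct: bool            # prediction matched the gold antecedent


def skew(examples):
    """Absolute deviation from a 50/50 male/female resolution rate."""
    male_rate = sum(e.predicted_male for e in examples) / len(examples)
    return abs(male_rate - 0.5)


def stereotype(examples):
    """Accuracy on pro-stereotypical minus accuracy on anti-stereotypical items."""
    pro = [e for e in examples if e.pro_stereotypical]
    anti = [e for e in examples if not e.pro_stereotypical]
    acc = lambda xs: sum(e.correct for e in xs) / len(xs)
    return acc(pro) - acc(anti)


data = [
    Example(True, True, True),
    Example(True, False, True),
    Example(False, True, False),
    Example(False, False, True),
]
print(skew(data))        # 0.0: resolutions are balanced in this toy sample
print(stereotype(data))  # 0.5: pro-stereotypical items are resolved more accurately
```

Under these assumed definitions, a model can be balanced overall (zero skew) while still being highly stereotyped, which is the trade-off the abstract describes.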
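The gender-balanced fine-tuning data mentioned in the abstract is typically produced by counterfactual augmentation: each sentence is paired with a copy whose gendered words are swapped. The paper's actual pipeline (word list, name handling, tokenisation) is not specified here, so this toy version using a small hand-picked pair list is an illustration only.

```python
# Hedged sketch of counterfactual gender-swap augmentation for building a
# gender-balanced fine-tuning corpus. The pair list and handling of names,
# possessives, etc. are simplifying assumptions, not the paper's method.

import re

SWAP_PAIRS = [("he", "she"), ("him", "her"),
              ("man", "woman"), ("father", "mother")]
SWAPS = {}
for a, b in SWAP_PAIRS:
    SWAPS[a] = b
    SWAPS[b] = a


def gender_swap(sentence: str) -> str:
    """Return a copy of the sentence with gendered words swapped."""
    def repl(match):
        word = match.group(0)
        swapped = SWAPS.get(word.lower(), word)
        return swapped.capitalize() if word[0].isupper() else swapped

    pattern = r"\b(" + "|".join(SWAPS) + r")\b"
    return re.sub(pattern, repl, sentence, flags=re.IGNORECASE)


def augment(corpus):
    """Gender-balanced corpus: each sentence plus its swapped counterpart."""
    return [s for sent in corpus for s in (sent, gender_swap(sent))]


print(gender_swap("The doctor said he would call."))
# → "The doctor said she would call."
```

Fine-tuning on the original sentences together with their swapped counterparts is what makes the augmented dataset gender-balanced.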
