首页> 外文会议>International joint conference on natural language processing;Conference on empirical methods in natural language processing >It's All in the Name: Mitigating Gender Bias with Name-Based Counterfactual Data Substitution
【24h】

It's All in the Name: Mitigating Gender Bias with Name-Based Counterfactual Data Substitution

机译:一切都在名称中:通过基于名称的反事实数据替代来缓解性别偏见

获取原文

摘要

This paper treats gender bias latent in word embeddings. Previous mitigation attempts rely on the operationalisation of gender bias as a projection over a linear subspace. An alternative approach is Counter/actual Data Augmentation (CDA), in which a corpus is duplicated and augmented to remove bias, e.g. by swapping all inherently-gendered words in the copy. We perform an empirical comparison of these approaches on the English Gigaword and Wikipedia, and find that whilst both successfully reduce direct bias and perform well in tasks which quantify embedding quality, CDA variants outperform projection-based methods at the task of drawing non-biased gender analogies by an average of 19% across both corpora. We propose two improvements to CDA: Counterfactual Data Substitution (CDS), a variant of CDA in which potentially biased text is randomly substituted to avoid duplication, and the Names Intervention, a novel name-pairing technique that vastly increases the number of words being treated. CDA/S with the Names Intervention is the only approach which is able to mitigate indirect gender bias: following de-biasing, previously biased words are significantly less clustered according to gender (cluster purity is reduced by 49%), thus improving on the state-of-the-art for bias mitigation.
机译:本文研究了词嵌入中潜在的性别偏见。先前的缓解尝试依赖于将性别偏见的可操作性作为线性子空间上的投影。另一种方法是计数器/实际数据增强(CDA),其中复制并增强语料库以消除偏见,例如通过交换副本中所有固有性别的单词。我们在英语Gigaword和Wikipedia上对这些方法进行了实证比较,发现尽管成功地减少了直接偏差并且在量化嵌入质量的任务中表现良好,但CDA变体在绘制无偏见性别的任务上胜过了基于投影的方法。两种语料库的平均比喻为19%。我们建议对CDA进行两项改进:对抗事实数据替换(CDS),一种CDA的变体,其中为了避免重复而随机替换有潜在偏见的文本,以防止重复;以及Names Intervention,一种新颖的名称配对技术,可大大增加要处理的单词数。具有名称干预功能的CDA / S是唯一能够减轻间接性别偏见的方法:去偏后,以前偏向的单词根据性别的聚类程度明显降低(群集纯度降低了49%),从而改善了状态最先进的缓解偏见。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号