首页> 外文会议>Conference on the North American Chapter of the Association for Computational Linguistics: Human Language Technologies >Black is to Criminal as Caucasian is to Police: Detecting and Removing Multiclass Bias in Word Embeddings
【24h】

Black is to Criminal as Caucasian is to Police: Detecting and Removing Multiclass Bias in Word Embeddings

机译:黑人是罪犯,白种人是警察:侦查和消除单词嵌入中的多类偏见

获取原文

摘要

Online texts-across genres, registers, domains, and styles-are riddled with human stereotypes, expressed in overt or subtle ways. Word embeddings, trained on these texts, perpetuate and amplify these stereotypes, and propagate biases to machine learning models that use word embeddings as features. In this work, we propose a method to debias word embeddings in multiclass settings such as race and religion, extending the work of (Boluk-basi et al., 2016) from the binary setting, such as binary gender. Next, we propose a novel methodology for the evaluation of multiclass debiasing. We demonstrate that our multiclass debiasing is robust and maintains the efficacy in standard NLP tasks.
机译:各种体裁,注册,领域和样式的在线文本都充斥着人类的刻板印象,以明显或微妙的方式表达出来。在这些文本上受过训练的词嵌入,使这些陈规定型观念得以延续和扩大,并向使用词嵌入作为特征的机器学习模型传播偏见。在这项工作中,我们提出了一种在种族和宗教等多类环境中消除单词嵌入偏差的方法,将(Boluk-basi et al。,2016)的工作从二进制环境(例如性别性别)扩展了出来。接下来,我们提出了一种用于评估多类去偏置的新颖方法。我们证明了我们的多类去偏置功能强大,并且可以在标准NLP任务中保持效力。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号