【24h】

Attenuating Bias in Word vectors

机译:减轻字向量中的偏差

获取原文
           

摘要

Word vector representations are well developed tools for various NLP and Machine Learning tasks and are known to retain significant semantic and syntactic structure of languages. But they are prone to carrying and amplifying bias which can perpetrate discrimination in various applications. In this work, we explore new simple ways to detect the most stereotypically gendered words in an embedding and remove the bias from them. We verify how names are masked carriers of gender bias and then use that as a tool to attenuate bias in embeddings. Further, we extend this property of names to show how names can be used to detect other types of bias in the embeddings such as bias based on race, ethnicity, and age.
机译:词向量表示法是用于各种NLP和机器学习任务的完善工具,已知保留重要的语言语义和句法结构。但是它们易于携带和扩大偏见,这会在各种应用中造成歧视。在这项工作中,我们探索了新的简单方法来检测嵌入中最陈规定型性别的单词,并消除其中的偏见。我们验证姓名如何掩盖性别偏见的载体,然后将其用作减轻嵌入偏见的工具。此外,我们扩展了名称的此属性,以显示名称如何用于检测嵌入中的其他类型的偏差,例如基于种族,种族和年龄的偏差。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号