首页> 外文会议>AAAI Conference on Artificial Intelligence >FuzzE: Fuzzy Fairness Evaluation of Offensive Language Classifiers on African-American English
【24h】

FuzzE: Fuzzy Fairness Evaluation of Offensive Language Classifiers on African-American English

机译:Fuzze:非洲裔美国英语攻击性语言分类器的模糊公平评价

获取原文

摘要

Hate speech and offensive language are rampant on social media. Machine learning has provided a way to moderate foul language at scale. However, much of the current research focuses on overall performance. Models may perform poorly on text written in a minority dialectal language. For instance, a hate speech classifier may produce more false positives on tweets written in African-American Vernacular English (AAVE). To measure these problems, we need text written in both AAVE and Standard American English (SAE). Unfortunately, it is challenging to curate data for all linguistic styles in a timely manner - especially when we are constrained to specific problems, social media platforms, or by limited resources. In this paper, we answer the question, "How can we evaluate the performance of classifiers across minority dialectal languages when they are not present within a particular dataset?" Specifically, we propose an automated fairness fuzzing tool called FuzzE to quantify the fairness of text classifiers applied to AAVE text using a dataset that only contains text written in SAE. Overall, we find that the fairness estimates returned by our technique moderately correlates with the use of real ground-truth AAVE text. Warning: Offensive language is displayed in this manuscript.
机译:讨厌的言论和攻击性语言在社交媒体上猖獗。机器学习提供了一种在规模中适度犯规的方法。然而,大部分目前的研究侧重于整体性能。模型可能在少数辩角语言语言中写的文本上表现不佳。例如,仇恨语音分类器可能会在非洲裔美国白话英语(AAVE)编写的推文上产生更多误报。为了衡量这些问题,我们需要在Aave和标准美国英语(SAE)中写的文本。不幸的是,及时策划所有语言风格的数据充满挑战 - 特别是当我们被限制为特定问题,社交媒体平台或有限的资源时。在本文中,我们回答了这个问题,“我们如何在特定数据集中不存在时评估少数群体方言语语言语言语言的分类器的性能?”具体而言,我们提出了一种被称为Fuzze的自动公平性模糊工具,以使用仅包含在SAE中写入的文本的数据集来量化文本分类器的公平性。总体而言,我们发现我们的技术返回的公平估计与使用真正的地面真理AAVE文本时期恢复。警告:此手稿中显示了令人反感的语言。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号