首页> 外文会议>International Conference on Machine Learning >Is There a Trade-Off Between Fairness and Accuracy? A Perspective Using Mismatched Hypothesis Testing
【24h】

Is There a Trade-Off Between Fairness and Accuracy? A Perspective Using Mismatched Hypothesis Testing

机译:公平性和准确性之间是否有权衡? 使用非匹配假设检测的透视图

获取原文

摘要

A trade-off between accuracy and fairness is almost taken as a given in the existing literature on fairness in machine learning. Yet, it is not preordained that accuracy should decrease with increased fairness. Novel to this work, we examine fair classification through the lens of mismatched hypothesis testing: trying to find a classifier that distinguishes between two ideal distributions when given two mismatched distributions that are biased. Using Chernoff information, a tool in information theory, we theoretically demonstrate that, contrary to popular belief, there always exist ideal distributions such that optimal fairness and accuracy (with respect to the ideal distributions) are achieved simultaneously: there is no trade-off. Moreover, the same classifier yields the lack of a trade-off with respect to ideal distributions while yielding a trade-off when accuracy is measured with respect to the given (possibly biased) dataset. To complement our main result, we formulate an optimization to find ideal distributions and derive fundamental limits to explain why a trade-off exists on the given biased dataset. We also derive conditions under which active data collection can alleviate the fairness-accuracy trade-off in the real world. Our results lead us to contend that it is problematic to measure accuracy with respect to data that reflects bias, and instead, we should be considering accuracy with respect to ideal, unbiased data.
机译:准确性和公平性之间的权衡几乎是在机器学习的公平性的现有文献中的一个。然而,没有预先认为准确性应该随着公平而减少。对此作品进行了新颖,我们通过错配的假设检测镜头来检查公平分类:试图找到一个分类器,当给定两个错配的分布时,它们在偏置的两个不匹配的分布时。使用Chernoff Information,一种在信息理论中的工具,我们理论上证明,与流行的信念相反,总是存在理想的分布,使得最佳公平性和准确度(关于理想的分布)同时实现:没有权衡。此外,相同的分级器产生关于在相对于给定(可能偏置的)数据集的精度测量精度时产生折衷的理想分布的折衷。为了补充我们的主要结果,我们制定了优化,以找到理想的分布,并导出基本限制,以解释为什么在给定的偏见数据集上存在权衡。我们还导出了活动数据收集可以减轻现实世界中的公平准确性权衡的条件。我们的结果导致我们争辩说,衡量反映偏见的数据的准确性是有问题的,而且,我们应该考虑到理想,无偏的数据的准确性。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号