首页> 外文会议>Annual meeting of the Association for Computational Linguistics >Incorporating Priors with Feature Attribution on Text Classification
【24h】

Incorporating Priors with Feature Attribution on Text Classification

机译:将先验与文本分类的特征归因相结合

获取原文

摘要

Feature attribution methods, proposed recently, help users interpret the predictions of complex models. Our approach integrates feature attributions into the objective function to allow machine learning practitioners to incorporate priors in model building. To demonstrate the effectiveness our technique, we apply it to two tasks: (1) mitigating unintended bias in text classifiers by neutralizing identity terms; (2) improving classifier performance in a scarce data setting by forcing the model to focus on toxic terms. Our approach adds an L_(2) distance loss between feature attributions and task-specific prior values to the objective. Our experiments show that i) a classifier trained with our technique reduces undesired model biases without a tradeoff on the original task; ii) incorporating priors helps model performance in scarce data settings.
机译:最近提出的特征归因方法可以帮助用户解释复杂模型的预测。我们的方法将特征归因集成到目标函数中,以允许机器学习从业人员将先验知识纳入模型构建中。为了证明我们的技术的有效性,我们将其应用于两个任务:(1)通过中和身份术语来缓解文本分类器中的意外偏见; (2)通过强制模型将重点放在有毒术语上来提高分类器在稀缺数据集中的性能。我们的方法在目标的特征归因和特定于任务的先验值之间增加了L_(2)距离损失。我们的实验表明:i)使用我们的技术训练的分类器可以减少不需要的模型偏差,而无需权衡原始任务; ii)合并先验信息有助于在稀缺数据设置中对性能进行建模。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号