Annual Meeting of the Association for Computational Linguistics

Learning to Faithfully Rationalize by Construction



Abstract

In many settings it is important for one to be able to understand why a model made a particular prediction. In NLP this often entails extracting snippets of an input text 'responsible for' corresponding model output; when such a snippet comprises tokens that indeed informed the model's prediction, it is a faithful explanation. In some settings, faithfulness may be critical to ensure transparency. Lei et al. (2016) proposed a model to produce faithful rationales for neural text classification by defining independent snippet extraction and prediction modules. However, the discrete selection over input tokens performed by this method complicates training, leading to high variance and requiring careful hyperparameter tuning. We propose a simpler variant of this approach that provides faithful explanations by construction. In our scheme, named FRESH, arbitrary feature importance scores (e.g., gradients from a trained model) are used to induce binary labels over token inputs, which an extractor can be trained to predict. An independent classifier module is then trained exclusively on snippets provided by the extractor; these snippets thus constitute faithful explanations, even if the classifier is arbitrarily complex. In both automatic and manual evaluations we find that variants of this simple framework yield predictive performance superior to 'end-to-end' approaches, while being more general and easier to train.
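The abstract's description of FRESH can be sketched in a few lines: importance scores over tokens induce a discrete snippet, and a separate classifier sees only that snippet, so the snippet is faithful by construction. The sketch below is a toy illustration under loose assumptions, not the authors' implementation: a word-length heuristic stands in for real feature-importance scores (e.g., gradients), and a trivial keyword rule stands in for the trained classifier; all function names are hypothetical.

```python
def importance_scores(tokens):
    # Stand-in for arbitrary feature-importance scores (the paper uses
    # e.g. gradients from a trained model); here, token length is a
    # toy heuristic purely for illustration.
    return [len(t) for t in tokens]

def extract_rationale(tokens, scores, k=2):
    # Induce binary keep/drop labels over tokens by taking the top-k
    # scoring positions, preserving original token order.
    top = sorted(range(len(tokens)), key=lambda i: -scores[i])[:k]
    return [tokens[i] for i in sorted(top)]

def classify(snippet):
    # Independent classifier that only ever sees the extracted snippet,
    # so the snippet is a faithful explanation of its prediction.
    # A trivial rule stands in for an arbitrarily complex model.
    return "positive" if "excellent" in snippet else "negative"

tokens = "the film was excellent overall".split()
snippet = extract_rationale(tokens, importance_scores(tokens), k=2)
label = classify(snippet)  # decided from the snippet alone
```

Because `classify` never sees the full input, any explanation read off `snippet` is faithful regardless of how complex the classifier is, which is the core construction the abstract describes.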


