Annual meeting of the Association for Computational Linguistics

Semantically Equivalent Adversarial Rules for Debugging NLP Models

Abstract

Complex machine learning models for NLP are often brittle, making different predictions for input instances that are extremely similar semantically. To automatically detect this behavior for individual instances, we present semantically equivalent adversaries (SEAs) - semantic-preserving perturbations that induce changes in the model's predictions. We generalize these adversaries into semantically equivalent adversarial rules (SEARs) - simple, universal replacement rules that induce adversaries on many instances. We demonstrate the usefulness and flexibility of SEAs and SEARs by detecting bugs in black-box state-of-the-art models for three domains: machine comprehension, visual question-answering, and sentiment analysis. Via user studies, we demonstrate that we generate high-quality local adversaries for more instances than humans, and that SEARs induce four times as many mistakes as the bugs discovered by human experts. SEARs are also actionable: retraining models using data augmentation significantly reduces bugs, while maintaining accuracy.
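To make the rule-based rewriting concrete, the sketch below applies a single illustrative replacement rule (a "What is" to "What's" contraction, used here only as an example) to a set of inputs and counts how often a black-box model's prediction flips. The `model_predict` callable, the `count_flips` helper, and the toy classifier are assumptions for illustration, not the authors' implementation.

```python
# Minimal sketch of applying one semantically equivalent adversarial rule (SEAR).
# Assumptions: `model_predict` is any black-box text classifier returning a label,
# and the rule ("What is" -> "What's") is one illustrative replacement.
import re
from typing import Callable, List, Tuple

SEAR = Tuple[str, str]  # (regex pattern, replacement), applied as a simple substitution


def apply_rule(rule: SEAR, text: str) -> str:
    pattern, replacement = rule
    return re.sub(pattern, replacement, text)


def count_flips(rule: SEAR,
                texts: List[str],
                model_predict: Callable[[str], str]) -> int:
    """Count instances whose prediction changes after the rewrite.

    A rewrite that changes the prediction while preserving meaning is a
    semantically equivalent adversary (SEA) induced by the rule.
    """
    flips = 0
    for text in texts:
        perturbed = apply_rule(rule, text)
        if perturbed != text and model_predict(perturbed) != model_predict(text):
            flips += 1
    return flips


if __name__ == "__main__":
    # Toy stand-in for a brittle model: its output changes when it sees "What's".
    def toy_model(text: str) -> str:
        return "negative" if "What's" in text else "positive"

    rule = (r"\bWhat is\b", "What's")
    questions = ["What is the movie about?", "Who directed it?"]
    print(count_flips(rule, questions, toy_model))  # -> 1
```

In the same spirit, the flip counts over a validation set could be used to rank candidate rules, and the perturbed instances could be added back to the training data for the augmentation-based retraining the abstract describes.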