Annual Meeting of the Association for Computational Linguistics

Semantically Equivalent Adversarial Rules for Debugging NLP Models



Abstract

Complex machine learning models for NLP are often brittle, making different predictions for input instances that are extremely similar semantically. To automatically detect this behavior for individual instances, we present semantically equivalent adversaries (SEAs) - semantic-preserving perturbations that induce changes in the model's predictions. We generalize these adversaries into semantically equivalent adversarial rules (SEARs) - simple, universal replacement rules that induce adversaries on many instances. We demonstrate the usefulness and flexibility of SEAs and SEARs by detecting bugs in black-box state-of-the-art models for three domains: machine comprehension, visual question-answering, and sentiment analysis. Via user studies, we demonstrate that we generate high-quality local adversaries for more instances than humans, and that SEARs induce four times as many mistakes as the bugs discovered by human experts. SEARs are also actionable: retraining models using data augmentation significantly reduces bugs, while maintaining accuracy.
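The "simple, universal replacement rules" the abstract describes can be sketched as pattern-to-replacement rewrites applied to many input instances at once. Below is a minimal illustrative sketch (not the paper's implementation): `apply_sear` is a hypothetical helper, and the example rule contracting "What is" to "What's" is in the spirit of the rules the paper reports for machine comprehension. An instance whose prediction changes under such a semantics-preserving rewrite would be flagged as a bug.

```python
import re

def apply_sear(rule, text):
    """Apply a SEAR-style replacement rule to one input instance.

    A rule is a (pattern, replacement) pair. Returns the rewritten
    text if the rule fires, or None if the rule does not apply.
    """
    pattern, replacement = rule
    rewritten = re.sub(pattern, replacement, text)
    return rewritten if rewritten != text else None

# Illustrative rule: contract "What is" to "What's" (semantics-preserving).
rule = (r"\bWhat is\b", "What's")

questions = [
    "What is the capital of France?",
    "Who wrote the novel?",
]

for q in questions:
    adversary = apply_sear(rule, q)
    if adversary is not None:
        # In the paper's setting, a bug is flagged when the black-box
        # model's prediction differs between `q` and `adversary`.
        print(q, "->", adversary)
```

The same sketch extends naturally to the paper's data-augmentation fix: collect every (instance, adversary) pair where the model flips, add the adversaries with the original labels to the training set, and retrain.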


