首页> 外文会议>International Joint Conference on Neural Networks >Combating Word-level Adversarial Text with Robust Adversarial Training
【24h】

Combating Word-level Adversarial Text with Robust Adversarial Training

机译:通过强大的对抗性训练对抗单词级对抗性文本

获取原文

摘要

NLP models perform well on many tasks, but they are also easy to be fooled by adversarial examples. A small perturbation can change the output of the deep neural network model. This kind of perturbation is hard to be perceived by humans, especially adversarial examples generated by word-level adversarial attack. Character-level adversarial attack can be defended by grammar detection and word recognition. The existing word-level textual adversarial attacks are based on synonym replacement, so adversarial texts usually have correct grammar and semantics. The defense of word-level adversarial attack is more challenging. In this paper, we propose a framework which is called Robust Adversarial Training (RAT) to defend against word-level adversarial attacks. RAT enhances the model by combining adversarial training and data perturbation during training. Our experiments on two datasets show that the model based on our framework can effectively defend against word-level adversarial attacks. Compared with the existing defense methods, the model trained under RAT has a higher defense success rate on 1000 adversarial examples. In addition, the accuracy of our model on the standard testing set is also better than the existing defense methods, and the accuracy is very close to or even higher than that of the standard model.
机译:NLP模型在许多任务上表现良好,但它们也很容易被敌对的例子愚弄。小扰动可以改变深层神经网络模型的输出。这种干扰很难被人类察觉,尤其是单词级对抗性攻击产生的对抗性示例。字符级敌对攻击可以通过语法检测和单词识别进行防御。现有的词级文本对抗攻击都是基于同义词替换的,因此对抗文本通常具有正确的语法和语义。单词级对抗性攻击的防御更具挑战性。在本文中,我们提出了一个称为鲁棒对抗训练(RAT)的框架来防御单词级的对抗性攻击。RAT通过结合对抗性训练和训练期间的数据扰动来增强模型。我们在两个数据集上的实验表明,基于我们框架的模型可以有效地抵御单词级的对抗性攻击。与现有的防御方法相比,在鼠下训练的模型在1000个对抗实例上具有更高的防御成功率。此外,我们的模型在标准测试集上的精度也优于现有的防御方法,并且精度非常接近甚至高于标准模型。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号