International Conference on Artificial Neural Networks

Evaluating Defensive Distillation for Defending Text Processing Neural Networks Against Adversarial Examples



Abstract

Adversarial examples are artificially modified input samples that cause misclassifications while remaining undetectable by humans. They pose a challenge for many tasks such as image and text classification, especially since research shows that many adversarial examples transfer between different classifiers. In this work, we evaluate the performance of a popular defensive strategy against adversarial examples called defensive distillation, which can successfully harden neural networks against adversarial examples in the image domain. Instead of applying defensive distillation to networks for image classification, however, we examine its performance on text classification tasks for the first time, and also evaluate its effect on the transferability of adversarial text examples. Our results indicate that defensive distillation has only a minimal impact on text-classifying neural networks: it neither increases their robustness against adversarial examples nor prevents the transferability of adversarial examples between neural networks.
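The defense evaluated in the abstract, defensive distillation, trains a "student" network on the temperature-softened output probabilities of a "teacher" network, then deploys the student at temperature 1. A minimal sketch of the temperature-scaled softmax at its core (the logit values here are hypothetical, for illustration only):

```python
import numpy as np

def softmax_T(logits, T=1.0):
    """Temperature-scaled softmax: higher T yields a softer, smoother distribution."""
    z = np.asarray(logits, dtype=float) / T
    z = z - z.max()          # subtract max for numerical stability
    e = np.exp(z)
    return e / e.sum()

# Hypothetical teacher logits for one text sample over three classes.
logits = np.array([6.0, 2.0, 1.0])

hard = softmax_T(logits, T=1.0)   # near one-hot: dominated by the top class
soft = softmax_T(logits, T=20.0)  # softened labels carrying relative class similarity

# In defensive distillation, the student is trained on `soft` (also at
# temperature T) instead of hard labels, which flattens the gradients an
# attacker can exploit; at test time the student runs at T = 1.
```

The paper's finding is that this softening, which helps against gradient-based attacks on images, gives little benefit for text classifiers.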


