Ensemble Generative Cleaning With Feedback Loops for Defending Adversarial Attacks

机译：集成带有反馈回路的生成式清洗，以防御对抗性攻击

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

Effective defense of deep neural networks against adversarial attacks remains a challenging problem, especially under powerful white-box attacks. In this paper, we develop a new method called ensemble generative cleaning with feedback loops (EGC-FL) for effective defense of deep neural networks. The proposed EGC-FL method is based on two central ideas. First, we introduce a transformed deadzone layer into the defense network, which consists of an orthonormal transform and a deadzone-based activation function, to destroy the sophisticated noise pattern of adversarial attacks. Second, by constructing a generative cleaning network with a feedback loop, we are able to generate an ensemble of diverse estimations of the original clean image. We then learn a network to fuse this set of diverse estimations together to restore the original image. Our extensive experimental results demonstrate that our approach improves the state-of-art by large margins in both white-box and black-box attacks. It significantly improves the classification accuracy for white-box PGD attacks upon the second best method by more than 29% on the SVHN dataset and more than 39% on the challenging CIFAR-10 dataset.

机译：有效防御神经网络对抗对抗攻击仍然是一个具有挑战性的问题，尤其是在强大的白盒攻击下。在本文中，我们开发了一种新的方法，该方法称为带反馈环路的集成生成清洗（EGC-FL），用于有效防御深度神经网络。提出的EGC-FL方法基于两个中心思想。首先，我们将变换后的死区层引入防御网络，该层由正交变换和基于死区的激活函数组成，以破坏对抗性攻击的复杂噪声模式。其次，通过构建带有反馈回路的生成式清洁网络，我们能够生成对原始清洁图像的各种估计的集合。然后，我们学习一个网络，将这组多样化的估计融合在一起，以恢复原始图像。我们广泛的实验结果表明，我们的方法在白盒和黑盒攻击中都大大改进了现有技术。在SVHN数据集上，针对次优方法的白盒PGD攻击的分类准确性显着提高了29％以上，在具有挑战性的CIFAR-10数据集上，提高了39％以上。

著录项

来源
《IEEE/CVF Conference on Computer Vision and Pattern Recognition》|2020年|578-587|共10页
会议地点
作者
Jianhe Yuan; Zhihai He;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类
关键词
Cleaning; Feedback loop; Transforms; Neural networks; Estimation; Fuses; Iterative methods;

机译：清洗;反馈回路;变换;神经网络;估计;保险丝;迭代方法;

相似文献

外文文献
中文文献
专利

1. Fighting fire with fire: A spatial-frequency ensemble relation network with generative adversarial learning for adversarial image classification [J] . Wenbo Zheng, LanYan, Chao Gou, International Journal of Intelligent Systems . 2021,第5期

机译：用火力战斗：一种空间频合体关系网络，具有对抗对抗图像分类的生成对抗性学习
2. attackGAN: Adversarial Attack against Black-box IDS using Generative Adversarial Networks [J] . Shuang Zhao, Jing Li, Jianmin Wang, Procedia Computer Science . 2021,第a期

机译：通过生成的对抗网络，进攻：对黑匣子ID的对抗攻击
3. Adversarial Examples Detection for XSS Attacks Based on Generative Adversarial Networks [J] . Zhang Xueqin, Zhou Yue, Pei Songwen, Quality Control, Transactions . 2020,第期

机译：基于生成对抗网络的XSS攻击对抗对抗示例检测
4. DeepIris: An ensemble approach to defending Iris recognition classifiers against Adversarial Attacks [C] . Tamizhiniyan S R, Aman Ojha, Meenakshi K, International Conference on Computer Communication and Informatics . 2021

机译：Deepiris：一种捍卫虹膜识别分类器的集合方法，对抗对抗攻击
5. Robust Filtering Schemes for Machine Learning Systems to Defend Adversarial Attack [D] . Gupta, Kishor Datta. 2021

机译：用于捍卫对抗攻击的机器学习系统的强大过滤方案
6. Presentation Attack Face Image Generation Based on a Deep Generative Adversarial Network [O] . Dat Tien Nguyen, Tuyen Danh Pham, Ganbayar Batchuluun, 2020

机译：基于深度生成对抗网络的表示攻击面图像生成
7. Adversarial Examples Detection for XSS Attacks Based on Generative Adversarial Networks [O] . Xueqin Zhang, Yue Zhou, Songwen Pei, 2020

机译：基于生成对抗网络的XSS攻击对抗对抗示例检测

Ensemble Generative Cleaning With Feedback Loops for Defending Adversarial Attacks

摘要

著录项

相似文献

相关主题

期刊订阅