首页> 外文会议>IEEE/CVF Conference on Computer Vision and Pattern Recognition >Ensemble Generative Cleaning With Feedback Loops for Defending Adversarial Attacks
【24h】

Ensemble Generative Cleaning With Feedback Loops for Defending Adversarial Attacks

机译:集成带有反馈回路的生成式清洗,以防御对抗性攻击

获取原文

摘要

Effective defense of deep neural networks against adversarial attacks remains a challenging problem, especially under powerful white-box attacks. In this paper, we develop a new method called ensemble generative cleaning with feedback loops (EGC-FL) for effective defense of deep neural networks. The proposed EGC-FL method is based on two central ideas. First, we introduce a transformed deadzone layer into the defense network, which consists of an orthonormal transform and a deadzone-based activation function, to destroy the sophisticated noise pattern of adversarial attacks. Second, by constructing a generative cleaning network with a feedback loop, we are able to generate an ensemble of diverse estimations of the original clean image. We then learn a network to fuse this set of diverse estimations together to restore the original image. Our extensive experimental results demonstrate that our approach improves the state-of-art by large margins in both white-box and black-box attacks. It significantly improves the classification accuracy for white-box PGD attacks upon the second best method by more than 29% on the SVHN dataset and more than 39% on the challenging CIFAR-10 dataset.
机译:有效防御神经网络对抗对抗攻击仍然是一个具有挑战性的问题,尤其是在强大的白盒攻击下。在本文中,我们开发了一种新的方法,该方法称为带反馈环路的集成生成清洗(EGC-FL),用于有效防御深度神经网络。提出的EGC-FL方法基于两个中心思想。首先,我们将变换后的死区层引入防御网络,该层由正交变换和基于死区的激活函数组成,以破坏对抗性攻击的复杂噪声模式。其次,通过构建带有反馈回路的生成式清洁网络,我们能够生成对原始清洁图像的各种估计的集合。然后,我们学习一个网络,将这组多样化的估计融合在一起,以恢复原始图像。我们广泛的实验结果表明,我们的方法在白盒和黑盒攻击中都大大改进了现有技术。在SVHN数据集上,针对次优方法的白盒PGD攻击的分类准确性显着提高了29%以上,在具有挑战性的CIFAR-10数据集上,提高了39%以上。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号