首页> 外文会议>International Conference on Computer Vision >CIIDefence: Defeating Adversarial Attacks by Fusing Class-Specific Image Inpainting and Image Denoising
【24h】

CIIDefence: Defeating Adversarial Attacks by Fusing Class-Specific Image Inpainting and Image Denoising

机译:CIIDefence:通过融合特定于类别的图像修复和图像去噪来对抗对抗性攻击

获取原文

摘要

This paper presents a novel approach for protecting deep neural networks from adversarial attacks, i.e., methods that add well-crafted imperceptible modifications to the original inputs such that they are incorrectly classified with high confidence. The proposed defence mechanism is inspired by the recent works mitigating the adversarial disturbances by the means of image reconstruction and denoising. However, unlike the previous works, we apply the reconstruction only for small and carefully selected image areas that are most influential to the current classification outcome. The selection process is guided by the class activation map responses obtained for multiple top-ranking class labels. The same regions are also the most prominent for the adversarial perturbations and hence most important to purify. The resulting inpainting task is substantially more tractable than the full image reconstruction, while still being able to prevent the adversarial attacks. Furthermore, we combine the selective image inpainting with wavelet based image denoising to produce a non differentiable layer that prevents attacker from using gradient backpropagation. Moreover, the proposed nonlinearity cannot be easily approximated with simple differentiable alternative as demonstrated in the experiments with Backward Pass Differentiable Approximation (BPDA) attack. Finally, we experimentally show that the proposed Class-specific Image Inpainting Defence (CIIDefence) is able to withstand several powerful adversarial attacks including the BPDA. The obtained results are consistently better compared to the other recent defence approaches.
机译:本文提出了一种保护深度神经网络免受对抗攻击的新颖方法,即为原始输入添加精心设计的,不可感知的修改的方法,从而使它们以高置信度被错误地分类。提出的防御机制受到最近通过图像重建和降噪技术减轻对抗性干扰的工作的启发。但是,与以前的作品不同,我们仅将重建应用于对当前分类结果影响最大的细小且精心选择的图像区域。选择过程以为多个顶级类别标签获得的类别激活图响应为指导。相同区域也是对抗性扰动最突出的区域,因此最重要的是进行净化。最终的修复任务比完整的图像重建更容易处理,同时仍然能够防止对抗性攻击。此外,我们将选择性图像修复与基于小波的图像去噪相结合,以产生不可区分的图层,从而防止攻击者使用梯度反向传播。此外,所提出的非线性无法通过简单的可微选择轻松地近似,如反向通过可微逼近(BPDA)攻击的实验所证明的那样。最后,我们通过实验证明所提出的特定于类别的图像修复防御(CIIDefence)能够抵御包括BPDA在内的几种强大的对抗攻击。与其他最近的防御方法相比,获得的结果始终更好。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号