【24h】

Interpretation Area-Guided Detection of Adversarial Samples

机译:解释面积引导对抗性样本的检测

获取原文

摘要

Deep learning systems are known to be vulnerable to adversarial samples, which are implemented to change the prediction results by adding small perturbations to benign samples. It is significant to defend against an adversarial attack in critical fields such as automatic drive. In this paper, we propose an interpretation area-guided detection method of adversarial samples, which can improve the performance of the typical feature squeezing method by combining the generated interpretation results. Specifically, we divide the input image into two main parts, the interpretation part, and the non-interpretation part. Then we only squeeze the non-interpretation part, which can reduce the side-effect for benign samples. We evaluate our approach on two widely used datasets, and the results demonstrate that our approach outperforms the original feature squeezing method.
机译:已知深度学习系统容易受到对抗的样本,这被实施以通过向良性样本增加小扰动来改变预测结果。在诸如自动驱动等临界领域的对抗性攻击是重要的。在本文中,我们提出了一种口语区域引导检测方法,其通过组合产生的解释结果来提高典型特征挤压方法的性能。具体地,我们将输入图像分为两个主要部分,解释部分和非解释部分。然后我们只挤压非解释部分,可以减少良性样本的副作用。我们在两个广泛使用的数据集中评估我们的方法,结果表明我们的方法优于原始特征挤压方法。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号