
Adversarial Attack Type I: Cheat Classifiers by Significant Changes



Abstract

Despite the great success of deep neural networks, adversarial attacks can cheat well-trained classifiers with small perturbations. In this paper, we propose another type of adversarial attack that cheats classifiers by significant changes. For example, we can significantly change a face, yet a well-trained neural network still recognizes the adversarial and the original example as the same person. Statistically, the existing adversarial attack increases Type II error, while the proposed one aims at Type I error; they are hence named Type II and Type I adversarial attacks, respectively. The two types of attack are equally important but essentially different, which we explain intuitively and evaluate numerically. To implement the proposed attack, a supervised variational autoencoder is designed, and the classifier is then attacked by updating the latent variables using gradient information. In addition, with pre-trained generative models, Type I attacks on latent spaces are investigated as well. Experimental results show that our method is practical and effective for generating Type I adversarial examples on large-scale image datasets. Most of the generated examples can pass detectors designed for defending against Type II attacks, and the strengthening strategy is only effective against the specific attack type it targets; both findings imply that the underlying reasons for Type I and Type II attacks are different.
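The mechanism sketched in the abstract, searching the latent space of a generative model for an example that changes greatly in input space while the classifier's decision stays fixed, can be illustrated with a short sketch. The code below is a minimal illustration under assumed components: a pre-trained decoder G, an encoder encode, and a target classifier f; all names, step counts, and loss weights are illustrative and not the authors' exact formulation.

```python
import torch

def type1_attack(x0, y0, G, f, encode, steps=200, lr=0.05, dist_weight=1.0):
    """Search the latent space for an example that differs greatly from x0
    in pixel space, yet is still classified as y0 by f (Type I attack idea)."""
    # Start from the latent code of the original example x0.
    z = encode(x0).detach().clone().requires_grad_(True)
    opt = torch.optim.Adam([z], lr=lr)
    for _ in range(steps):
        x_adv = G(z)                                                   # decode candidate example
        keep_label = torch.nn.functional.cross_entropy(f(x_adv), y0)  # keep the original prediction
        change = -dist_weight * torch.norm(x_adv - x0)                 # encourage a large input change
        loss = keep_label + change
        opt.zero_grad()
        loss.backward()
        opt.step()
    return G(z).detach()
```

Keeping the cross-entropy to the original label small while maximizing the pixel-space distance mirrors the Type I objective: a large change to the input that the classifier nevertheless assigns to the same class.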

Bibliographic information

  • Source: IEEE Transactions on Pattern Analysis and Machine Intelligence
  • Author affiliations

    Institute of Image Processing and Pattern Recognition, Shanghai Jiao Tong University, Shanghai 200240, China; Institute of Medical Robotics, Shanghai Jiao Tong University, Shanghai 200240, China; MOE Key Laboratory of System Control and Information Processing, Shanghai Jiao Tong University, Shanghai 200240, China

  • Indexing information
  • Original format: PDF
  • Language: English
  • Chinese Library Classification
  • Keywords

    Neural networks; Training; Aerospace electronics; Toy manufacturing industry; Sun; Face recognition; Task analysis; Adversarial attack; type I error; supervised variational autoencoder;


