
Learning to Attack: Adversarial Transformation Networks



Abstract

With the rapidly increasing popularity of deep neural networks for image recognition tasks, a parallel interest in generating adversarial examples to attack the trained models has arisen. To date, these approaches have involved either directly computing gradients with respect to the image pixels or directly solving an optimization on the image pixels. We generalize this pursuit in a novel direction: can a separate network be trained to efficiently attack another fully trained network? We demonstrate that it is possible, and that the generated attacks yield startling insights into the weaknesses of the target network. We call such a network an Adversarial Transformation Network (ATN). ATNs transform any input into an adversarial attack on the target network, while being minimally perturbing to the original inputs and the target network's outputs. Further, we show that ATNs are capable of not only causing the target network to make an error, but can be constructed to explicitly control the type of misclassification made. We demonstrate ATNs on both simple MNIST-digit classifiers and state-of-the-art ImageNet classifiers deployed by Google, Inc.: Inception ResNet-v2.
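The abstract describes the ATN objective only at a high level: the attack network g is trained to keep g(x) close to x while steering the target classifier f toward a chosen class, using a reranking of f's original output as the regression target. A minimal sketch of one such training step follows, written in PyTorch; the two-layer architecture, the rerank helper, and the weights alpha and beta are illustrative assumptions rather than the paper's exact configuration.

import torch
import torch.nn as nn
import torch.nn.functional as F

class ATN(nn.Module):
    """Maps an input to an adversarial example for a fixed target network."""
    def __init__(self, dim=784):
        super().__init__()
        self.net = nn.Sequential(
            nn.Linear(dim, 512), nn.ReLU(),
            nn.Linear(512, dim), nn.Tanh(),  # keep outputs bounded in [-1, 1]
        )

    def forward(self, x):
        return self.net(x)

def rerank(probs, target_class, alpha=1.5):
    # Boost the chosen class above the current maximum, then renormalize,
    # so the regression target preserves the relative order of other classes.
    r = probs.clone()
    r[:, target_class] = alpha * probs.max(dim=1).values
    return r / r.sum(dim=1, keepdim=True)

def train_step(atn, target_model, x, target_class, opt, alpha=1.5, beta=0.1):
    # The target network is fully trained and frozen; only the ATN is updated
    # (opt should be built over atn.parameters()).
    for p in target_model.parameters():
        p.requires_grad_(False)
    x_adv = atn(x)
    y = F.softmax(target_model(x), dim=1)          # original predictions
    y_adv = F.softmax(target_model(x_adv), dim=1)  # predictions on the attack
    # Weighted sum of an input-space loss (stay close to x) and an
    # output-space loss (match the reranked predictions).
    loss = beta * F.mse_loss(x_adv, x) \
        + F.mse_loss(y_adv, rerank(y, target_class, alpha))
    opt.zero_grad()
    loss.backward()
    opt.step()
    return loss.item()

Once trained over many inputs, this gives a feed-forward attacker: a single forward pass through the ATN produces an adversarial example, with no per-image gradient computation or optimization against the target network.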
