IEEE International Conference on Image Processing

Model-Agnostic Adversarial Example Detection Through Logit Distribution Learning



Abstract

Recent research on vision-based tasks has achieved great improvement due to the development of deep learning solutions. However, deep models have been found vulnerable to adversarial attacks, in which the original inputs are maliciously manipulated to cause dramatic shifts in the outputs. In this paper, we focus on adversarial attacks on image classifiers built with deep neural networks and propose a model-agnostic approach to detect adversarial inputs. We argue that the logit semantics of adversarial inputs follow a different evolution from those of original inputs, and we construct a logits-based feature embedding for effective representation learning. We train an LSTM network to further analyze the sequence of logits-based features and detect adversarial examples. Experimental results on the MNIST, CIFAR-10, and CIFAR-100 datasets show that our method achieves state-of-the-art accuracy in detecting adversarial examples and has strong generalizability.
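The abstract gives no implementation details, so the following is only a minimal sketch of the core idea it describes: record the logits of an input as a sequence (here assumed to come from successive observation points such as training checkpoints or layers, which is an assumption rather than a detail from the paper) and classify that sequence with an LSTM as clean or adversarial. All class names, shapes, and hyperparameters below are illustrative, not the authors' code.

import torch
import torch.nn as nn

class LogitSequenceDetector(nn.Module):
    """Hypothetical detector: an LSTM over a sequence of logit vectors."""

    def __init__(self, num_classes: int = 10, hidden_size: int = 64):
        super().__init__()
        # Each time step is one logit vector from the classifier under inspection.
        self.lstm = nn.LSTM(input_size=num_classes, hidden_size=hidden_size,
                            batch_first=True)
        self.head = nn.Linear(hidden_size, 1)  # scalar adversarial score

    def forward(self, logit_seq: torch.Tensor) -> torch.Tensor:
        # logit_seq: (batch, seq_len, num_classes)
        _, (h_n, _) = self.lstm(logit_seq)
        # Final hidden state summarizes how the logits evolved over the sequence.
        return self.head(h_n[-1]).squeeze(-1)  # raw score; apply sigmoid for a probability

# Usage sketch: 8 inputs, each with logits recorded at 5 observation points.
detector = LogitSequenceDetector(num_classes=10)
scores = detector(torch.randn(8, 5, 10))
loss = nn.BCEWithLogitsLoss()(scores, torch.zeros(8))  # 0 = clean, 1 = adversarial

The design choice this illustrates is that detection reduces to binary classification of a logit-evolution summary: the LSTM's final hidden state encodes how the logits change across the sequence, which the abstract argues differs between clean and adversarial inputs.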
