IEEE Security and Privacy Workshops

Black-Box Generation of Adversarial Text Sequences to Evade Deep Learning Classifiers



Abstract

Although various techniques have been proposed to generate adversarial samples for white-box attacks on text, little attention has been paid to black-box attacks, which are a more realistic scenario. In this paper, we present a novel algorithm, DeepWordBug, to effectively generate small text perturbations in a black-box setting that force a deep-learning classifier to misclassify a text input. We develop novel scoring strategies to find the most important words to modify such that the deep classifier makes a wrong prediction. Simple character-level transformations are applied to the highest-ranked words in order to minimize the edit distance of the perturbation. We evaluate DeepWordBug on two real-world text datasets: Enron spam emails and IMDB movie reviews. Our experimental results indicate that DeepWordBug can reduce the classification accuracy from 99% to 40% on Enron and from 87% to 26% on IMDB. Our results further demonstrate that adversarial sequences generated against one deep-learning model can similarly evade other deep models.


