【24h】

Simtrojan: Stealthy Backdoor Attack

机译:Simtrojan:隐身后门攻击

获取原文

摘要

Recent researches indicate deep learning models are vulnerable to adversarial attacks. Backdoor attack, also called trojan attack, is a variant of adversarial attacks. An malicious attacker can inject backdoor to models in training phase. As a result, the backdoor model performs normally on clean samples and can be triggered by a backdoor pattern to recognize backdoor samples as a wrong target label specified by the attacker. However, the vanilla backdoor attack method causes a measurable difference between clean and backdoor samples in latent space. Several state-of-the-art defense methods utilize this to identify backdoor samples. In this paper, we propose a novel backdoor attack method called SimTrojan, which aims to inject backdoor in models stealthily. Specifically, SimTrojan makes clean and backdoor samples have indistinguishable representations in latent space to evade current defense methods. Experiments demonstrate that SimTrojan achieves a high attack success rate and is undetectable by state-of-the-art defense methods. The study suggests the urgency of building more effective defense methods.
机译:最近的研究表明深入学习模型容易受到对抗的攻击。后门攻击,也称为木马攻击,是对抗攻击的变种。恶意攻击者可以在训练阶段注入后门。结果,后门模型通常在清洁样本上执行,并且可以通过后门图案触发,以将后门样本识别为攻击者指定的错误目标标签。然而,香草后卫攻击方法导致潜伏空间中的清洁和后门样品之间的可测量差异。几种最先进的防御方法利用它来识别后门样品。在本文中,我们提出了一种新的后门攻击方法,称为Simtrojan,旨在悄悄地注入模型中的后门。具体而言,Simtrojan使得干净和后门样本在潜在空间中具有无法区分的表示,以逃避当前的防御方法。实验表明,Simtrojan实现了高攻击成功率,并且最先进的防御方法无法察觉。该研究表明建立更有效的防御方法的紧迫性。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号