Real-time, Robust and Adaptive Universal Adversarial Attacks Against Speaker Recognition Systems

Xie Yi; Li Zhuohang; Shi Cong; Liu Jian; Chen Yingying; Yuan Bo

首页> 外文期刊>Journal of signal processing systems for signal, image, and video technology >Real-time, Robust and Adaptive Universal Adversarial Attacks Against Speaker Recognition Systems

【24h】

Real-time, Robust and Adaptive Universal Adversarial Attacks Against Speaker Recognition Systems

机译：对扬声器识别系统的实时，鲁棒和适应性的普遍对抗攻击

获取原文

获取原文并翻译 | 示例

开具论文收录证明 >>

页面导航

摘要
著录项
引文网络
相似文献
相关主题

摘要

Voice user interface (VUI) has become increasingly popular in recent years. Speaker recognition system, as one of the most common VUIs, has emerged as an important technique to facilitate security-required applications and services. In this paper, we propose to design, for the first time, a real-time, robust, and adaptive universal adversarial attack against the state-of-the-art deep neural network (DNN) based speaker recognition systems in the white-box scenario. By developing an audio-agnostic universal perturbation, we can make the DNN-based speaker recognition systems to misidentify the speaker as the adversary-desired target label, with using a single perturbation that can be applied on arbitrary enrolled speaker's voice. In addition, we improve the robustness of our attack by modeling the sound distortions caused by the physical over-the-air propagation through estimating room impulse response (RIR). Moreover, we propose to adaptively adjust the magnitude of perturbations according to each individual utterance via spectral gating. This can further improve the imperceptibility of the adversarial perturbations with minor increase of attack generation time. Experiments on a public dataset of 109 English speakers demonstrate the effectiveness and robustness of the proposed attack. Our attack method achieves average 90% attack success rate on both X-vector and d-vector speaker recognition systems. Meanwhile, our method achieves 100 x speedup on attack launching time, as compared to the conventional non-universal attacks.

机译：近年来，语音用户界面（VUI）变得越来越受欢迎。扬声器识别系统作为最常见的VUI之一，它成为促进安全所需的应用和服务的重要技术。在本文中，我们首次提出设计，对白盒子中的最先进的深神经网络（DNN）的扬声器识别系统进行实时，强大，适应性的通用对抗攻击设想。通过开发音频不可知的通用扰动，我们可以使基于DNN的扬声器识别系统将扬声器定制为逆境所需的目标标签，并使用可以在任意注册的扬声器的语音上应用的单一扰动。此外，我们通过通过估计房间脉冲响应（RIR）来提高由物理过空气传播引起的声音扭曲来改善我们攻击的稳健性。此外，我们建议根据每个单独的话语通过光谱栅极自适应地调整扰动的大小。这可以进一步提高对攻击生成时间的轻微增加的对抗扰动的难以察觉。 109英语扬声器公共数据集的实验证明了拟议攻击的有效性和稳健性。我们的攻击方法在X载体和D载向量扬声器识别系统上实现了90％的攻击成功率。同时，与传统的非普遍攻击相比，我们的方法在攻击发射时间上实现了100倍的加速。

著录项

来源
《Journal of signal processing systems for signal, image, and video technology》 |2021年第10期|1187-1200|共14页
作者
Xie Yi; Li Zhuohang; Shi Cong; Liu Jian; Chen Yingying; Yuan Bo;
展开▼
作者单位

Rutgers State Univ Dept Elect & Comp Engn New Brunswick NJ USA;

Univ Tennessee Dept Elect Engn & Comp Sci Knoxville TN USA;

Rutgers State Univ Dept Elect & Comp Engn New Brunswick NJ USA;

Univ Tennessee Dept Elect Engn & Comp Sci Knoxville TN USA;

Rutgers State Univ New Brunswick NJ 08901 USA;

Rutgers State Univ New Brunswick NJ 08901 USA;

展开▼
收录信息美国《科学引文索引》(SCI);美国《工程索引》(EI);
原文格式 PDF
正文语种 eng
中图分类
关键词
Speaker recognition systems; Adversarial examples; Universal adversarial attack;

机译：扬声器识别系统;对抗例;普遍对抗攻击;

相似文献

外文文献
中文文献
专利

1. Adversarial attack and defense strategies for deep speaker recognition systems [J] . Arindam Jati, Chin-Cheng Hsu, Monisankha Pal, Computer speech and language . 2021,第Jula期

机译：深层扬声器识别系统的对抗攻击与防御策略
2. Robust distant speaker recognition based on position-dependent CMN by combining speaker-specific GMM with speaker-adapted HMM [J] . Longbiao Wang, Norihide Kitaoka, Seiichi Nakagawa Speech Communication . 2007,第6期

机译：通过结合特定于说话人的GMM和适用于说话人的HMM，基于位置相关的CMN进行鲁棒的远方说话人识别
3. Noise robustness of speaker adapted recognition system [J] . Mikio Mori, Shuji Taniguchi, Hideji Sakamoto, 電子情報通信学会技術研究報告. 音声. Speech . 2000,第240期

机译：说话人自适应识别系统的噪声鲁棒性
4. Real-Time, Universal, and Robust Adversarial Attacks Against Speaker Recognition Systems [C] . Yi Xie, Cong Shi, Zhuohang Li, IEEE International Conference on Acoustics, Speech and Signal Processing . 2020

机译：对说话人识别系统的实时，普遍和强大的对抗攻击
5. CAG: a Real-Time Low-Cost Enhanced-Robustness High-Transferability Content-Aware Adversarial Attack Generator [D] . Phan, Van Nhat Huy. 2020

机译：CAG：实时低成本增强稳健性高转移性内容感知对抗发生器
6. DReLAB - Deep REinforcement Learning Adversarial Botnet: A benchmark dataset for adversarial attacks against botnet Intrusion Detection Systems [O] . Andrea Venturi, Giovanni Apruzzese, Mauro Andreolini, 2021

机译：DRELAB - 深度加强学习对抗僵尸网络：用于对僵尸网络入侵检测系统进行对抗性攻击的基准数据集
7. Adversarial attack and defense strategies for deep speaker recognition systems [O] . Arindam Jati, Chin-Cheng Hsu, Monisankha Pal, 2021

机译：深层扬声器识别系统的对抗攻击与防御策略

Real-time, Robust and Adaptive Universal Adversarial Attacks Against Speaker Recognition Systems

摘要

著录项

引文网络

相似文献

相关主题

期刊订阅