首页> 外文期刊>Journal of signal processing systems for signal, image, and video technology >Real-time, Robust and Adaptive Universal Adversarial Attacks Against Speaker Recognition Systems
【24h】

Real-time, Robust and Adaptive Universal Adversarial Attacks Against Speaker Recognition Systems

机译:对扬声器识别系统的实时,鲁棒和适应性的普遍对抗攻击

获取原文
获取原文并翻译 | 示例
           

摘要

Voice user interface (VUI) has become increasingly popular in recent years. Speaker recognition system, as one of the most common VUIs, has emerged as an important technique to facilitate security-required applications and services. In this paper, we propose to design, for the first time, a real-time, robust, and adaptive universal adversarial attack against the state-of-the-art deep neural network (DNN) based speaker recognition systems in the white-box scenario. By developing an audio-agnostic universal perturbation, we can make the DNN-based speaker recognition systems to misidentify the speaker as the adversary-desired target label, with using a single perturbation that can be applied on arbitrary enrolled speaker's voice. In addition, we improve the robustness of our attack by modeling the sound distortions caused by the physical over-the-air propagation through estimating room impulse response (RIR). Moreover, we propose to adaptively adjust the magnitude of perturbations according to each individual utterance via spectral gating. This can further improve the imperceptibility of the adversarial perturbations with minor increase of attack generation time. Experiments on a public dataset of 109 English speakers demonstrate the effectiveness and robustness of the proposed attack. Our attack method achieves average 90% attack success rate on both X-vector and d-vector speaker recognition systems. Meanwhile, our method achieves 100 x speedup on attack launching time, as compared to the conventional non-universal attacks.
机译:近年来,语音用户界面(VUI)变得越来越受欢迎。扬声器识别系统作为最常见的VUI之一,它成为促进安全所需的应用和服务的重要技术。在本文中,我们首次提出设计,对白盒子中的最先进的深神经网络(DNN)的扬声器识别系统进行实时,强大,适应性的通用对抗攻击设想。通过开发音频不可知的通用扰动,我们可以使基于DNN的扬声器识别系统将扬声器定制为逆境所需的目标标签,并使用可以在任意注册的扬声器的语音上应用的单一扰动。此外,我们通过通过估计房间脉冲响应(RIR)来提高由物理过空气传播引起的声音扭曲来改善我们攻击的稳健性。此外,我们建议根据每个单独的话语通过光谱栅极自适应地调整扰动的大小。这可以进一步提高对攻击生成时间的轻微增加的对抗扰动的难以察觉。 109英语扬声器公共数据集的实验证明了拟议攻击的有效性和稳健性。我们的攻击方法在X载体和D载向量扬声器识别系统上实现了90%的攻击成功率。同时,与传统的非普遍攻击相比,我们的方法在攻击发射时间上实现了100倍的加速。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号