【24h】

Additive attacks on speaker recognition

机译:对说话人识别的加性攻击

获取原文

摘要

Speaker recognition is used to identify a speaker's voice from among a group of known speakers. A common method of speaker recognition is a classification based on cepstral coefficients of the speaker's voice, using a Gaussian mixture model (GMM) to model each speaker. In this paper we try to fool a speaker recognition system using additive noise such that an intruder is recognized as a target user. Our attack uses a mixture selected from a target user's GMM model, inverting the cepstral transformation to produce noise samples. In our 5 speaker data base, we achieve an attack success rate of 50% with a noise signal at 10dB SNR, and 95% by increasing noise power to 0dB SNR. The importance of this attack is its simplicity and flexibility: it can be employed in real time with no processing of an attacker's voice, and little computation is needed at the moment of detection, allowing the attack to be performed by a small portable device. For any target user, knowing that user's model or voice sample is sufficient to compute the attack signal, and it is enough that the intruder plays it while he/she is uttering to be classified as the victim.
机译:说话者识别用于从一组已知说话者中识别说话者的声音。说话人识别的一种常见方法是使用高斯混合模型(GMM)对每个说话人建模,基于说话人语音的倒谱系数进行分类。在本文中,我们尝试使用附加噪声来欺骗说话者识别系统,从而将入侵者识别为目标用户。我们的攻击使用了从目标用户的GMM模型中选择的混合,反转倒频谱变换以生成噪声样本。在我们的5个扬声器数据库中,使用10dB SNR的噪声信号,我们可以达到50%的攻击成功率,而如果将噪声功率提高到0dB SNR,则可以达到95%的攻击成功率。这种攻击的重要性在于其简单性和灵活性:它可以实时使用而无需处理攻击者的声音,并且在检测时几乎不需要任何计算,从而可以由小型便携式设备执行攻击。对于任何目标用户,只要知道用户的模型或语音样本就足以计算攻击信号,并且入侵者在说出自己是受害者的时候就播放了攻击信号就足够了。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号