Additive attacks on speaker recognition

机译：对说话人识别的加性攻击

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

Speaker recognition is used to identify a speaker's voice from among a group of known speakers. A common method of speaker recognition is a classification based on cepstral coefficients of the speaker's voice, using a Gaussian mixture model (GMM) to model each speaker. In this paper we try to fool a speaker recognition system using additive noise such that an intruder is recognized as a target user. Our attack uses a mixture selected from a target user's GMM model, inverting the cepstral transformation to produce noise samples. In our 5 speaker data base, we achieve an attack success rate of 50% with a noise signal at 10dB SNR, and 95% by increasing noise power to 0dB SNR. The importance of this attack is its simplicity and flexibility: it can be employed in real time with no processing of an attacker's voice, and little computation is needed at the moment of detection, allowing the attack to be performed by a small portable device. For any target user, knowing that user's model or voice sample is sufficient to compute the attack signal, and it is enough that the intruder plays it while he/she is uttering to be classified as the victim.

机译：说话者识别用于从一组已知说话者中识别说话者的声音。说话人识别的一种常见方法是使用高斯混合模型（GMM）对每个说话人建模，基于说话人语音的倒谱系数进行分类。在本文中，我们尝试使用附加噪声来欺骗说话者识别系统，从而将入侵者识别为目标用户。我们的攻击使用了从目标用户的GMM模型中选择的混合，反转倒频谱变换以生成噪声样本。在我们的5个扬声器数据库中，使用10dB SNR的噪声信号，我们可以达到50％的攻击成功率，而如果将噪声功率提高到0dB SNR，则可以达到95％的攻击成功率。这种攻击的重要性在于其简单性和灵活性：它可以实时使用而无需处理攻击者的声音，并且在检测时几乎不需要任何计算，从而可以由小型便携式设备执行攻击。对于任何目标用户，只要知道用户的模型或语音样本就足以计算攻击信号，并且入侵者在说出自己是受害者的时候就播放了攻击信号就足够了。

著录项

来源
《Conference on media watermarking, security, and forensics》|2014年|90280Q.1-90280Q.13|共13页
会议地点
作者
Alireza Farrokh Baroughi; Scott Craver;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类
关键词
Speaker recognition; additive attack; MFCC; GMM;

机译：说话人识别;加性攻击; MFCC; GMM;

相似文献

外文文献
中文文献
专利

1. Real-time, Robust and Adaptive Universal Adversarial Attacks Against Speaker Recognition Systems [J] . Xie Yi, Li Zhuohang, Shi Cong, Journal of signal processing systems for signal, image, and video technology . 2021,第10期

机译：对扬声器识别系统的实时，鲁棒和适应性的普遍对抗攻击
2. Adversarial attack and defense strategies for deep speaker recognition systems [J] . Arindam Jati, Chin-Cheng Hsu, Monisankha Pal, Computer speech and language . 2021,第Jula期

机译：深层扬声器识别系统的对抗攻击与防御策略
3. A Method of Joint Compensation of Additive and Convolutive Distortions for Speaker-Independent Speech Recognition [J] . Gong Y. IEEE Transactions on Speech and Audio Proceessing . 2005,第5期

机译：一种独立于说话人的语音识别的加法和卷积失真联合补偿方法
4. Can We Use Speaker Recognition Technology to Attack Itself? Enhancing Mimicry Attacks Using Automatic Target Speaker Selection [C] . Tomi Kinnunen, Rosa González Hautamäki, Ville Vestman, IEEE International Conference on Acoustics, Speech and Signal Processing . 2019

机译：我们可以使用说话人识别技术进行自我攻击吗？使用自动目标说话人选择增强模仿攻击
5. Attacks on Biometric Systems for Speaker and Face Recognition. [D] . Farrokh Baroughi, Alireza. 2016

机译：针对说话人和面部识别的生物识别系统的攻击。
6. Revisiting vocal perception in non-human animals: a review of vowel discrimination speaker voice recognition and speaker normalization [O] . Buddhamas Kriengwatana, Paola Escudero, Carel ten Cate 2014

机译：重温非人类动物的声音感知：元音辨别说话人语音识别和说话人正常化的综述
7. Adversarial attack and defense strategies for deep speaker recognition systems [O] . Arindam Jati, Chin-Cheng Hsu, Monisankha Pal, 2021

机译：深层扬声器识别系统的对抗攻击与防御策略
8. Robust Speech Processing & Recognition: Speaker ID, Language ID, Speech Recognition/Keyword Spotting, Diarization/Co-Channel/Environmental Characterization, Speaker State Assessment. [R] . Hansen, J. H. 2015

机译：强大的语音处理和识别：说话者ID，语言ID，语音识别/关键字识别，Diarization / Co-Channel /环境表征，说话者状态评估。

Additive attacks on speaker recognition

摘要

著录项

相似文献

相关主题

期刊订阅