Robust Voice Liveness Detection and Speaker Verification Using Throat Microphones

Md. Sahidullah; Dennis Alexander Lehmann Thomsen; Rosa Gonzalez Hautamäki; Tomi Kinnunen; Zheng-Hua Tan; Robert Parts; Martti Pitkänen

首页> 外文期刊>Audio, Speech, and Language Processing, IEEE/ACM Transactions on >Robust Voice Liveness Detection and Speaker Verification Using Throat Microphones

【24h】

Robust Voice Liveness Detection and Speaker Verification Using Throat Microphones

机译：使用喉咙麦克风进行可靠的语音活动检测和说话人验证

获取原文

获取原文并翻译 | 示例

开具论文收录证明 >>

页面导航

摘要
著录项
引文网络
相似文献
相关主题

摘要

While having a wide range of applications, automatic speaker verification (ASV) systems are vulnerable to spoofing attacks, in particular, replay attacks that are effective and easy to implement. Most prior work on detecting replay attacks uses audio from a single acoustic microphone only, leading to difficulties in detecting high-end replay attacks close to indistinguishable from live human speech. In this paper, we study the use of a special body-conducted sensor, throat microphone (TM), for combined voice liveness detection (VLD) and ASV in order to improve both robustness and security of ASV against replay attacks. We first investigate the possibility and methods of attacking a TM-based ASV system, followed by a pilot data collection. Second, we study the use of spectral features for VLD using both single-channel and dual-channel ASV systems. We carry out speaker verification experiments using Gaussian mixture model with universal background model (GMM-UBM) and i-vector based systems on a dataset of 38 speakers collected by us. We have achieved considerable improvement in recognition accuracy, with the use of dual-microphone setup. In experiments with noisy test speech, the false acceptance rate (FAR) of the dual-microphone GMM-UBM based system for recorded speech reduces from 69.69% to 18.75%. The FAR of replay condition further drops to 0% when this dual-channel ASV system is integrated with the new dual-channel voice liveness detector.

机译：自动扬声器验证（ASV）系统具有广泛的应用范围，但容易受到欺骗攻击，尤其是有效且易于实施的重播攻击。在检测重播攻击方面，大多数先前的工作仅使用来自单个声学麦克风的音频，从而导致难以检测到与现场人类语音几乎无法区分的高端重播攻击。在本文中，我们研究了一种特殊的人体传导传感器，即嗓音麦克风（TM），用于组合语音活动检测（VLD）和ASV，以提高ASV抵抗重放攻击的鲁棒性和安全性。我们首先研究攻击基于TM的ASV系统的可能性和方法，然后进行试验数据收集。其次，我们研究了使用单通道和双通道ASV系统的VLD频谱特征的使用。我们使用高斯混合模型与通用背景模型（GMM-UBM）以及基于i-vector的系统，对我们收集的38位说话者进行了说话人验证实验。通过使用双麦克风设置，我们在识别精度上取得了显着提高。在带有嘈杂测试语音的实验中，基于双麦克风GMM-UBM的用于录制语音的系统的错误接受率（FAR）从69.69％降低到18.75％。当此双通道ASV系统与新的双通道语音活动检测器集成在一起时，重播条件的FAR进一步降至0％。

著录项

来源
《Audio, Speech, and Language Processing, IEEE/ACM Transactions on》 |2018年第1期|44-56|共13页
作者
Md. Sahidullah; Dennis Alexander Lehmann Thomsen; Rosa Gonzalez Hautamäki; Tomi Kinnunen; Zheng-Hua Tan; Robert Parts; Martti Pitkänen;
展开▼
作者单位

展开▼
收录信息
原文格式 PDF
正文语种 eng
中图分类
关键词
Microphones; Speech; Acoustics; Robustness; Speech processing; Security;

机译：麦克风;语音;声学;健壮性;语音处理;安全性;

相似文献

外文文献
中文文献
专利

1. Robust face-voice based speaker identity verification using multilevel fusion [J] . Girija Chetty, Michael Wagner Image and Vision Computing . 2008,第9期

机译：使用多级融合的基于面部声音的健壮说话人身份验证
2. Speaker Verification Under Degraded Conditions Using Empirical Mode Decomposition Based Voice Activity Detection Algorithm [J] . M. S. Rudramurthy, V. Kamakshi Prasad, R. Kumaraswamy Journal of Intelligent Systems . 2014,第4期

机译：基于经验模式分解的语音活动检测算法在退化条件下的说话人验证
3. Early reflection detection using autocorrelation to improve robustness of speaker verification in reverberant conditions [J] . Khamis A. Al-Karawi, Duraid Y. Mohammed International journal of speech technology . 2019,第4期

机译：使用自相关的早期反射检测可提高混响条件下说话者验证的鲁棒性
4. The assessment of efficiency of the automatic speaker recognition system for voices registered using a throat microphone [C] . K. Kaminski, A. P. Dobrowolski, R. Taton Conference on Reconnaissance and Electronic Warfare Systems . 2019

机译：使用喉部麦克风注册的语音自动扬声器识别系统效率评估
5. Robust speech processing based on microphone array, audio-visual, and frame selection for in-vehicle speech recognition and in-set speaker recognition. [D] . Zhang, Xianxian. 2005

机译：基于麦克风阵列，视听和帧选择的强大语音处理功能，可实现车载语音识别和内置说话人识别。
6. Dual Microphone Voice Activity Detection Based on Reliable Spatial Cues [O] . Soojoong Hwang, Yu Gwang Jin, Jong Won Shin 2019

机译：基于可靠空间线索的双麦克风语音活动检测
7. Robust Voice Liveness Detection and Speaker Verification Using Throat Microphones [O] . Md. Sahidullah, Dennis Alexander Lehmann Thomsen, Rosa Gonzalez Hautamaki, 2018

机译：使用喉部麦克风的强大语音激活检测和扬声器验证
8. Throat Microphone (Electro-Voice p/N 678-4881) for Helicopter Door Gunners/Crew Chiefs (Aca-23/67). [R] . 1966

机译：用于直升机门炮手/船员酋长的喉咙麦克风（Electro-Voice p / N 678-4881）（aca-23/67）。

Robust Voice Liveness Detection and Speaker Verification Using Throat Microphones

摘要

著录项

引文网络

相似文献

相关主题

期刊订阅