首页> 外文期刊>Audio, Speech, and Language Processing, IEEE/ACM Transactions on >Robust Voice Liveness Detection and Speaker Verification Using Throat Microphones
【24h】

Robust Voice Liveness Detection and Speaker Verification Using Throat Microphones

机译:使用喉咙麦克风进行可靠的语音活动检测和说话人验证

获取原文
获取原文并翻译 | 示例
           

摘要

While having a wide range of applications, automatic speaker verification (ASV) systems are vulnerable to spoofing attacks, in particular, replay attacks that are effective and easy to implement. Most prior work on detecting replay attacks uses audio from a single acoustic microphone only, leading to difficulties in detecting high-end replay attacks close to indistinguishable from live human speech. In this paper, we study the use of a special body-conducted sensor, throat microphone (TM), for combined voice liveness detection (VLD) and ASV in order to improve both robustness and security of ASV against replay attacks. We first investigate the possibility and methods of attacking a TM-based ASV system, followed by a pilot data collection. Second, we study the use of spectral features for VLD using both single-channel and dual-channel ASV systems. We carry out speaker verification experiments using Gaussian mixture model with universal background model (GMM-UBM) and i-vector based systems on a dataset of 38 speakers collected by us. We have achieved considerable improvement in recognition accuracy, with the use of dual-microphone setup. In experiments with noisy test speech, the false acceptance rate (FAR) of the dual-microphone GMM-UBM based system for recorded speech reduces from 69.69% to 18.75%. The FAR of replay condition further drops to 0% when this dual-channel ASV system is integrated with the new dual-channel voice liveness detector.
机译:自动扬声器验证(ASV)系统具有广泛的应用范围,但容易受到欺骗攻击,尤其是有效且易于实施的重播攻击。在检测重播攻击方面,大多数先前的工作仅使用来自单个声学麦克风的音频,从而导致难以检测到与现场人类语音几乎无法区分的高端重播攻击。在本文中,我们研究了一种特殊的人体传导传感器,即嗓音麦克风(TM),用于组合语音活动检测(VLD)和ASV,以提高ASV抵抗重放攻击的鲁棒性和安全性。我们首先研究攻击基于TM的ASV系统的可能性和方法,然后进行试验数据收集。其次,我们研究了使用单通道和双通道ASV系统的VLD频谱特征的使用。我们使用高斯混合模型与通用背景模型(GMM-UBM)以及基于i-vector的系统,对我们收集的38位说话者进行了说话人验证实验。通过使用双麦克风设置,我们在识别精度上取得了显着提高。在带有嘈杂测试语音的实验中,基于双麦克风GMM-UBM的用于录制语音的系统的错误接受率(FAR)从69.69%降低到18.75%。当此双通道ASV系统与新的双通道语音活动检测器集成在一起时,重播条件的FAR进一步降至0%。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号