...
首页> 外文期刊>Selected Topics in Signal Processing, IEEE Journal of >An Investigation of Deep-Learning Frameworks for Speaker Verification Antispoofing
【24h】

An Investigation of Deep-Learning Frameworks for Speaker Verification Antispoofing

机译:说话人验证反欺骗的深度学习框架研究

获取原文
获取原文并翻译 | 示例
           

摘要

In this study, we explore the use of deep-learning approaches for spoofing detection in speaker verification. Most spoofing detection systems that have achieved recent success employ hand-craft features with specific spoofing prior knowledge, which may limit the feasibility to unseen spoofing attacks. We aim to investigate the genuine-spoofing discriminative ability from the back-end stage, utilizing recent advancements in deep-learning research. In this paper, alternative network architectures are exploited to target spoofed speech. Based on this analysis, a novel spoofing detection system, which simultaneously employs convolutional neural networks (CNNs) and recurrent neural networks (RNNs) is proposed. In this framework, CNN is treated as a convolutional feature extractor applied on the speech input. On top of the CNN processed output, recurrent networks are employed to capture long-term dependencies across the time domain. Novel features including Teager energy operator critical band autocorrelation envelope, perceptual minimum variance distortionless response, and a more general spectrogram are also investigated as inputs to our proposed deep-learning frameworks. Experiments using the ASVspoof 2015 Corpus show that the integrated CNN–RNN framework achieves state-of-the-art single-system performance. The addition of score-level fusion further improves system robustness. A detailed analysis shows that our proposed approach can potentially compensate for the issue due to short duration test utterances, which is also an issue in the evaluation corpus.
机译:在这项研究中,我们探索了在说话者验证中使用深度学习方法进行欺骗检测。大多数最近获得成功的欺骗检测系统都采用具有特定欺骗先验知识的手工功能,这可能会限制看不见欺骗攻击的可行性。我们的目的是利用深度学习研究的最新进展,从后端阶段研究真正的欺骗性判别能力。在本文中,利用替代网络体系结构来针对欺骗性语音。在此基础上,提出了一种新颖的欺骗检测系统,该系统同时采用卷积神经网络(CNN)和递归神经网络(RNN)。在此框架中,CNN被视为应用于语音输入的卷积特征提取器。除了CNN处理的输出外,还使用循环网络来捕获整个时域的长期依赖关系。还研究了包括Teager能量算子临界带自相关包络,感知最小方差无失真响应和更通用的频谱图等新功能,作为我们提议的深度学习框架的输入。使用ASVspoof 2015语料库进行的实验表明,集成的CNN–RNN框架可实现最先进的单系统性能。得分级别融合的添加进一步提高了系统的鲁棒性。详细的分析表明,我们提出的方法可以弥补由于短时测试发声而引起的问题,这也是评估语料库中的一个问题。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号