首页> 外文会议>IEEE International Conference on Biometrics Theory, Applications and Systems >Prosodic-Enhanced Siamese Convolutional Neural Networks for Cross-Device Text-Independent Speaker Verification
【24h】

Prosodic-Enhanced Siamese Convolutional Neural Networks for Cross-Device Text-Independent Speaker Verification

机译:博物馆增强暹罗卷积神经网络,用于独立于无关的扬声器验证

获取原文

摘要

In this paper a novel cross-device text-independent speaker verification architecture is proposed. Majority of the state-of-the-art deep architectures that are used for speaker verification tasks consider Mel-frequency cepstral coefficients. In contrast, our proposed Siamese convolutional neural network architecture uses Mel-frequency spectrogram coefficients to benefit from the dependency of the adjacent spectro-temporal features. Moreover, although spectro-temporal features have proved to be highly reliable in speaker verification models, they only represent some aspects of short-term acoustic level traits of the speaker's voice. However, the human voice consists of several linguistic levels such as acoustic, lexicon, prosody, and phonetics, that can be utilized in speaker verification models. To compensate for these inherited shortcomings in spectro-temporal features, we propose to enhance the proposed Siamese convolutional neural network architecture by deploying a multilayer perceptron network to incorporate the prosodic, jitter, and shimmer features. The proposed end-to-end verification architecture performs feature extraction and verification simultaneously. This proposed architecture displays significant improvement over classical signal processing approaches and deep algorithms for forensic cross-device speaker verification.
机译:在本文中,提出了一种新颖的跨装置独立的扬声器验证架构。用于扬声器验证任务的大多数最先进的深层架构考虑熔融频率谱系齐数。相比之下,我们所提出的暹罗卷积神经网络架构使用熔融频谱分系数来受益于相邻光谱 - 时间特征的依赖性。此外,尽管在扬声器验证模型中证明了光谱 - 时间特征是高度可靠的,但它们只代表了扬声器语音的短期声学级特征的一些方面。然而,人类的声音包括若干语言水平,例如声学,词典,韵律和语音学,可以用于扬声器验证模型。为了弥补光谱时间特征中的这些继承的缺点,我们建议通过部署多层Perceptron网络来加强暹罗卷积神经网络架构,以包含韵律,抖动和闪光功能。所提出的端到端验证架构同时执行特征提取和验证。该拟议的体系结构显示出对古典信号处理方法和深度算法的显着改进,用于取证交叉设备扬声器验证。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号