首页> 外文会议>IEEE International Conference on Biometrics Theory, Applications and Systems >Prosodic-Enhanced Siamese Convolutional Neural Networks for Cross-Device Text-Independent Speaker Verification
【24h】

Prosodic-Enhanced Siamese Convolutional Neural Networks for Cross-Device Text-Independent Speaker Verification

机译:韵律增强的连体卷积神经网络用于跨设备的文本无关的说话人验证

获取原文

摘要

In this paper a novel cross-device text-independent speaker verification architecture is proposed. Majority of the state-of-the-art deep architectures that are used for speaker verification tasks consider Mel-frequency cepstral coefficients. In contrast, our proposed Siamese convolutional neural network architecture uses Mel-frequency spectrogram coefficients to benefit from the dependency of the adjacent spectro-temporal features. Moreover, although spectro-temporal features have proved to be highly reliable in speaker verification models, they only represent some aspects of short-term acoustic level traits of the speaker's voice. However, the human voice consists of several linguistic levels such as acoustic, lexicon, prosody, and phonetics, that can be utilized in speaker verification models. To compensate for these inherited shortcomings in spectro-temporal features, we propose to enhance the proposed Siamese convolutional neural network architecture by deploying a multilayer perceptron network to incorporate the prosodic, jitter, and shimmer features. The proposed end-to-end verification architecture performs feature extraction and verification simultaneously. This proposed architecture displays significant improvement over classical signal processing approaches and deep algorithms for forensic cross-device speaker verification.
机译:本文提出了一种新颖的跨设备独立于文本的说话者验证架构。用于说话者验证任务的大多数最新的深层架构都考虑了梅尔频率倒谱系数。相反,我们提出的暹罗卷积神经网络体系结构使用梅尔频率谱图系数来受益于相邻谱时特征的依赖性。此外,尽管频谱时态特征在说话者验证模型中被证明是高度可靠的,但它们仅代表说话者声音的短期声学声级特征的某些方面。但是,人的语音包含多种语言级别,例如声学,词典,韵律和语音,可以在说话者验证模型中使用。为了弥补光谱时态特征中的这些遗传缺陷,我们建议通过部署多层感知器网络以结合韵律,抖动和微光特征来增强建议的暹罗卷积神经网络体系结构。提出的端到端验证体系结构同时执行特征提取和验证。与传统的信号处理方法和用于法医跨设备说话者验证的深度算法相比,该提议的架构显示出显着的改进。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号