Prosodic-Enhanced Siamese Convolutional Neural Networks for Cross-Device Text-Independent Speaker Verification

机译：韵律增强的连体卷积神经网络用于跨设备的文本无关的说话人验证

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

In this paper a novel cross-device text-independent speaker verification architecture is proposed. Majority of the state-of-the-art deep architectures that are used for speaker verification tasks consider Mel-frequency cepstral coefficients. In contrast, our proposed Siamese convolutional neural network architecture uses Mel-frequency spectrogram coefficients to benefit from the dependency of the adjacent spectro-temporal features. Moreover, although spectro-temporal features have proved to be highly reliable in speaker verification models, they only represent some aspects of short-term acoustic level traits of the speaker's voice. However, the human voice consists of several linguistic levels such as acoustic, lexicon, prosody, and phonetics, that can be utilized in speaker verification models. To compensate for these inherited shortcomings in spectro-temporal features, we propose to enhance the proposed Siamese convolutional neural network architecture by deploying a multilayer perceptron network to incorporate the prosodic, jitter, and shimmer features. The proposed end-to-end verification architecture performs feature extraction and verification simultaneously. This proposed architecture displays significant improvement over classical signal processing approaches and deep algorithms for forensic cross-device speaker verification.

机译：本文提出了一种新颖的跨设备独立于文本的说话者验证架构。用于说话者验证任务的大多数最新的深层架构都考虑了梅尔频率倒谱系数。相反，我们提出的暹罗卷积神经网络体系结构使用梅尔频率谱图系数来受益于相邻谱时特征的依赖性。此外，尽管频谱时态特征在说话者验证模型中被证明是高度可靠的，但它们仅代表说话者声音的短期声学声级特征的某些方面。但是，人的语音包含多种语言级别，例如声学，词典，韵律和语音，可以在说话者验证模型中使用。为了弥补光谱时态特征中的这些遗传缺陷，我们建议通过部署多层感知器网络以结合韵律，抖动和微光特征来增强建议的暹罗卷积神经网络体系结构。提出的端到端验证体系结构同时执行特征提取和验证。与传统的信号处理方法和用于法医跨设备说话者验证的深度算法相比，该提议的架构显示出显着的改进。

著录项

来源
《IEEE International Conference on Biometrics Theory, Applications and Systems》|2018年|1-7|共7页
会议地点
作者
Sobhan Soleymani; Ali Dabouei; Seyed Mehdi Iranmanesh; Hadi Kazemi; Jeremy Dawson; Nasser M. Nasrabadi;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类
关键词
Feature extraction; Jitter; Computer architecture; Acoustics; Hidden Markov models; Convolution; Frequency-domain analysis;

机译：特征提取抖动计算机体系结构声学隐马尔可夫模型卷积频域分析;

相似文献

外文文献
中文文献
专利

1. Efficient text-independent speaker verification with structural Gaussian mixture models and neural network [J] . Bing Xiang, Berger T. IEEE Transactions on Speech and Audio Proceessing . 2003,第5期

机译：利用结构高斯混合模型和神经网络进行有效的文本无关说话者验证
2. Efficient text-independent speaker verification with structural Gaussian mixture models and neural network [J] . Bing Xiang, Berger T. IEEE Transactions on Speech and Audio Proceeding . 2003,第5期

机译：利用结构高斯混合模型和神经网络进行有效的文本无关说话者验证
3. TEXT-INDEPENDENT SPEAKER VERIFICATION USING MINIMAL RESOURCE ALLOCATION NETWORKS [J] . LI GUOJIE, P. SARATCHANDRAN, N. SUNDARARAJAN International Journal of Neural Systems . 2004,第6期

机译：使用最小资源分配网络的文本无关的说话人验证
4. Prosodic-Enhanced Siamese Convolutional Neural Networks for Cross-Device Text-Independent Speaker Verification [C] . Sobhan Soleymani, Ali Dabouei, Seyed Mehdi Iranmanesh, IEEE International Conference on Biometrics Theory, Applications and Systems . 2018

机译：博物馆增强暹罗卷积神经网络，用于独立于无关的扬声器验证
5. Speaker Recognition: Evaluation for GMM-UBM and 3D Convolutional Neural Networks Systems [D] . Alghamdi, Mohammad S. 2019

机译：说话者识别：对GMM-UBM和3D卷积神经网络系统的评估
6. A Data-Driven Damage Identification Framework Based on Transmissibility Function Datasets and One-Dimensional Convolutional Neural Networks: Verification on a Structural Health Monitoring Benchmark Structure [O] . Tongwei Liu, Hao Xu, Minvydas Ragulskis, 2020

机译：基于传递函数数据集和一维卷积神经网络的数据驱动型损伤识别框架：结构健康监测基准结构的验证
7. Generalized locally recurrent probabilistic neural networks with application to text-independent speaker verification [O] . Todor Ganchev, Dimitris K. Tasoulis, Michael N. Vrahatis, 2015

机译：广义局部递归概率神经网络及其在文本无关说话人验证中的应用

Prosodic-Enhanced Siamese Convolutional Neural Networks for Cross-Device Text-Independent Speaker Verification

摘要

著录项

相似文献

相关主题

期刊订阅