Speaker Recognition Via Nonlinear Phonetic-and Speaker-Discriminative Features

机译：扬声器识别通过非线性语音和扬声器 - 辨别特征

获取原文

获取外文期刊封面目录资料

页面导航

摘要
著录项
引文网络
相似文献
相关主题

摘要

We use a multi-layer perceptron (MLP) to transform cepstral features into features better suited for speaker recognition. Two types of MLP output targets are considered: phones (Tandem/HATS-MLP) and speakers (Speaker-MLP). In the former case, output activations are used as features in a GMM speaker recognition system, while for the latter, hidden activations are used as features in an SVM system. Using a smaller set of MLP training speakers, chosen through clustering, yields system performance similar to that of a Speaker-MLP trained with many more speakers. For the NIST Speaker Recognition Evaluation 2004, both Tandem/HATS-GMM and Speaker-SVM systems improve upon a basic GMM baseline, but are unable to contribute in a score-level combination with a state-of-the-art GMM system. It may be that the application of normalizations and channel compensation techniques to the current state-of-the-art GMM has reduced channel mismatch errors to the point that contributions of the MLP systems are no longer additive.

机译：我们使用多层Perceptron（MLP）来将倒谱特征转化为更适合扬声器识别的功能。考虑两种类型的MLP输出目标：手机（串联/帽子-MLP）和扬声器（扬声器-MLP）。在前一种情况下，输出激活用作GMM扬声器识别系统中的功能，而对于后者，隐藏的激活用作SVM系统中的功能。使用较小的MLP训练扬声器，通过聚类选择，产生类似于更多更多扬声器培训的扬声器-MLP的系统性能。对于NIST扬声器识别评估2004，串联/帽GMM和扬声器-SVM系统都会改进基本GMM基线，但无法在与最先进的GMM系统中贡献分数级组合。可能的是，将常规和信道补偿技术应用于当前最先进的GMM，这对MLP系统的贡献不再是附加的点的渠道错配误差降低了。

著录项

来源
《International Conference on Nonlinear Speech Processing》|2008年||共10页
会议地点
作者
Lara Stoll; Joe Frankel; Nikki Mirghafori;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类 TP18-53;
关键词

相似文献

外文文献
中文文献
专利

1. Speaker specific feature based clustering and its applications in language independent forensic speaker recognition [J] . Satyanand Singh, Pragya Singh International Journal of Electrical and Computer Engineering . 2020,第4期

机译：基于扬声器特定的功能的聚类及其在语言独立法医扬声器识别中的应用
2. High level speaker specific features modeling in automatic speaker recognition system [J] . Satyanand Singh International Journal of Electrical and Computer Engineering . 2020,第2期

机译：自动扬声器识别系统中高级扬声器特定功能造型
3. Histogram equalization using a reduced feature set of background speakers’ utterances for speaker recognition [J] . Myung-jae?Kim, Il-ho?Yang, Min-seok?Kim, Frontiers of Information Technology & Electronic Engineering . 2017,第5期

机译：使用减少的背景说话者特征集进行直方图均衡以识别说话者
4. Speaker Recognition Via Nonlinear Phonetic-and Speaker-Discriminative Features [C] . Lara Stoll, Joe Frankel, Nikki Mirghafori International Conference on Nonlinear Speech Processing . 2008

机译：扬声器识别通过非线性语音和扬声器 - 辨别特征
5. Physiologically-motivated feature extraction methods for speaker recognition. [D] . Wang, Jianglin. 2013

机译：用于说话人识别的生理动机特征提取方法。
6. Efficient Invariant Features for Sensor Variability Compensation in Speaker Recognition [O] . Abdennour Alimohad, *, Ahmed Bouridane, 2014

机译：说话人识别中传感器可变性补偿的高效不变性
7. Classification of Pitch and Gender of Speakers for Forensic Speaker Recognition from Disguised Voices Using Novel Features Learned by Deep Convolutional Neural Networks [O] . Athulya M. Swamidasan Unni Nair, Sathidevi P. Savithri 2021

机译：使用深度卷积神经网络学习的新功能，从伪装的声音识别法医扬声器识别的讲话者的分类
8. Graph Relational Features for Speaker Recognition and Mining. [R] . Karam, Z. N., Campbell, W. M., Dehak, N. 2016

机译：用于说话人识别和挖掘的图关系特征。

Speaker Recognition Via Nonlinear Phonetic-and Speaker-Discriminative Features

摘要

著录项

引文网络

相似文献

相关主题

期刊订阅