首页> 外文会议>International Conference on Nonlinear Speech Processing >Speaker Recognition Via Nonlinear Phonetic-and Speaker-Discriminative Features
【24h】

Speaker Recognition Via Nonlinear Phonetic-and Speaker-Discriminative Features

机译:扬声器识别通过非线性语音和扬声器 - 辨别特征

获取原文
获取外文期刊封面目录资料

摘要

We use a multi-layer perceptron (MLP) to transform cepstral features into features better suited for speaker recognition. Two types of MLP output targets are considered: phones (Tandem/HATS-MLP) and speakers (Speaker-MLP). In the former case, output activations are used as features in a GMM speaker recognition system, while for the latter, hidden activations are used as features in an SVM system. Using a smaller set of MLP training speakers, chosen through clustering, yields system performance similar to that of a Speaker-MLP trained with many more speakers. For the NIST Speaker Recognition Evaluation 2004, both Tandem/HATS-GMM and Speaker-SVM systems improve upon a basic GMM baseline, but are unable to contribute in a score-level combination with a state-of-the-art GMM system. It may be that the application of normalizations and channel compensation techniques to the current state-of-the-art GMM has reduced channel mismatch errors to the point that contributions of the MLP systems are no longer additive.
机译:我们使用多层Perceptron(MLP)来将倒谱特征转化为更适合扬声器识别的功能。考虑两种类型的MLP输出目标:手机(串联/帽子-MLP)和扬声器(扬声器-MLP)。在前一种情况下,输出激活用作GMM扬声器识别系统中的功能,而对于后者,隐藏的激活用作SVM系统中的功能。使用较小的MLP训练扬声器,通过聚类选择,产生类似于更多更多扬声器培训的扬声器-MLP的系统性能。对于NIST扬声器识别评估2004,串联/帽GMM和扬声器-SVM系统都会改进基本GMM基线,但无法在与最先进的GMM系统中贡献分数级组合。可能的是,将常规和信道补偿技术应用于当前最先进的GMM,这对MLP系统的贡献不再是附加的点的渠道错配误差降低了。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号