首页> 外文会议>IEEE Workshop on Automatic Speech Recognition and Understanding >Deep bottleneck features for i-vector based text-independent speaker verification
【24h】

Deep bottleneck features for i-vector based text-independent speaker verification

机译:基于i向量的独立文本说话者验证的深层瓶颈功能

获取原文

摘要

This paper describes the application of deep neural networks (DNNs), trained to discriminate among speakers, to improving performance in text-independent speaker verification. Activations from the bottleneck layer of these DNNs are used as features in an i-vector based speaker verification system. The features derived from this network are thought to be more robust with respect to phonetic variability, which is generally considered to have a negative impact on speaker verification performance. The verification performance using these features is evaluated on the 2012 NIST SRE core-core condition with models trained from a subset of the Fisher and Switchboard conversational speech corpora. It is found that improved performance, as measured by the minimum detection cost function (minDCF), can be obtained by appending speaker discriminative features to the more widely used mel-frequency cepstrum coefficients.
机译:本文介绍了经过训练以区分说话者的深度神经网络(DNN)在提高与文本无关的说话者验证中的性能的应用。这些DNN瓶颈层的激活被用作基于i-vector的说话者验证系统中的功能。从该网络派生的功能在语音可变性方面被认为更健壮,而语音可变性通常被认为会对说话者验证性能产生负面影响。在2012 NIST SRE核心-核心条件下,使用从Fisher和Switchboard对话语音语料库的子集训练的模型来评估使用这些功能的验证性能。已经发现,通过将说话者的判别特征附加到更广泛使用的梅尔频率倒谱系数上,可以获得通过最小检测成本函数(minDCF)衡量的改进性能。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号