首页> 外文会议>European Conference on Speech Communication and Technology v.2; 20010903-20010907; Aalborg; DK >SPEAKER VERIFICATION USING TARGET AND BACKGROUND DEPENDENT LINEAR TRANSFORMS AND MULTI-SYSTEM FUSION
【24h】

SPEAKER VERIFICATION USING TARGET AND BACKGROUND DEPENDENT LINEAR TRANSFORMS AND MULTI-SYSTEM FUSION

机译:使用目标和背景相关的线性变换和多系统融合对扬声器进行验证

获取原文
获取原文并翻译 | 示例

摘要

This paper describes a GMM-based speaker verification system that uses speaker-dependent background models transformed by speaker-specific maximum likelihood linear transforms to achieve a sharper separation between the target and the non-target acoustic region. The effect of tying, or coupling, Gaussian components between the target and the background model is studied and shown to be a relevant factor with respect to the desired operating point. A fusion of scores from multiple systems built on different acoustic features via a neural network with performance gains over linear combination is also presented. The methods are experimentally studied on the 1999 NIST speaker recognition evaluation data.
机译:本文介绍了一种基于GMM的说话人验证系统,该系统使用通过说话人特定的最大似然线性变换转换的说话人相关背景模型,以在目标和非目标声学区域之间实现更清晰的分离。研究了在目标模型和背景模型之间绑扎或耦合高斯分量的影响,并证明这是与所需工作点有关的因素。还提出了通过神经网络将基于不同声学特征构建的多个系统的乐谱进行融合,并获得优于线性组合的性能。该方法是在1999年NIST说话人识别评估数据上进行实验研究的。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号