首页> 外文会议>European Conference on Speech Communication and Technology v.4; 20010903-20010907; Aalborg; DK >Minimum Classification Error Training for Speaker Identification Using Gaussian Mixture Models Based on Multi-Space Probability Distribution
【24h】

Minimum Classification Error Training for Speaker Identification Using Gaussian Mixture Models Based on Multi-Space Probability Distribution

机译:基于多空间概率分布的高斯混合模型用于说话人识别的最小分类误差训练

获取原文
获取原文并翻译 | 示例

摘要

In our previous work, we have proposed a speaker modeling technique using spectral and pitch features for text-independent speaker identification based on Multi-Space Probability Distribution Gaussian Mixture Models (MSD-GMMs). We have presented a maximum likelihood (ML) estimation procedure for the MSD-GMM parameters and demonstrated its high recognition performance. In this paper, we describe an minimum classification error (MCE) training procedure for the MSD-GMM speaker models. MCE training is also applied to automatically estimate mixture-dependent stream weights for spectral and pitch streams. The MCE-based MSD-GMM speaker models are evaluated for a text-independent speaker identification task. Experimental results show that MCE training of the MSD-GMM parameters significantly reduces identification errors and system performance is further improved by appropriately weighting spectral and pitch streams using MCE training.
机译:在我们之前的工作中,我们提出了一种基于频谱和音高特征的说话人建模技术,用于基于多空间概率分布高斯混合模型(MSD-GMM)的与文本无关的说话人识别。我们已经提出了MSD-GMM参数的最大似然(ML)估计程序,并展示了其高识别性能。在本文中,我们描述了MSD-GMM扬声器模型的最小分类误差(MCE)训练过程。 MCE训练也适用于自动估计光谱流和沥青流的混合物相关流权重。对基于MCE的MSD-GMM扬声器模型进行评估,以执行与文本无关的扬声器识别任务。实验结果表明,通过MCE训练对频谱和音高流进行适当加权,可以对MSD-GMM参数进行MCE训练,显着减少了识别错误,并且进一步提高了系统性能。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号