首页> 外文会议>Intelligent Multimedia, Video and Speech Processing, 2004. Proceedings of 2004 International Symposium on >RC-MES: a novel speaker modeling technique based on regression class for speaker identification
【24h】

RC-MES: a novel speaker modeling technique based on regression class for speaker identification

机译:RC-MES:基于回归类的说话人识别的新颖说话人建模技术

获取原文

摘要

The speaker modeling technique is an essential problem in robust speaker recognition, especially when enrolment data is sparse. This paper presents a novel modeling approach named multi-eigenspace modeling technique based on regression class (RC-MES), which integrates the common eigenspace technique and the regression class (RC) idea of maximum likelihood linear regression (MLLR). RC-MES not only solves the problem of prior knowledge limitation of Gaussian mixture models (GMM) but also remedies the shortcomings of the common eigenspace that confuses speaker differences and phoneme differences. The eigenvoice analysis in RC can provide better discrimination ability between different speakers. The experimental results on speaker identification of 75 males show that, when enrolment data is sparse, RC-MES provides significant improvement over GMM, and the number of eigenvoices in RC-MES is fewer than that in the common eigenspace.
机译:说话人建模技术是健壮的说话人识别中必不可少的问题,尤其是在注册数据稀疏的情况下。本文提出了一种新颖的建模方法,称为基于回归类的多特征空间建模技术(RC-MES),该方法将常见的特征空间技术与最大似然线性回归(MLLR)的回归类(RC)概念相结合。 RC-MES不仅解决了高斯混合模型(GMM)的先验知识限制的问题,而且还弥补了混淆了说话者差异和音素差异的常见特征空间的缺点。 RC中的本征语音分析可以提供更好的区分不同说话者的能力。 75名男性说话人识别的实验结果表明,当入学数据稀疏时,RC-MES较GMM有了显着改善,并且RC-MES中的本征语音数量少于普通本征空间中的本征语音数量。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号