首页> 外文会议>Odyssey 2010: the speaker and language recognition workshop >Computationally Efficient Speaker Identification for Large Population Tasks using MLLR and Sufficient Statistics
【24h】

Computationally Efficient Speaker Identification for Large Population Tasks using MLLR and Sufficient Statistics

机译:使用MLLR和足够的统计数据可有效地执行大型任务的说话人识别

获取原文
获取原文并翻译 | 示例

摘要

In conventional Speaker-Identification using GMM-UBM framework, the likelihood of the given test utterance is computed with respect to all speaker-models before identifying the speaker, based on the maximum likelihood criterion. The calculation of likelihood score of the test utterance is computationally intensive, especially when there are tens of thousands of speakers in database. In this paper, we propose a computationally efficient (Fast) method to calculate the likelihood of the test utterance using speaker-specific Maximum Likelihood Linear Regression (MLLR) matrices (which are precomputed) and sufficient statistics estimated from the test utterance only once. We show that while this method is an order of magnitude faster, there is some degradation in performance. Therefore, we propose a cascaded system with the Fast MLLR system identifying the top-N most probable speakers, followed by a conventional GMM-UBM to identify the most probable speaker from the top-N speakers. Experiments performed on the NIST 2004 database indicate that the cascaded system provides a speed up of 3.16 and 6.08 times for 1-side test (core condition) and 10 sec. testcondition respectively, with a marginal degradation in accuracy over the conventional GMM-UBM system.
机译:在使用GMM-UBM框架的常规说话者识别中,基于最大似然准则,在识别说话者之前,针对所有说话者模型计算给定测试发音的可能性。测试话语的似然评分的计算量很大,尤其是在数据库中有成千上万的说话者时。在本文中,我们提出了一种计算有效的(快速)方法,该方法使用说话人特定的最大似然线性回归(MLLR)矩阵(已预先计算)并从测试话语中估计出足够的统计信息来计算测试话语的可能性。我们显示,虽然此方法的速度快一个数量级,但性能会有所下降。因此,我们提出了一种级联系统,其中快速MLLR系统识别出前N个最有可能说话的人,然后是传统的GMM-UBM来识别前N个最有可能说话的人。在NIST 2004数据库上进行的实验表明,级联系统的1面测试(核心条件)和10秒的速度分别提高了3.16和6.08倍。测试条件,与传统的GMM-UBM系统相比,精度略有下降。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号