首页> 外文期刊>電子情報通信学会技術研究報告. 音声. Speech >Inter-speaker weighted MAP adaptation for GMM-supervector speaker recognition
【24h】

Inter-speaker weighted MAP adaptation for GMM-supervector speaker recognition

机译:扬声器间加权MAP自适应用于GMM超向量扬声器识别

获取原文
获取原文并翻译 | 示例
获取外文期刊封面目录资料

摘要

Gaussian Mixture Models (GMM) are ubiquitously used in state-of-the-art speaker recognition systems. The popular GMM-SVM paradigm uses Maximum A Posteriori (MAP) speaker-adapted GMM models by stacking the mean vectors into a supervector that is fed into a Support Vector Machine classifier. In this paper, we modify the standard relevance MAP algorithm to better fit the speaker recognition task. We propose to emphasize the adaptation of the Gaussian mixtures according to the inter-speaker variability exhibited on a training set, thus accounting for both the occupation count and the speaker discrimination ability during adaptation. We evaluate our proposal on a relevance MAP based GMM-SVM system using a large telephone speech corpus such as the one provided in the 2006 NIST Speaker Recognition Evaluation. We show that despite its simplicity this technique is effective.
机译:高斯混合模型(GMM)广泛用于最先进的说话人识别系统。流行的GMM-SVM范例通过将均值向量堆叠到超向量中来使用最大后验(MAP)说话者自适应GMM模型,该超向量被馈送到支持向量机分类器中。在本文中,我们修改了标准相关性MAP算法,以更好地适合说话人识别任务。我们建议根据训练集上显示的说话者之间的变异性来强调高斯混合的适应性,因此要考虑适应过程中的占用人数和说话者辨别能力。我们使用大型电话语音语料库(例如2006年NIST演讲者识别评估中提供的语料库)对基于相关MAP的GMM-SVM系统进行评估。我们表明,尽管其简单性,该技术还是有效的。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号