首页> 外文会议>Chinese Conference on Biometric Recognition >Preliminary Study on Self-contained UBM Construction for Speaker Recognition
【24h】

Preliminary Study on Self-contained UBM Construction for Speaker Recognition

机译:扬声器识别自给式UBM构建的初步研究

获取原文

摘要

Although speaker recognition technology has evolved into some new stages recently, GMM-UBM (Gaussian Mixture Model-Universal Background Model) has always been the base module for the newly developed methods such as SVM, JFA and i-vector. Because of its simplicity, flexibility and robustness, GMM-UBM has been used as a benchmark system for research reference. For traditional UBM construction, speech data from a lot of speakers other than the target speakers should be obtained, which means much cost of data collection. In this paper, we make preliminary exploration on a new approach to train the UBM, named as self-contained UBM, in which only the target speakers' training data were used. We study several strategies of speaker selection for the self-contained UBM construction, gradually reduced from 50 to 3 speakers. Experiments on MASC@CCNT show that our self-contained UBM obtain considerable recognition rate compared with traditional UBM, while needing far less training data thus less training time. Furthermore, we find out that the obtained good ternary UBM speakers have an interesting characteristic of spanning a triangle (UBM speaker triangle) after dimension reduction of MFCC features with PCA.
机译:虽然扬声器识别技术最近已经进化到了一些新阶段,但GMM-UBM(高斯混合模型 - 通用背景模型)始终是新开发方法的基础模块,如SVM,JFA和I形载体。由于其简单性,灵活性和鲁棒性,GMM-UBM已被用作研究参考的基准系统。对于传统的UBM施工,应获得来自目标扬声器以外的大量扬声器的语音数据,这意味着数据收集的大量成本。在本文中,我们对培训UBM的新方法进行初步探索,命名为自包含UBM,其中仅使用目标扬声器的培训数据。我们研究了对自给式UBM建设的几种演讲选择策略,从50到3个发言者逐渐减少。对Masc @ CCNT的实验表明,与传统UBM相比,我们的自给式UBM获得了相当大的识别率,同时需要远远较低的培训数据。此外,我们发现所获得的良好的三元UBM扬声器在用PCA的MFCC特征的尺寸减少后跨越三角形(UBM扬声器三角)的有趣特性。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号