首页> 外文会议>International Symposium on Chinese Spoken Language Processing; 20041215-18; Hong Kong(CN) >TEXT-INDEPENDENT SPEAKER IDENTIFICATION USING GMM-UBM AND FRAME LEVEL LIKELIHOOD NORMALIZATION
【24h】

TEXT-INDEPENDENT SPEAKER IDENTIFICATION USING GMM-UBM AND FRAME LEVEL LIKELIHOOD NORMALIZATION

机译:使用GMM-UBM和框架水平相似度标准化的文本无关的说话人识别

获取原文
获取原文并翻译 | 示例

摘要

In this paper, we describe a Gaussian Mixture Model-Universal Background Model (GMM-UBM) speaker identification system. In this GMM-UBM system, we derive the hypothesized speaker model by adapting the parameters of UBM using the speaker's training speech and a form of Bayesian adaptation. The UBM technique is incorporated into the GMM speaker identification system to reduce the time requirement for recognition significantly. The paper also presents a new frame level likelihood score normalization for adjusting different scores of speaker models to get more robust scores in final decision. Experiments on the 2000 NIST Speaker Recognition Evaluation corpus show that GMM-UBM and frame level likelihood score normalization yield better performance. Compared to the baseline system, around 31.2% relative error reduction is obtained from the combination of both techniques.
机译:在本文中,我们描述了高斯混合模型-通用背景模型(GMM-UBM)说话人识别系统。在此GMM-UBM系统中,我们通过使用说话人的训练语音和贝叶斯自适应形式来调整UBM的参数,从而得出假设的说话人模型。 GBM说话人识别系统采用了UBM技术,以大大减少识别所需的时间。本文还提出了一种新的帧级别似然评分归一化方法,用于调整说话人模型的不同评分,以在最终决策中获得更可靠的评分。在2000年NIST说话者识别评估语料库上的实验表明,GMM-UBM和帧级别似然评分归一化可以产生更好的性能。与基线系统相比,两种技术的组合可减少约31.2%的相对误差。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号