【24h】

Speaker Recognition Against Utterance Variations

机译:扬声器识别对话语变化

获取原文
获取外文期刊封面目录资料

摘要

A speaker model in speaker recognition system is to be trained from a large data set gathered in multiple sessions. Large data set requires large amount of memory and computation, and moreover it's practically hard to make users utter the data in several sessions. Recently the incremental adaptation methods are proposed to cover the problems. However, the data set gathered from multiple sessions is vulnerable to the outliers from the irregular utterance variations and the presence of noise, which result in inaccurate speaker model. In this paper, we propose an incremental robust adaptation method to minimize the influence of outliers on Gaussian Mixture Model based speaker model. The robust adaptation is obtained from an incremental version of M-estimation. Speaker model is initially trained from small amount of data and it is adapted recursively with the data available in each session. Experimental results from the data set gathered over seven months show that the proposed method is robust against outliers.
机译:扬声器识别系统中的扬声器模型将从多个会话中收集的大数据集接受培训。大数据集需要大量的内存和计算,而且实际上很难让用户在几个会话中发出数据。最近提出了增量适应方法来涵盖问题。然而,从多个会话中收集的数据集容易受到来自不规则话语变化和存在噪声的异常值,这导致扬声器模型不准确。在本文中,我们提出了一种增量稳健的适应方法,以最大限度地减少基于高斯混合模型的扬声器模型的异常值的影响。从M估计的增量版本获得了鲁棒的适应。扬声器模型最初从少量数据培训,并且它递归适应每个会话中可用的数据。数据集的实验结果聚集在七个月内,表明该方法对异常值具有强大。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号