首页> 外文会议>International conference on spoken language processing >An Online Incremental Speaker Adaptation Method Using Speaker-Clustered Initial Models
【24h】

An Online Incremental Speaker Adaptation Method Using Speaker-Clustered Initial Models

机译:使用扬声器聚类初始模型的在线增量扬声器适配方法

获取原文

摘要

We previously proposed an incremental speaker adaptation method combined with automatic speaker-change detection for broadcast news transcription where speakers change frequently and each of them utters a series of several sentences. In this method, the speaker change is detected using speaker-independent and speaker-adaptive Gaussian mixture models (GMMs). Both phone HMMs and GMMs are incrementally adapted to each speaker by the combination of MLLR, MAP and VFS methods using speaker by the combination of MLLR, MAP and VFS methods using speaker-independent (SI) models as initial models. This paper proposes its improvement in which an initial model for speaker adaptation is selected from a set of models made by speaker clustering. Either cluster-dependent phone HMMs or GMMs are used to calculate the likelihood for selecting the best initial model. In a broadcast news transcription task, the proposed method significantly reduces word error rate compared with the method using SI-HMM as an initial model. Online incremental speaker adaptation results show that word errr rate is reduced by 11.6
机译:我们之前提出了一个增量扬声器适配方法,结合自动扬声器更改检测,用于广播新闻转录,扬声器经常改变,并且它们中的每一个都展开了一系列句子。在该方法中,使用扬声器独立的和扬声器 - 自适应高斯混合模型(GMMS)来检测扬声器变化。通过使用MLLR,MAP和VFS方法的组合使用MLLR,MAP和VFS方法的组合,使用MLLR,MAP和VFS方法的组合,使用扬声器独立(SI)模型作为初始模型,通过MLLR,MAP和VFS方法的组合逐步适应每个扬声器。本文提出了改进,其中扬声器适应初始模型选自扬声器聚类制作的一组模型。依赖于群集的电话HMMS或GMMS用于计算选择最佳初始模型的可能性。在广播新闻转录任务中,与使用SI-HMM的方法作为初始模型相比,该方法显着降低了字错误率。在线增量扬声器适应结果表明,IRRR速率率降低11.6

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号