首页> 外文会议>IEEE international conference on acoustics, speech, and signal processing >Rapid speaker adaptation using speaker-mixture allophone models applied to speaker-independent speech recognition
【24h】

Rapid speaker adaptation using speaker-mixture allophone models applied to speaker-independent speech recognition

机译:快速扬声器适应使用扬声器 - 混合的allophone模型应用于扬声器的语音识别

获取原文

摘要

A speaker mixture principle that allows the creation of speaker-independent phone models is proposed. Speaker-tied training for rapid speaker adaptation using utterances shorter than one second is derived from this principle. The concept of speaker pruning is also introduced for reducing computational cost without degrading the speaker adaptation performance. The above principle is combined with context-dependent phone models, which have been automatically generated by the successive state splitting algorithm. In a Japanese phrase recognition experiment, speaker mixture allophone models achieved an error reduction of 29.0%, which is high in comparison with the conventional speaker-independent HMM (hidden Markov model)-LR method. Speaker adaptation by speaker-tied training attained an error reduction of 16.8% using a 0.6-s Japanese word utterance. Speaker pruning reduced the number of phone model mixtures by between 50% and 92% without lowering recognition performance.
机译:提出了允许创建扬声器的手机模型的扬声器混合原理。使用短于一秒钟的发言者为快速扬声器适应的讲话训练来自这一原则。还介绍了扬声器修剪的概念,用于降低计算成本而不会降低扬声器适应性能。上述原理与上下文相关的电话模型组合,这些电话模型已被连续状态分割算法自动生成。在日语短语识别实验中,扬声器混合偶联模型达到了29.0%的误差,与传统的扬声器无关的HMM(隐马尔可夫模型)-LR方法相比,这很高。演讲者适应扬声器绑定训练使用0.6-S日语话语达到16.8%的错误。扬声器修剪减少了电话模型混合物的数量在50%和92%之间而不降低识别性能。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号