首页> 外文会议>Annual conference of the International Speech Communication Association;INTERSPEECH 2011 >Model Adaptation for Automatic Speech Recognition Based on Multiple Time Scale Evolution
【24h】

Model Adaptation for Automatic Speech Recognition Based on Multiple Time Scale Evolution

机译:基于多时标演化的语音自动识别模型自适应

获取原文

摘要

The change in speech characteristics is originated from various factors, at various (temporal) rates in a real world conversation. These temporal changes have their own dynamics and therefore, we propose to extend the single (time-) incremental adaptations to a multiscale adaptation, which has the potential of greatly increasing the model's robustness as it will include adaptation mechanism to approximate the nature of the characteristic change. The formulation of the incremental adaptation assumes a time evolution system of the model, where the posterior distributions, used in the decision process, are successively updated based on a macroscopic time scale in accordance with the Kalman filter theory. In this paper, we extend the original incremental adaptation scheme, based on a single time scale, to multiple time scales, and apply the method to the adaptation of both the acoustic model and the language model. We further investigate methods to integrate the multi-scale adaptation scheme to realize the robust speech recognition performance. Large vocabulary continuous speech recognition experiments for English and Japanese lectures revealed the importance of modeling multiscale properties in speech recognition.
机译:语音特征的变化源于现实对话中各种因素(以不同的(时间)速率)。这些时间变化具有其自身的动态,因此,我们建议将单次(时间)增量适应扩展为多尺度适应,这将极大地提高模型的鲁棒性,因为它将包括适应机制以逼近特征的性质改变。增量适应的公式化假设模型的时间演化系统,其中决策过程中使用的后验分布根据宏观时标根据卡尔曼滤波理论连续更新。在本文中,我们将基于单个时间尺度的原始增量自适应方案扩展到多个时间尺度,并将该方法应用于声学模型和语言模型的自适应。我们进一步研究了集成多尺度自适应方案以实现鲁棒语音识别性能的方法。用于英语和日语讲座的大型词汇连续语音识别实验表明,在语音识别中建立多尺度属性的重要性。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号