Model Adaptation for Automatic Speech Recognition Based on Multiple Time Scale Evolution


Abstract

In real-world conversation, speech characteristics change because of various factors and at various temporal rates. These temporal changes have their own dynamics, so we propose extending single-time-scale incremental adaptation to multiscale adaptation. This has the potential to greatly increase the model's robustness, because it includes adaptation mechanisms that approximate the nature of the characteristic changes. The formulation of incremental adaptation assumes a time evolution system for the model, in which the posterior distributions used in the decision process are successively updated on a macroscopic time scale in accordance with Kalman filter theory. In this paper, we extend the original incremental adaptation scheme, based on a single time scale, to multiple time scales, and apply the method to the adaptation of both the acoustic model and the language model. We further investigate methods of integrating the multiscale adaptation schemes to achieve robust speech recognition performance. Large-vocabulary continuous speech recognition experiments on English and Japanese lectures revealed the importance of modeling multiscale properties in speech recognition.
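The idea in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: it assumes a single scalar model parameter that drifts as a random walk, a separate Kalman filter per time scale that updates once every `scale` observations on their block average, and a precision-weighted average to integrate the scales. The names `ScalarKalman`, `ScaleTracker`, `make_trackers`, and `combine`, and all default variances, are hypothetical choices made for illustration only.

```python
from dataclasses import dataclass, field
from typing import List, Sequence


@dataclass
class ScalarKalman:
    """Random-walk Kalman filter tracking one scalar model parameter."""
    mean: float         # posterior mean of the parameter
    var: float          # posterior variance of the parameter
    process_var: float  # assumed drift variance per update step
    obs_var: float      # assumed observation noise variance

    def update(self, obs: float) -> None:
        # Predict: the parameter may have drifted since the last update.
        prior_var = self.var + self.process_var
        # Correct: blend prior and observation using the Kalman gain.
        gain = prior_var / (prior_var + self.obs_var)
        self.mean += gain * (obs - self.mean)
        self.var = (1.0 - gain) * prior_var


@dataclass
class ScaleTracker:
    """One filter that updates once every `scale` observations."""
    scale: int
    kf: ScalarKalman
    buffer: List[float] = field(default_factory=list)

    def observe(self, obs: float) -> None:
        self.buffer.append(obs)
        if len(self.buffer) == self.scale:
            # Update on the block average, then start a new block.
            self.kf.update(sum(self.buffer) / self.scale)
            self.buffer.clear()


def make_trackers(scales: Sequence[int], init_mean: float = 0.0,
                  init_var: float = 1.0, drift_var: float = 0.01,
                  noise_var: float = 1.0) -> List[ScaleTracker]:
    # Simplifying assumption: averaging `s` observations divides the noise
    # variance by s, while drift accumulates over the s steps of a block.
    return [
        ScaleTracker(s, ScalarKalman(init_mean, init_var,
                                     drift_var * s, noise_var / s))
        for s in scales
    ]


def combine(trackers: Sequence[ScaleTracker]) -> float:
    # Assumed integration rule: weight each scale by its posterior precision.
    weights = [1.0 / t.kf.var for t in trackers]
    total = sum(w * t.kf.mean for w, t in zip(weights, trackers))
    return total / sum(weights)
```

To use the sketch, build trackers with `make_trackers([1, 4, 16])`, pass each new per-utterance statistic to every tracker's `observe`, and read the integrated estimate from `combine`. Short scales react quickly to fast changes, while long scales give smoother estimates of slow changes.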


Bibliographic record

Source
Annual Conference of the International Speech Communication Association (INTERSPEECH 2011), 2011, pp. 1088-1091, 4 pages
Authors
Shinji Watanabe; Atsushi Nakamura; Biing-Hwang (Fred) Juang
Original format
PDF
Chinese Library Classification
Communications
Keywords
speech recognition; incremental adaptation; multiscale; time evolution system

Similar literature


1. Domain Adaptation Based on Mixture of Latent Words Language Models for Automatic Speech Recognition [J]. Ryo MASUMURA, Taichi ASAMI, Takanobu OBA, IEICE Transactions on Information and Systems. 2018, No. 6
2. A Rapid Model Adaptation Technique for Emotional Speech Recognition with Style Estimation Based on Multiple-Regression HMM [J]. Yusuke IJIMA, Takashi NOSE, Makoto TACHIBANA, IEICE Transactions on Information and Systems. 2010, No. 1
3. Speaking-Rate Adaptation of Automatic Speech Recognition System through Fuzzy Classification based Time-Scale Modification [C]. S. Shahnawazuddin, Waquar Ahmad, Hemant K. Kathania, National Conference on Communications. 2019
4. Acoustic model and adaptation for automatic speech recognition and animal vocalization classification [D]. Tao, Jidong. 2009
5. Capturing Multiple Timescales of Adaptation to Second-Order Statistics With Generalized Linear Models: Gain Scaling and Fractional Differentiation [O]. Kenneth W. Latimer, Adrienne L. Fairhall. 2020
6. Machine translation-based language model adaptation for automatic speech recognition of spoken translations [O]. Pelemans Joris, Vanallemeersch Tom, Demuynck Kris. 2015
