首页> 外文期刊>IEICE Transactions on Information and Systems >Continuous Speech Recognition Based on General Factor Dependent Acoustic Models
【24h】

Continuous Speech Recognition Based on General Factor Dependent Acoustic Models

机译:基于总因子相关声学模型的连续语音识别

获取原文
获取原文并翻译 | 示例
           

摘要

This paper describes continuous speech recognition incorporating the additional complement information, e.g., voice characteristics, speaking styles, linguistic information and noise environment, into HMM-based acoustic modeling. In speech recognition systems, context-dependent HMMs, i.e., triphone, and the tree-based context clustering have commonly been used. Several attempts to utilize not only phonetic contexts, but additional complement information based on context (factor) dependent HMMs have been made in recent years. However, when the additional factors for testing data are unobserved, methods for obtaining factor labels is required before decoding. In this paper, we propose a model integration technique based on general factor dependent HMMs for decoding. The integrated HMMs can be used by a conventional decoder as standard triphone HMMs with Gaussian mixture densities. Moreover, by using the results of context clustering, the proposed method can determine an optimal number of mixture components for each state dependently of the degree of influence from additional factors. Phoneme recognition experiments using voice characteristic labels show significant improvements with a small number of model parameters, and a 19.3% error reduction was obtained in noise environment experiments.
机译:本文介绍了将附加的补充信息(例如语音特性,说话风格,语言信息和噪声环境)整合到基于HMM的声学建模中的连续语音识别。在语音识别系统中,通常使用了上下文相关的HMM,即三音机,以及基于树的上下文聚类。近年来,已经进行了多种尝试,不仅利用语音上下文,而且还利用依赖于上下文(因子)的HMM来补充补充信息。然而,当没有观察到用于测试数据的附加因素时,在解码之前需要用于获得因素标签的方法。在本文中,我们提出了一种基于通用因子相关HMM的模型集成技术来进行解码。常规解码器可以将集成的HMM用作具有高斯混合密度的标准三音手机HMM。此外,通过使用上下文聚类的结果,所提出的方法可以根据其他因素的影响程度来确定每种状态的最佳混合组分数。使用语音特征标签的音素识别实验显示,使用少量模型参数可显着改善,并且在噪声环境实验中可减少19.3%的误差。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号