首页> 外文期刊>Engineering Economics >HIERARCHICAL CLASSIFICATION TREE MODELING OF NONSTATIONARY NOISE FOR ROBUST SPEECH RECOGNITION
【24h】

HIERARCHICAL CLASSIFICATION TREE MODELING OF NONSTATIONARY NOISE FOR ROBUST SPEECH RECOGNITION

机译:鲁棒语音识别的非平稳噪声的分层分类树建模

获取原文
获取外文期刊封面目录资料

摘要

Noise robustness is a key issue in successful deployment of automatic speech recognition systems in demanding environments such as hospital operating rooms. Perhaps the most successful way to overcome the additive noise obstacle is to employ a model adaptation scheme built around a set of dedicated clean speech and noise-only statistical models. Existing recognizer designs generally rely on relatively simple noise models, as more detailed ones would increase computational demands significantly. Simple models are, however, unable to provide accurate characterization of highly nonstationary noise present in real-world noisy facilities and thereby provide only limited reduction in error rate of the recognizer. The present article describes a novel approach to nonstationary acoustical noise modeling via a set of hierarchically tied hidden Markov models in a classification tree structure. Proposed statistical structure allows detailed description of nonstationary ambient acoustical noise while maintaining low computational costs during recognition. Modeling performance of the proposed construction is verified on a real background noise recorded during a neurosurgery in a hospital operating room.
机译:噪声鲁棒性是在要求苛刻的环境(例如医院手术室)中成功部署自动语音识别系统的关键问题。克服加性噪声障碍的最成功方法也许是采用围绕一组专用的纯净语音和纯噪声统计模型建立的模型自适应方案。现有的识别器设计通常依赖于相对简单的噪声模型,因为更详细的噪声模型会显着增加计算需求。但是,简单的模型无法准确表征现实的嘈杂设施中存在的高度不稳定的噪声,因此只能有限地降低识别器的错误率。本文介绍了一种通过分类树结构中的一组分层绑定隐马尔可夫模型进行非平稳声学噪声建模的新颖方法。提议的统计结构允许对非平稳环境声噪声进行详细描述,同时在识别过程中保持较低的计算成本。在医院手术室进行神经外科手术期间记录的真实背景噪声中验证了所提出结构的建模性能。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号