HIERARCHICAL CLASSIFICATION TREE MODELING OF NONSTATIONARY NOISE FOR ROBUST SPEECH RECOGNITION

Zelinka Petr; Sigmund Milan

首页> 外文期刊>Engineering Economics >HIERARCHICAL CLASSIFICATION TREE MODELING OF NONSTATIONARY NOISE FOR ROBUST SPEECH RECOGNITION

【24h】

HIERARCHICAL CLASSIFICATION TREE MODELING OF NONSTATIONARY NOISE FOR ROBUST SPEECH RECOGNITION

机译：鲁棒语音识别的非平稳噪声的分层分类树建模

获取原文

获取外文期刊封面目录资料

开具论文收录证明 >>

文献代查 >>

文献数据库（团队版） >>

页面导航

摘要
著录项
引文网络
相似文献
相关主题

摘要

Noise robustness is a key issue in successful deployment of automatic speech recognition systems in demanding environments such as hospital operating rooms. Perhaps the most successful way to overcome the additive noise obstacle is to employ a model adaptation scheme built around a set of dedicated clean speech and noise-only statistical models. Existing recognizer designs generally rely on relatively simple noise models, as more detailed ones would increase computational demands significantly. Simple models are, however, unable to provide accurate characterization of highly nonstationary noise present in real-world noisy facilities and thereby provide only limited reduction in error rate of the recognizer. The present article describes a novel approach to nonstationary acoustical noise modeling via a set of hierarchically tied hidden Markov models in a classification tree structure. Proposed statistical structure allows detailed description of nonstationary ambient acoustical noise while maintaining low computational costs during recognition. Modeling performance of the proposed construction is verified on a real background noise recorded during a neurosurgery in a hospital operating room.

机译：噪声鲁棒性是在要求苛刻的环境（例如医院手术室）中成功部署自动语音识别系统的关键问题。克服加性噪声障碍的最成功方法也许是采用围绕一组专用的纯净语音和纯噪声统计模型建立的模型自适应方案。现有的识别器设计通常依赖于相对简单的噪声模型，因为更详细的噪声模型会显着增加计算需求。但是，简单的模型无法准确表征现实的嘈杂设施中存在的高度不稳定的噪声，因此只能有限地降低识别器的错误率。本文介绍了一种通过分类树结构中的一组分层绑定隐马尔可夫模型进行非平稳声学噪声建模的新颖方法。提议的统计结构允许对非平稳环境声噪声进行详细描述，同时在识别过程中保持较低的计算成本。在医院手术室进行神经外科手术期间记录的真实背景噪声中验证了所提出结构的建模性能。

著录项

来源
《Engineering Economics》 |2010年第3期|共页
作者
Zelinka Petr; Sigmund Milan;
展开▼
作者单位

展开▼
收录信息
原文格式 PDF
正文语种
中图分类工业经济;
关键词

相似文献

外文文献
中文文献
专利

1. Recursive estimation of nonstationary noise using iterative stochastic approximation for robust speech recognition [J] . Li Deng, Droppo J., Acero A. IEEE Transactions on Speech and Audio Proceeding . 2003,第6期

机译：递归估计非平稳噪声，使用迭代随机逼近技术进行鲁棒语音识别
2. Recursive estimation of nonstationary noise using iterative stochastic approximation for robust speech recognition [J] . Li Deng, Droppo J., Acero A. IEEE Transactions on Speech and Audio Proceessing . 2003,第6期

机译：递归估计非平稳噪声，使用迭代随机逼近技术进行鲁棒语音识别
3. Spectral Reconstruction and Noise Model Estimation Based on a Masking Model for Noise Robust Speech Recognition [J] . Gonzalez Jose A., Gomez Angel M., Peinado Antonio M., Circuits, systems, and signal processing . 2017,第9期

机译：基于掩蔽模型的谱重构和噪声模型估计，用于噪声鲁棒语音识别
4. A SWITCHING LINEAR GAUSSIAN HIDDEN MARKOV MODEL AND ITS APPLICATION TO NONSTATIONARY NOISE COMPENSATION FOR ROBUST SPEECH RECOGNITION [C] . Jian Wu, Qiang Huo, International Speech Communication Association(ISCA) European Conference on Speech Communication and Technology . 2003

机译：一种切换线性高斯隐马尔可夫模型及其在鲁棒语音识别中的非间断噪声补偿应用
5. Acoustic modeling and speaker normalization strategies with application to robust in-vehicle speech recognition and dialect classification. [D] . Yapanel, Umit. 2005

机译：声学建模和说话人归一化策略及其在强大的车载语音识别和方言分类中的应用。
6. Incorporating Noise Robustness in Speech Command Recognition by Noise Augmentation of Training Data [O] . Ayesha Pervaiz, Fawad Hussain, Huma Israr, 2020

机译：通过训练数据的噪声增强将噪声鲁棒性纳入语音命令识别中
7. RESIDUAL NOISE COMPENSATION FOR ROBUST SPEECH RECOGNITION IN NONSTATIONARY NOISE [O] . Kaisheng Yao T, Bertram E. Shi T, Pascale Fung T, 2008

机译：非平稳噪声中鲁棒语音识别的残余噪声补偿

HIERARCHICAL CLASSIFICATION TREE MODELING OF NONSTATIONARY NOISE FOR ROBUST SPEECH RECOGNITION

摘要

著录项

引文网络

相似文献

相关主题

期刊订阅