Robust Speech Recognition Based on Dereverberation Parameter Optimization Using Acoustic Model Likelihood

Gomez R.; Kawahara T.

首页> 外文期刊>Audio, Speech, and Language Processing, IEEE Transactions on >Robust Speech Recognition Based on Dereverberation Parameter Optimization Using Acoustic Model Likelihood

【24h】

Robust Speech Recognition Based on Dereverberation Parameter Optimization Using Acoustic Model Likelihood

机译：基于声学模型似然性的去混响参数优化的鲁棒语音识别

获取原文

获取原文并翻译 | 示例

开具论文收录证明 >>

页面导航

摘要
著录项
引文网络
相似文献
相关主题

摘要

Automatic speech recognition (ASR) in reverberant environments is a challenging task. Most dereverberation techniques address this problem through signal processing and enhances the reverberant waveform independent from the speech recognizer. In this paper, we propose a novel scheme to perform dereverberation in relation with the likelihood of the back-end ASR system. Our proposed approach effectively selects the dereverberation parameters, in the form of multiband scale factors, so that they improve the likelihood of the acoustic model. Then, the acoustic model is retrained using the optimal parameters. During the recognition phase, we implement additional optimization of the parameters. By using Gaussian mixture model (GMM), the process for selecting the scale factors become efficient. Moreover, we remove the dependency of the adopted dereverberation technique on the room impulse response (RIR) measurement, by using an artificial RIR generator and selecting based on the acoustic likelihood. Experimental results show significant improvement in recognition performance with the proposed method over the conventional approach.

机译：混响环境中的自动语音识别（ASR）是一项艰巨的任务。大多数去混响技术通过信号处理解决了这个问题，并增强了独立于语音识别器的混响波形。在本文中，我们提出了一种与后端ASR系统的可能性相关的执行去混响的新方案。我们提出的方法以多频带比例因子的形式有效地选择了去混响参数，从而提高了声学模型的可能性。然后，使用最佳参数重新训练声学模型。在识别阶段，我们将对参数进行其他优化。通过使用高斯混合模型（GMM），选择比例因子的过程变得高效。此外，通过使用人工RIR生成器并根据声学似然性进行选择，我们消除了所采用的去混响技术对房间脉冲响应（RIR）测量的依赖性。实验结果表明，与传统方法相比，该方法在识别性能上有显着提高。

著录项

来源
《Audio, Speech, and Language Processing, IEEE Transactions on》 |2010年第7期|p.1708-1716|共9页
作者
Gomez R.; Kawahara T.;
展开▼
作者单位

ACCMS, Kyoto University, Sakyo-ku, Japan;

展开▼
收录信息
原文格式 PDF
正文语种 eng
中图分类
关键词
Automatic speech recognition (ASR); dereverberation; robustness;

机译：自动语音识别（ASR）;去精神病学;健壮性;

相似文献

外文文献
中文文献
专利

1. An End-to-End Deep Learning Approach to Simultaneous Speech Dereverberation and Acoustic Modeling for Robust Speech Recognition [J] . Bo Wu, Kehuang Li, Fengpei Ge, Selected Topics in Signal Processing, IEEE Journal of . 2017,第8期

机译：端到端深度学习方法可同时进行语音去混响和声学建模，以实现可靠的语音识别
2. Phone-based filter parameter optimization of filter and sum robust speech recognition using likelihood maximization [J] . Kouhi-Jelehkaran B., Bakhshi H., Razzazi F. AEU: Archiv fur Elektronik und Ubertragungstechnik: Electronic and Communication . 2010,第12期

机译：基于电话的滤波器参数优化，使用似然最大化进行滤波器和求和鲁棒语音识别
3. A Speaker-Dependent Approach to Single-Channel Joint Speech Separation and Acoustic Modeling Based on Deep Neural Networks for Robust Recognition of Multi-Talker Speech [J] . Yan-Hui Tu, Jun Du, Chin-Hui Lee Journal of signal processing systems for signal, image, and video technology . 2018,第7期

机译：基于说话者的基于深度神经网络的单通道联合语音分离和声学建模方法，用于多语音对话的鲁棒识别
4. Optimization of Dereverberation Parameters based on Likelihood of Speech Recognizer [C] . Randy Gomez, Tatsuya Kawahara International Speech Communication Association . 2009

机译：基于语音识别器可能性的DERERATERATION参数优化
5. Robust Acoustic Modeling and Front-End Design for Distant Speech Recognition [D] . Mirsamadi, Seyedmahdad. 2017

机译：鲁棒的声学建模和远端语音识别前端设计
6. Recognition of Emotions in Mexican Spanish Speech: An Approach Based on Acoustic Modelling of Emotion-Specific Vowels [O] . Santiago-Omar Caballero-Morales 2013

机译：墨西哥西班牙语语音中的情绪识别：一种基于情绪特定元音声学模型的方法
7. Robust Speech Recognition Based on Dereverberation Parameter Optimization Using Acoustic Model Likelihood [O] . Gomez Randy, Kawahara Tatsuya 2010

机译：基于声学模型似然性的去混响参数优化的鲁棒语音识别

Robust Speech Recognition Based on Dereverberation Parameter Optimization Using Acoustic Model Likelihood

摘要

著录项

引文网络

相似文献

相关主题

期刊订阅