首页> 外文会议>European Signal Processing Conference >Maximum likelihood estimation of a reverberation model for robust distant-talking speech recognition

【24h】

Maximum likelihood estimation of a reverberation model for robust distant-talking speech recognition

机译：鲁棒远距离语音识别的混响模型的最大似然估计

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

We propose a novel approach for estimating a reverberation model for a robust recognizer according to [1], which is designed to allow distant-talking automatic speech recognition (ASR) in reverberant environments. Based on a few calibration utterances with known transcriptions recorded in the target environment, a maximum likelihood estimator is used to find the means and variances of the reverberation model. In contrast to [1] and to HMM training on artificially reverberated training data (e. g. [2]), measurements of room impulse responses become unnecessary, and the effort for training is greatly reduced. Simulations of a connected digit recognition task show that, in highly reverberant environments, the reverberation models estimated by the proposed approach achieve significantly higher recognition rates than HMMs trained on reverberant data.

机译：我们根据[1]提出了一种用于估计鲁棒识别器混响模型的新颖方法，该方法旨在允许在混响环境中进行远距离自动语音识别（ASR）。根据目标环境中记录的已知转录的一些校准发声，使用最大似然估计器来找到混响模型的均值和方差。与[1]和针对人工回响的训练数据的HMM训练（例如[2]）相反，房间冲动响应的测量变得不必要，并且训练的工作量大大减少。关联数字识别任务的仿真表明，在高度混响的环境中，与在混响数据上训练的HMM相比，该方法估计的混响模型实现的识别率明显更高。

著录项

来源
《European Signal Processing Conference》|2007年|1299-1303|共5页
会议地点
作者
Sehr Armin; Yuanhang Zheng; Noth Elmar; Kellermann Walter;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类
关键词

相似文献

外文文献
中文文献
专利

1. Reverberation Model-Based Decoding in the Logmelspec Domain for Robust Distant-Talking Speech Recognition [J] . Sehr A., Maas R., Kellermann W. Audio, Speech, and Language Processing, IEEE Transactions on . 2010,第7期

机译：Logmelspec域中基于混响模型的解码，用于鲁棒的远距离语音识别
2. Signal bias removal by maximum likelihood estimation for robust telephone speech recognition [J] . Biing-Hwang Juang, Rahim M.G. IEEE Transactions on Speech and Audio Proceeding . 1996,第1期

机译：通过最大似然估计消除信号偏差，以实现可靠的电话语音识别
3. Correction to “Maximum Likelihood PSD Estimation for Speech Enhancement in Reverberation and Noise” [J] . Adam Kuklasiński, Simon Doclo, Søren Holdt Jensen, Audio, Speech, and Language Processing, IEEE/ACM Transactions on . 2017,第3期

机译：对“用于混响和噪声增强语音的最大似然PSD估计”的校正
4. MAXIMUM LIKELIHOOD ESTIMATION OF A REVERBERATION MODELFOR ROBUST DISTANT-TALKING SPEECH RECOGNITION [C] . Armin Sehr, Yuanhang Zheng, Elmar N?th, EUSIPCO 2007;European signal processing conference . 2007

机译：鲁棒远距离语音识别的重塑模型的最大似然估计
5. Hidden Markov models, maximum mutual information estimation, and the speech recognition problem [D] . Normandin, Yves. 1991

机译：隐藏的马尔可夫模型，最大互信息估计和语音识别问题
6. A Framework for the Comparison of Maximum Pseudo Likelihood and Maximum Likelihood Estimation of Exponential Family Random Graph Models [O] . Marijtje A.J. van Duijn, Krista J. Gile, Mark S. Handcock -1

机译：指数族随机图模型的最大伪似然和最大似然估计的比较框架
7. Maximum Likelihood Estimation of a Reverberation Model for Robust Distant-Talking Speech Recognition [O] . Sehr Armin, Zheng Yuanhang, Nöth Elmar, 2007

机译：鲁棒远距离语音识别的混响模型的最大似然估计
8. Improving on hidden Markov models: An articulatorily constrained, maximum likelihood approach to speech recognition and speech coding [R] . Hogden, J. 1996

机译：改进隐马尔可夫模型：语音识别和语音编码的语义约束，最大似然方法

Maximum likelihood estimation of a reverberation model for robust distant-talking speech recognition

摘要

著录项

相似文献

相关主题

期刊订阅