A general joint additive and convolutive bias compensation approachapplied to noisy Lombard speech recognition

Afify M.; Yifan Gong; Haton J.-P.

首页> 外文期刊>IEEE Transactions on Speech and Audio Proceessing >A general joint additive and convolutive bias compensation approachapplied to noisy Lombard speech recognition

【24h】

A general joint additive and convolutive bias compensation approachapplied to noisy Lombard speech recognition

机译：通用联合加和卷积偏差补偿方法应用于嘈杂的伦巴德语音识别

获取原文

获取原文并翻译 | 示例

掌桥外文数据库（机构版） >>

开具论文收录证明 >>

文献代查 >>

页面导航

摘要
著录项
相似文献
相关主题

摘要

A unified approach to the acoustic mismatch problem is proposed. A maximum likelihood state-based additive bias compensation algorithm is developed for the continuous density hidden Markov model (CDHMM). Based on this technique, specific bias models in the mel cepstral and the linear spectral domains are presented. Among these models, a new polynomial trend bias model in the mel cepstral domain is derived, which proved effective for Lombard speech compensation. In addition, a joint estimation algorithm for additive and convolutive bias compensation is proposed. This algorithm is based on applying the expectation maximization (EM) technique in both above-mentioned domains, in conjunction with a parallel model combination (PMC) based transformation. The compensation of the dynamic (difference) coefficients in the proposed framework is also studied. The evaluation data base consists of a 21 confusable word vocabulary uttered by 24 speakers. Three mismatched versions of the data base are considered, i.e., Lombard speech, 15 dB noisy Lombard speech, and 5 dB noisy Lombard speech. The proposed techniques result in 50.9%, 74.6%, and 67.3% reduction in the performance difference between matched and uncompensated word error rates for the three mismatch conditions, respectively. When dynamic coefficients are considered the corresponding reductions are 46.8%, 72.4%, and 70.9%

机译：提出了一种解决声学失配问题的统一方法。针对连续密度隐马尔可夫模型（CDHMM），开发了基于最大似然状态的加性偏差补偿算法。基于这种技术，提出了梅尔倒谱和线性光谱域中的特定偏差模型。在这些模型中，推导了一个新的mel倒谱域的多项式趋势偏差模型，证明该模型对Lombard语音补偿有效。另外，提出了一种累加和卷积偏置补偿的联合估计算法。该算法基于在上述两个领域中应用期望最大化（EM）技术以及基于并行模型组合（PMC）的转换。还研究了在所提出的框架中的动态（差分）系数的补偿。评估数据库包括由24位演讲者说出的21个令人困惑的单词词汇。考虑数据库的三个不匹配版本，即，伦巴第语音，15 dB噪声伦巴德语音和5 dB噪声伦巴德语音。对于三种失配条件，所提出的技术可使匹配和未补偿的字错误率之间的性能差异分别降低50.9％，74.6％和67.3％。当考虑动态系数时，相应的减少幅度是46.8％，72.4％和70.9％

著录项

来源
《IEEE Transactions on Speech and Audio Proceessing》 |1998年第6期|p.524-538|共15页
作者
Afify M.; Yifan Gong; Haton J.-P.;
展开▼
作者单位

展开▼
收录信息
原文格式 PDF
正文语种 eng
中图分类电声技术和语音信号处理;
关键词
cepstral analysis; convolution; hidden Markov models; maximum likelihood estimation; noise; speech recognition; 15 dB noisy Lombard speech; 5 dB noisy Lombard speech; CDHMM; Lombard speech compensation; acoustic mismatch problem; bias models; confusable word vocabul;

机译：倒频谱分析;卷积;隐马尔可夫模型;最大似然估计;噪声;语音识别;15 dB朗伯语音噪声;5 dB朗伯语音;CDHMM;Lombard语音补偿;声学失配问题;偏差模型;易混淆的单词词汇;

相似文献

外文文献
中文文献
专利

1. A general joint additive and convolutive bias compensation approach applied to noisy Lombard speech recognition [J] . Afify M., Yifan Gong IEEE Transactions on Speech and Audio Proceeding . 1998,第6期

机译：通用联合加和卷积偏差补偿方法应用于嘈杂的Lombard语音识别
2. An Additive and Convolutive Bias Compensation Algorithm for Telephone Speech Recognition [J] . HAN Zhao-Bing, ZHANG Shu-Wu, XU Bo, Acta Automatica Sinica . 2004,第2期

机译：电话语音识别的加法和卷积偏置补偿算法
3. A Method of Joint Compensation of Additive and Convolutive Distortions for Speaker-Independent Speech Recognition [J] . Gong Y. IEEE Transactions on Speech and Audio Proceessing . 2005,第5期

机译：一种独立于说话人的语音识别的加法和卷积失真联合补偿方法
4. Lombard effect compensation and noise suppression for noisy Lombard speech recognition [C] . Sang-Mun Chi, Yung-Hwan Oh . 1996

机译：朗伯效应补偿和噪声抑制，用于嘈杂的朗伯语音识别
5. Analysis and compensation of the Lombard effect under different types and levels of noise with application to in-set/out-of-set speaker recognition . [D] . Varadarajan, Vaishnevi S. 2007

机译：分析和补偿不同类型和不同噪声水平下的伦巴第效应，并将其应用在室内/室外说话者识别中。
6. Cascaded Convolutional Neural Network Architecture for Speech Emotion Recognition in Noisy Conditions [O] . Youngja Nam, Chankyu Lee 2021

机译：级联卷积神经网络架构用于嘈杂的条件下的语音情感识别
7. Lombard Effect Compensation And Noise Suppression For Noisy Lombard Speech Recognition [O] . Sang-Mun Chi Yung-Hwan 2007

机译：嘈杂的朗伯语音识别中的伦巴效应补偿和噪声抑制
8. Robust Recognition of Loud and Lombard Speech in the Fighter Cockpit Environment [R] . Stanton, B. J. 1988

机译：对战斗机驾驶舱环境中响度和伦巴第语音的鲁棒识别

A general joint additive and convolutive bias compensation approachapplied to noisy Lombard speech recognition

摘要

著录项

相似文献

相关主题

期刊订阅