Maximum Likelihood Normalization for Robust Speech Recognition

机译：强大的语音识别的最大似然归一化

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

It is well-known that additive and channel noise cause shift and scaling in MFCC features. Empirical normalization techniques to estimate and compensate for the effects, such as cep-stral mean subtraction and variance normalization, have been shown to be useful. However, these empirical estimate may not be optimal. In this paper, we approach the problem from two directions, 1) use a more robust MFCC-based features that is less sensitive to additive and channel noise and 2) propose a maximum likelihood (ML) based approach to compensate the noise effect. In addition, we proposed the use of multi-class normalization in which different normalization factors can be applied to different phonetic units. The combination of the robust features and ML normalization is particularly useful for highly mis-matched condition in the Aurora 3 corpus resulting in a 15.8% relative improvement in the highly mis-matched case and a 10.4% relative improvement on average over the three conditions.

机译：众所周知，添加剂和信道噪声会导致MFCC功能中的移位和缩放。已经证明了估计和补偿效果的经验标准化技术，例如Cep-recal平均减法和方差标准化，是有用的。但是，这些经验估计可能不是最佳的。在本文中，我们接近两个方向的问题，1）使用更强大的基于MFCC的特征，对附加和信道噪声不太敏感，并且2）提出基于最大的似然（ML）方法来补偿噪声效果。此外，我们提出了使用多级归一化，其中可以应用于不同的归一化因子。鲁棒特征和Ml归一化的组合对于极光3个语料中的高度错误匹配的病症特别有用，导致高度错误匹配的情况下的相对改善的15.8％，并且在三种条件下平均相对改善10.4％。

著录项

来源
《European Conference on Speech Communication and Technology》|2003年||共4页
会议地点
作者
Yiu-Pong LAI; Man-Hung SIU; International Speech Communication Association(ISCA);
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类自动信息理论;
关键词

相似文献

外文文献
中文文献
专利

1. A Beamforming Algorithm Based on Maximum Likelihood of a Complex Gaussian Distribution With Time-Varying Variances for Robust Speech Recognition [J] . Byung Joon Cho, Jun-Min Lee, Hyung-Min Park IEEE signal processing letters . 2019,第9期

机译：基于最大似然分布且具有时变方差的复杂高斯分布的波束成形算法用于鲁棒语音识别
2. Maximum likelihood subband polynomial regression for robust speech recognition [J] . Yong Lue, Zhenyang Wu Applied Acoustics . 2013,第5期

机译：最大似然子带多项式回归用于鲁棒语音识别
3. Noisy Constrained Maximum-Likelihood Linear Regression for Noise-Robust Speech Recognition [J] . Kim D. K., Gales M. J. F. Audio, Speech, and Language Processing, IEEE Transactions on . 2011,第2期

机译：嘈杂约束最大似然线性回归用于鲁棒语音识别
4. Maximum Likelihood Normalization for Robust Speech Recognition [C] . Yiu-Pong LAI, Man-Hung SIU, International Speech Communication Association(ISCA) European Conference on Speech Communication and Technology . 2003

机译：强大的语音识别的最大似然归一化
5. Acoustic modeling and speaker normalization strategies with application to robust in-vehicle speech recognition and dialect classification. [D] . Yapanel, Umit. 2005

机译：声学建模和说话人归一化策略及其在强大的车载语音识别和方言分类中的应用。
6. Robust unified Granger causality analysis: a normalized maximum likelihood form [O] . Zhenghui Hu, Fei Li, Minjia Cheng, 2021

机译：强大的统一格兰杰因果关系分析：归一化最大可能性形式
7. Feature generation based on maximum normalized acoustic likelihood for improved speech recognition [O] . Xiang Li, Richard M. Stern 2008

机译：基于最大归一化声学似然的特征生成以改进语音识别
8. Improving on hidden Markov models: An articulatorily constrained, maximum likelihood approach to speech recognition and speech coding [R] . Hogden, J. 1996

机译：改进隐马尔可夫模型：语音识别和语音编码的语义约束，最大似然方法

Maximum Likelihood Normalization for Robust Speech Recognition

摘要

著录项

相似文献

相关主题

期刊订阅