首页> 外文会议>European Conference on Speech Communication and Technology >Maximum Likelihood Normalization for Robust Speech Recognition
【24h】

Maximum Likelihood Normalization for Robust Speech Recognition

机译:强大的语音识别的最大似然归一化

获取原文

摘要

It is well-known that additive and channel noise cause shift and scaling in MFCC features. Empirical normalization techniques to estimate and compensate for the effects, such as cep-stral mean subtraction and variance normalization, have been shown to be useful. However, these empirical estimate may not be optimal. In this paper, we approach the problem from two directions, 1) use a more robust MFCC-based features that is less sensitive to additive and channel noise and 2) propose a maximum likelihood (ML) based approach to compensate the noise effect. In addition, we proposed the use of multi-class normalization in which different normalization factors can be applied to different phonetic units. The combination of the robust features and ML normalization is particularly useful for highly mis-matched condition in the Aurora 3 corpus resulting in a 15.8% relative improvement in the highly mis-matched case and a 10.4% relative improvement on average over the three conditions.
机译:众所周知,添加剂和信道噪声会导致MFCC功能中的移位和缩放。已经证明了估计和补偿效果的经验标准化技术,例如Cep-recal平均减法和方差标准化,是有用的。但是,这些经验估计可能不是最佳的。在本文中,我们接近两个方向的问题,1)使用更强大的基于MFCC的特征,对附加和信道噪声不太敏感,并且2)提出基于最大的似然(ML)方法来补偿噪声效果。此外,我们提出了使用多级归一化,其中可以应用于不同的归一化因子。鲁棒特征和Ml归一化的组合对于极光3个语料中的高度错误匹配的病症特别有用,导致高度错误匹配的情况下的相对改善的15.8%,并且在三种条件下平均相对改善10.4%。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号