首页> 外文会议>IEEE International Conference on Acoustics, Speech and Signal Processing >Mean normalization of power function based cepstral coefficients for robust speech recognition in noisy environment
【24h】

Mean normalization of power function based cepstral coefficients for robust speech recognition in noisy environment

机译:基于幂函数的倒谱系数的平均归一化,可在嘈杂的环境中实现鲁棒的语音识别

获取原文

摘要

This paper presents the effect of mean normalization to various types of cepstral coefficients for robust speech recognition in noisy environments. Although the cepstral mean normalization (CMN) technique was originally designed to compensate channel distortion, it has also been proved that the CMN also improves recognition accuracy in additive noisy environment. However, no one has yet considered the interaction of CMN with spectral mapping functions required for extracting cepstral features. This paper investigates the impact of CMN to the speech recognition system depending on the types of spectral mapping function by mathematically analyzing the amount of spectral distortion between clean and noisy conditions. The analytic result is also confirmed by comparing the type of recognition error patterns in automatic speech recognition experiment with Aurora 2 database. Experimental results show that the performance improvement by adopting CMN becomes significant if the logarithmic function is replaced with the appropriate setting of fractional power mapping function. Especially, the deletion errors are dramatically reduced.
机译:本文提出了对各种类型的倒频谱系数进行均值归一化的方法,以在嘈杂的环境中实现鲁棒的语音识别。尽管倒谱均值归一化(CMN)技术最初是为补偿信道失真而设计的,但也已经证明,CMN还可以在加性噪声环境中提高识别精度。但是,还没有人考虑过CMN与提取倒频谱特征所需的频谱映射功能的交互作用。本文通过数学分析干净和嘈杂条件之间的频谱失真量,研究了基于频谱映射函数类型的CMN对语音识别系统的影响。通过将自动语音识别实验中的识别错误模式的类型与Aurora 2数据库进行比较,也可以确定分析结果。实验结果表明,如果将对数函数替换为适当的分数次幂映射函数设置,则采用CMN的性能提高将变得非常重要。特别地,删除错误被大大减少。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号