Mean normalization of power function based cepstral coefficients for robust speech recognition in noisy environment

机译：基于幂函数的倒谱系数的平均归一化，可在嘈杂的环境中实现鲁棒的语音识别

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

This paper presents the effect of mean normalization to various types of cepstral coefficients for robust speech recognition in noisy environments. Although the cepstral mean normalization (CMN) technique was originally designed to compensate channel distortion, it has also been proved that the CMN also improves recognition accuracy in additive noisy environment. However, no one has yet considered the interaction of CMN with spectral mapping functions required for extracting cepstral features. This paper investigates the impact of CMN to the speech recognition system depending on the types of spectral mapping function by mathematically analyzing the amount of spectral distortion between clean and noisy conditions. The analytic result is also confirmed by comparing the type of recognition error patterns in automatic speech recognition experiment with Aurora 2 database. Experimental results show that the performance improvement by adopting CMN becomes significant if the logarithmic function is replaced with the appropriate setting of fractional power mapping function. Especially, the deletion errors are dramatically reduced.

机译：本文提出了对各种类型的倒频谱系数进行均值归一化的方法，以在嘈杂的环境中实现鲁棒的语音识别。尽管倒谱均值归一化（CMN）技术最初是为补偿信道失真而设计的，但也已经证明，CMN还可以在加性噪声环境中提高识别精度。但是，还没有人考虑过CMN与提取倒频谱特征所需的频谱映射功能的交互作用。本文通过数学分析干净和嘈杂条件之间的频谱失真量，研究了基于频谱映射函数类型的CMN对语音识别系统的影响。通过将自动语音识别实验中的识别错误模式的类型与Aurora 2数据库进行比较，也可以确定分析结果。实验结果表明，如果将对数函数替换为适当的分数次幂映射函数设置，则采用CMN的性能提高将变得非常重要。特别地，删除错误被大大减少。

著录项

来源
《IEEE International Conference on Acoustics, Speech and Signal Processing》|2014年|1735-1739|共5页
会议地点
作者
Baek Soonho; Kang Hong-Goo;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类
关键词
CMN; Robust speech recognition;

机译：CMN;强大的语音识别;

相似文献

外文文献
中文文献
专利

1. Enhanced Automatic Speech Recognition System Based on Enhancing Power-Normalized Cepstral Coefficients [J] . Mohamed Tamazin, Ahmed Gouda, Mohamed Khedr Applied Sciences . 2019,第10期

机译：基于增强功率归一化谱系齐系数的增强的自动语音识别系统
2. Normalized Autocorrelation based Features for Robust Speech Recognition in Context with Noisy Environment [J] . Poonam Bansal, Amita Dev, Shail Bala Jain Journal of information and computing science . 2011,第1期

机译：噪声环境下基于归一化自相关的鲁棒语音识别特征
3. Autocorrelation-based noise subtraction method with smoothing, overestimation, energy, and cepstral mean and variance normalization for noisy speech recognition [J] . Gholamreza Farahani EURASIP journal on audio, speech, and music processing . 2017,第1期

机译：基于自相关的噪声减法，具有平滑，高估，能量，倒谱均值和方差归一化的功能，可用于嘈杂的语音识别
4. Mean normalization of power function based cepstral coefficients for robust speech recognition in noisy environment [C] . Baek Soonho, Kang Hong-Goo IEEE International Conference on Acoustics, Speech and Signal Processing . 2014

机译：噪声环境中强大的语音识别的功率功能基于抗痉挛系数的平均归一化
5. Estimation of cepstral coefficients for robust speech recognition. [D] . Indrebo, Kevin M. 2008

机译：倒频谱系数的估计，用于鲁棒的语音识别。
6. Robustness of Auditory Teager Energy Cepstrum Coefficients for Classification of Pathological and Normal Voices in Noisy Environments [O] . Lotfi Salhi, Adnane Cherif 2013

机译：嘈杂环境中听觉Teager能量倒谱系数的病理和正常声音分类的稳健性
7. Power-normalized cepstral coefficients (pncc) for robust speech recognition [O] . Chanwoo Kim, Richard M. Stern 2013

机译：用于鲁棒语音识别的功率归一化倒谱系数（pncc）

Mean normalization of power function based cepstral coefficients for robust speech recognition in noisy environment

摘要

著录项

相似文献

相关主题

期刊订阅