首页> 外文会议>IEEE International Conference on Acoustics, Speech and Signal Processing;ICASSP >Power-Normalized Cepstral Coefficients (PNCC) for robust speech recognition
【24h】

Power-Normalized Cepstral Coefficients (PNCC) for robust speech recognition

机译:幂归位倒谱系数(PNCC),用于强大的语音识别

获取原文

摘要

This paper presents a new feature extraction algorithm called Power Normalized Cepstral Coefficients (PNCC) that is based on auditory processing. Major new features of PNCC processing include the use of a power-law nonlinearity that replaces the traditional log nonlinearity used in MFCC coefficients, a noise-suppression algorithm based on asymmetric filtering that suppress background excitation, and a module that accomplishes temporal masking. We also propose the use of medium-time power analysis, in which environmental parameters are estimated over a longer duration than is commonly used for speech, as well as frequency smoothing. Experimental results demonstrate that PNCC processing provides substantial improvements in recognition accuracy compared to MFCC and PLP processing for speech in the presence of various types of additive noise and in reverberant environments, with only slightly greater computational cost than conventional MFCC processing, and without degrading the recognition accuracy that is observed while training and testing using clean speech. PNCC processing also provides better recognition accuracy in noisy environments than techniques such as Vector Taylor Series (VTS) and the ETSI Advanced Front End (AFE) while requiring much less computation. We describe an implementation of PNCC using “on-line processing” that does not require future knowledge of the input.
机译:本文提出了一种新的基于听觉处理的特征提取算法,称为功率归一化倒谱系数(PNCC)。 PNCC处理的主要新功能包括:使用幂律非线性来代替MFCC系数中使用的传统对数非线性;基于非对称滤波的噪声抑制算法可抑制背景激励;以及可实现时间掩蔽的模块。我们还建议使用时域功率分析,其中在比语音和频率平滑常用时间更长的持续时间内估计环境参数。实验结果表明,与MFCC和PLP处理相比,在存在各种类型的加性噪声​​和混响环境下,PNCC处理与语音的MFCC处理相比,在识别准确度方面有了实质性的提高,而计算成本仅比常规MFCC处理略高,并且不会降低识别度使用干净的语音进行训练和测试时观察到的准确性。 PNCC处理还可以在嘈杂的环境中提供比Vector Taylor Series(VTS)和ETSI Advanced Front End(AFE)等技术更高的识别精度,同时所需的计算量更少。我们描述了使用“在线处理”的PNCC的实现,该实现不需要将来的输入知识。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号