首页> 外文会议>International Symposium on Intelligent Information Technology Application >A Novel Acoustic Feature Extraction Algorithm Based on Root Cepstrum Coefficients and CCBC for Robust Speech Recognition
【24h】

A Novel Acoustic Feature Extraction Algorithm Based on Root Cepstrum Coefficients and CCBC for Robust Speech Recognition

机译:一种基于根谱系统系数的新型声学特征提取算法,CCBC为鲁棒语音识别

获取原文

摘要

Studies have shown that depending on speaker task and environmental conditions, recognizers are sensitive to noisy stressful environments. The focus of this study is to achieve robust recognition in diverse environmental conditions through extracting robust features. Central to the technique is Root Cepstrum Coefficients (RCC) method, instead of logarithm amplitude spectrum and discrete cosine transform of the conventional Mel Frequency Cepstral Coefficients (MFCC), but also using Two-dimensional Root Cepstrum Coefficients (TDRCC). This feature is called TDRCC-MFCC. And then, we consider incorporating Canonical Correlation Based Compensation (CCBC) to cope with the mismatch between training and test set. The mismatch between training and test conditions can be simply clustered into three classes: differences of speakers, changes of recording channel and effects of noisy environment. We evaluate the technique using Back-Propagation Neural Networks (BPNN) on two different tasks: one is in-car speech recognition task, another is different SNR speech recognition. The experimental results show that the novel feature has very good robustness and effectiveness relative to MFCC feature and the CCBC algorithm can make speech recognition system greatly robust to all three kinds of mismatch between training set and test set.
机译:研究表明,根据扬声器任务和环境条件,识别员对嘈杂的压力环境敏感。本研究的重点是通过提取强大的特征来实现各种环境条件的鲁棒识别。该技术的核心是根谱系数(RCC)方法,代替对数幅度谱和传统MEL频率谱系数的离散余弦变换,还使用二维根谱系数(TDRCC)。此功能称为TDRCC-MFCC。然后,我们考虑将基于规范相关的补偿(CCBC)纳入应对训练和测试集之间的不匹配。训练和测试条件之间的不匹配可以简单地集中成三类:扬声器的差异,记录通道的变化和嘈杂环境的影响。我们在两个不同的任务上评估使用反向传播神经网络(BPNN)的技术:一个是车载语音识别任务,另一个是不同的SNR语音识别。实验结果表明,新颖的特征相对于MFCC特征具有非常好的稳健性和有效性,并且CCBC算法可以使语音识别系统对训练集和测试集之间的所有三种失配大大稳健。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号