首页> 外文期刊>Speech Communication >Feature extraction from higher-lag autocorrelation coefficients for robust speech recognition
【24h】

Feature extraction from higher-lag autocorrelation coefficients for robust speech recognition

机译:从高滞后自相关系数中提取特征以实现鲁棒的语音识别

获取原文
获取原文并翻译 | 示例
           

摘要

In this paper, a feature extraction method that is robust to additive background noise is proposed for automatic speech recognition. Since the background noise corrupts the autocorrelation coefficients of the speech signal mostly at the lower-time lags, while the higher-lag autocorrelation coefficients are least affected, this method discards the lower-lag autocorrelation coefficients and uses only the higher-lag autocorrelation coefficients for spectral estimation. The magnitude spectrum of the windowed higher-lag autocorrelation sequence is used here as an estimate of the power spectrum of the speech signal. This power spectral estimate is processed further (like the well-known Mel frequency cepstral coefficient (MFCC) procedure) by the Mel filter bank, log operation and the discrete cosine transform to get the cepstral coefficients. These cepstral coefficients are referred to as the autocorrelation Mel frequency cepstral coefficients (AMFCCs). We evaluate the speech recognition performance of the AMFCC features on the Aurora and the resource management databases and show that they perform as well as the MFCC features for clean speech and their recognition performance is better than the MFCC features for noisy speech. Finally, we show that the AMFCC features perform better than the features derived from the robust linear prediction-based methods for noisy speech.
机译:针对自动语音识别,提出了一种对加性背景噪声具有鲁棒性的特征提取方法。由于背景噪声主要在较低的时滞处破坏语音信号的自相关系数,而对较高时滞的自相关系数的影响最小,因此该方法会丢弃较低时滞的自相关系数,而仅将较高时滞的自相关系数用于频谱估计。窗口化的较高滞后自相关序列的幅度谱在这里用作语音信号功率谱的估计。通过梅尔滤波器组,对数运算和离散余弦变换,对该功率谱估计值进行进一步处理(类似于众所周知的梅尔频率倒谱系数(MFCC)程序),以得到倒谱系数。这些倒谱系数称为自相关梅尔频率倒谱系数(AMFCC)。我们在Aurora和资源管理数据库上评估了AMFCC功能的语音识别性能,并显示了它们在清洁语音方面的性能与MFCC功能相同,并且其识别性能优于在嘈杂语音方面的MFCC功能。最后,我们证明了AMFCC功能的性能要优于基于鲁棒线性预测方法的嘈杂语音功能。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号