首页> 外文会议>European Signal Processing Conference >A COMPARISON OF AUDITORY FEATURES FOR ROBUST SPEECH RECOGNITION
【24h】

A COMPARISON OF AUDITORY FEATURES FOR ROBUST SPEECH RECOGNITION

机译:鲁棒语音识别的听觉特征比较

获取原文

摘要

This paper presents a detailed comparison of the performance of two auditory based feature extraction algorithms for automatic speech recognition (ASR). The feature sets are Zero- Crossings with Peak Amplitudes (ZCPA) and the recently introduced Power-Law Nonlinearity and Power-Bias Subtraction (PNCC). Standard Mel-Frequency Cepstral Coefficients (MFCC) are also tested for comparison. Although front-ends have been compared in previous papers, this work focuses on two of the most promising algorithms for noise robustness. The performance of all features is reported on the TIMIT database using a HMM system. It is found that the PNCC features outperform MFCC in clean conditions and are robust to noise. ZCPA performance is shown to vary widely with filterbank configuration and frame length. The ZCPA performance is poor in clean conditions but is the least affected by white noise. PNCC is shown to be the most promising new feature set for robust ASR in recent years.
机译:本文介绍了两个基于听觉的特征提取算法的性能的详细比较,用于自动语音识别(ASR)。特征集是具有峰值幅度(ZCPA)的零点(ZCPA),最近引入的电力 - 法律非线性和功率偏压减法(PNCC)。还测试了标准熔融频率谱系齐数(MFCC)以进行比较。虽然前端已经在先前的论文中进行了比较,但这项工作侧重于噪声稳健性最有前景的两个算法。使用HMM系统在Timit数据库上报告所有功能的性能。发现PNCC在清洁条件下优于MFCC,并且对噪声稳健。 ZCPA性能显示出滤波器配置和帧长度的广泛变化。 ZCPA性能在清洁条件下差,但受白噪声的影响最小。 PNCC显示为近年来为强大的ASR提供最有希望的新功能。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号