【24h】

Auditory Contrast Spectrum for Robust Speech Recognition

机译:听觉对比谱用于语音识别

获取原文
获取原文并翻译 | 示例

摘要

Traditional speech representations are based on power spectrum which is obtained by energy integration from many frequency bands. Such representations are sensitive to noise since noise energy distributed in a wide frequency band may deteriorate speech representations. Inspired by the contrast sensitive mechanism in auditory neural processing, in this paper, we propose an auditory contrast spectrum extraction algorithm which is a relative representation of auditory temporal and frequency spectrum. In this algorithm, speech is first processed using a temporal contrast processing which enhances speech temporal modulation envelopes in each auditory filter band and suppresses steady low contrast envelopes. The temporal contrast enhanced speech is then integrated to form speech spectrum which is named as temporal contrast spectrum. The temporal contrast spectrum is then analyzed in spectral scale spaces. Since speech and noise spectral profiles are different, we apply a lateral inhibition function to choose a spectral profile subspace in which noise component is reduced more while speech component is not deteriorated. We project the temporal contrast spectrum to the optimal scale space in which cepstral feature is extracted. We apply this cepstral feature for robust speech recognition experiments on AURORA-2J corpus. The recognition results show that there is 61.12% improvement of relative performance for clean training and 27.45% improvement of relative performance for multi-condition training.
机译:传统的语音表示基于功率谱,该功率谱是通过从许多频带进行能量积分获得的。这样的表示对噪声敏感,因为在宽频带中分布的噪声能量可能会使语音表示恶化。在听觉神经处理中的对比敏感机制的启发下,本文提出了一种听觉对比频谱提取算法,该算法是听觉时间频谱和频谱的相对表示。在此算法中,首先使用时间对比处理来处理语音,该处理会增强每个听觉滤波器频带中的语音时间调制包络并抑制稳定的低对比度包络。然后将时间对比度增强的语音进行积分以形成语音频谱,该频谱被称为时间对比度频谱。然后在光谱标度空间中分析时间对比光谱。由于语音和噪声频谱轮廓不同,因此我们应用横向抑制功能来选择频谱轮廓子空间,在该子空间中,噪声分量会进一步降低,而语音分量不会恶化。我们将时间对比谱投影到倒谱特征提取的最佳尺度空间。我们将此倒谱特性应用于AURORA-2J语料库上的健壮语音识别实验。识别结果表明,清洁训练的相对成绩提高了61.12%,多条件训练的相对成绩提高了27.45%。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号