Automatic Speech Recognition & Understanding, 2009. ASRU 2009

Temporal envelope subtraction for robust speech recognition using modulation spectrum



Abstract

In this paper, we present a new noise compensation technique for modulation frequency features derived from syllable-length segments of subband temporal envelopes. The subband temporal envelopes are estimated using frequency domain linear prediction (FDLP). We propose a technique for noise compensation in FDLP in which an estimate of the noise envelope is subtracted from the noisy speech envelope. The noise-compensated FDLP envelopes are compressed with static (logarithmic) and dynamic (adaptive loops) compression and are transformed into modulation spectral features. Experiments are performed on a phoneme recognition task as well as a connected digit recognition task, in which the test data is corrupted with a variety of noise types at different signal-to-noise ratios. In these experiments with mismatched train and test conditions, the proposed features provide considerable improvements compared to other state-of-the-art noise-robust feature extraction techniques (average relative improvements of 25% and 35% over the baseline PLP features for the phoneme and word recognition tasks, respectively).
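The envelope subtraction and static compression steps described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the abstract does not say how the noise envelope is estimated or how negative values are handled, so the leading-segment noise estimate, the `floor` parameter, and all function names below are assumptions made for illustration. The FDLP envelope estimation itself and the adaptive-loop compression are not shown.

```python
import math


def estimate_noise_envelope(noisy_env, num_noise_samples):
    """Hypothetical noise estimate: mean of the first samples, assumed speech-free.

    Returns a constant envelope of the same length as the input.
    """
    if not 0 < num_noise_samples <= len(noisy_env):
        raise ValueError("num_noise_samples must be in 1..len(noisy_env)")
    level = sum(noisy_env[:num_noise_samples]) / num_noise_samples
    return [level] * len(noisy_env)


def subtract_noise_envelope(noisy_env, noise_env, floor=0.01):
    """Subtract the noise envelope estimate from the noisy envelope.

    Assumption: results are floored at a small fraction of the noisy
    envelope so the output stays positive before log compression.
    """
    if len(noisy_env) != len(noise_env):
        raise ValueError("envelopes must have the same length")
    return [max(x - n, floor * x) for x, n in zip(noisy_env, noise_env)]


def log_compress(env, eps=1e-10):
    """Static (logarithmic) compression of a positive envelope."""
    return [math.log(max(v, eps)) for v in env]


if __name__ == "__main__":
    # Toy subband envelope: a noise-only start followed by a speech burst.
    noisy = [0.2, 0.2, 1.2, 2.2, 0.2]
    noise = estimate_noise_envelope(noisy, num_noise_samples=2)
    compensated = subtract_noise_envelope(noisy, noise)
    print(log_compress(compensated))
```

In this sketch the flooring plays the same role as in conventional spectral subtraction: it keeps the compensated envelope strictly positive so that the logarithm is defined in noise-only regions.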
