...
首页> 外文期刊>IEEE signal processing letters >Temporal Structure Normalization of Speech Feature for Robust Speech Recognition
【24h】

Temporal Structure Normalization of Speech Feature for Robust Speech Recognition

机译:语音特征的时态结构归一化,用于鲁棒语音识别

获取原文
获取原文并翻译 | 示例
           

摘要

This letter presents a new feature normalization technique to normalize the temporal structure of speech features. The temporal structure of the features is partially represented by its power spectral density (PSD). We observed that the PSD of the features varies with the corrupting noise and signal-to-noise ratio. To reduce the PSD variation due to noise, we propose to normalize the PSD of features to a reference function by filtering the features. Experimental results on the AURORA-2 task show that the proposed approach when combined with the mean and variance normalization improves the speech recognition accuracy significantly; the system achieves 69.11% relative error rate reduction over the baseline.
机译:这封信提出了一种新的特征归一化技术,以对语音特征的时间结构进行归一化。特征的时间结构部分由其功率谱密度(PSD)表示。我们观察到,特征的PSD随着噪声和信噪比的变化而变化。为了减少由于噪声引起的PSD变化,我们建议通过过滤特征将特征的PSD归一化为参考函数。在AURORA-2任务上的实验结果表明,与均值和方差归一化相结合时,该方法可以显着提高语音识别的准确性。该系统相对于基线的相对错误率降低了69.11%。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号