首页> 外文会议>International Congress on Image and Signal Processing >A Hybrid Method of Noise Robust Speech Recognition Based on Fractional Spectral Subtraction and Perceptual Linear Predictive
【24h】

A Hybrid Method of Noise Robust Speech Recognition Based on Fractional Spectral Subtraction and Perceptual Linear Predictive

机译:一种基于分数谱减法和感知线性预测的噪声鲁棒语音识别的混合方法

获取原文

摘要

By combining Fractional Spectral Subtraction (FSS) with Perceptual Linear Predictive (PLP), a hybrid method of noise robustness speech recognition is investigated in this paper. This method uses FSS for noisy speech to reduce noise components in the fractional Fourier domain. According to the results of computing Itakura distance and Mean Square Error (MSE), an approximate optimal fractional order is then obtained by comparing the difference between them. Perceptual Linear Predictive Cepstral Coefficients (PLPCC) is finally computed for the enhanced speech in terms of the above obtained order. It is shown that this hybrid method performs better compared with conventional spectral subtraction and PLPCC for digits speech recognition experiments. Moreover, this method denotes good noise robustness when noise levels increases.
机译:通过将分数谱减法(FSS)与感知线性预测(PLP)组合,本文研究了一种噪声鲁棒性语音识别的混合方法。该方法使用FSS进行嘈杂的语音,以减少分数傅里叶域中的噪声分量。根据计算ITAKURA距离和均方误差(MSE)的结果,然后通过比较它们之间的差异来获得近似的最佳分数顺序。感知线性预测性临床系数(PLPCC)最终计算出上述订单的增强语音。结果表明,与数字语音识别实验的传统光谱减法和PLPCC相比,这种混合方法更好地执行。此外,该方法在噪声水平增加时表示良好的噪声稳健性。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号