首页> 外文期刊>Journal of Information Recording >On the Effect of the Implementation of Human Auditory Systems on Q-Log-Based Features for Robustness of Speech Recognition Against Noise
【24h】

On the Effect of the Implementation of Human Auditory Systems on Q-Log-Based Features for Robustness of Speech Recognition Against Noise

机译:实施人类听觉系统对基于Q-Log的语音识别抗噪声鲁棒性功能的影响

获取原文
获取原文并翻译 | 示例
       

摘要

Mimicking human auditory systems as well as applying mean normalization in feature extraction are widely believed to improve the robustness of speech recognition. Traditionally, the normalization is conducted in the log domain by subtracting the features with their long-term mean. Some studies have found that the use of power functions instead of log yield more robust features. In previous studies, a q-logarithmic function (q-log), which is also a power function, was used to derive a normalization method. The method, called q-mean normalization (q-MN) in this paper, was found more effective than conventional normalization methods. In these works, q-MN was still applied in the power spectral domain. Here, the method is applied after mapping the power spectra on human auditory systems, and, after an analysis on the effect of the method on noisy speech, we propose a blind and adaptive normalization technique to determine a suitable q in q-MN. The experiments show that the proposed features are more robust than conventional features such as MFCC. The results also confirm that using nonlinear resolutions inspired by human auditory systems benefits speech recognition and is better than using a uniform resolution.
机译:人们普遍认为,模仿人类听觉系统以及在特征提取中应用均值归一化可以提高语音识别的鲁棒性。传统上,归一化是在对数域中通过减去特征的长期平均值来进行的。一些研究发现,使用幂函数而不是对数会产生更强大的功能。在以前的研究中,q对数函数(q-log)也是幂函数,用于推导归一化方法。发现该方法称为q-均值归一化(q-MN),它比常规归一化方法更有效。在这些工作中,q-MN仍然应用于功率谱域。在此,该方法是在将功率谱映射到人类听觉系统上之后应用的,并且在分析了该方法对嘈杂语音的影响之后,我们提出了一种盲法和自适应归一化技术来确定q-MN中合适的q。实验表明,所提出的功能比常规功能(如MFCC)更健壮。结果还证实,使用受人类听觉系统启发的非线性分辨率有利于语音识别,并且比使用统一分辨率更好。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号