Robust speech recognition based on emphasis filtering on formant regions in mobile noise environment

Abstract

In real environments, the performance of automatic speech recognizers (ASR) is degraded by various noises. Speech enhancement techniques have therefore been applied, but some distortion still remains in the speech region. MFCC-derived features, which are used in most ASR systems, are strongly related to the spectral envelope, and the formants in the spectral envelope are very important for recognizing speech. However, it is very difficult to detect formant frequencies exactly in various noisy environments. We therefore propose an efficient algorithm that detects the spectral regions containing formants based on the spectral ratio, instead of detecting the formant frequencies directly. We then check whether each detected frequency range is valid using several confidence measures. If a range is accepted, the region is amplified by a suitable emphasis filter. Evaluation on a mobile voice search task shows that the proposed method provides some improvement over the baseline system.
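
The three steps named in the abstract (region detection by a spectral ratio, confidence checks, emphasis filtering) can be illustrated with a minimal sketch. The abstract does not define the spectral ratio, the confidence measures, or the filter, so everything below is an assumption made for illustration: the ratio is taken as a finely smoothed magnitude spectrum divided by a coarsely smoothed one, the confidence checks are a bandwidth limit and a mean-ratio floor, and the emphasis is a flat gain over the accepted bins. All function names, thresholds, and the synthetic spectrum are hypothetical and are not the authors' method.

```python
import numpy as np

def moving_average(x, width):
    """Centered moving average with edge padding (width must be odd)."""
    pad = width // 2
    padded = np.pad(x, pad, mode="edge")
    return np.convolve(padded, np.ones(width) / width, mode="valid")

def find_candidate_regions(mag, narrow=5, wide=41, ratio_threshold=1.5):
    """Assumed criterion: bins where the finely smoothed spectrum exceeds
    the coarsely smoothed one by ratio_threshold form candidate regions."""
    ratio = moving_average(mag, narrow) / (moving_average(mag, wide) + 1e-12)
    mask = ratio > ratio_threshold
    # Locate rising and falling edges of the boolean mask.
    padded = np.concatenate(([False], mask, [False]))
    edges = np.flatnonzero(padded[1:] != padded[:-1])
    starts, ends = edges[0::2], edges[1::2]  # end index is exclusive
    return list(zip(starts.tolist(), ends.tolist())), ratio

def is_confident(ratio, start, end, min_bins=3, max_bins=40, min_mean_ratio=1.6):
    """Hypothetical confidence measures: plausible width and strong mean ratio."""
    width = end - start
    return min_bins <= width <= max_bins and ratio[start:end].mean() >= min_mean_ratio

def emphasize(mag, regions, gain_db=6.0):
    """Apply a flat gain (an assumed emphasis filter) to the accepted regions."""
    out = mag.copy()
    gain = 10 ** (gain_db / 20)
    for start, end in regions:
        out[start:end] *= gain
    return out

# Synthetic magnitude spectrum: a flat floor plus two formant-like peaks.
bins = np.arange(257)
mag = (0.1
       + 1.0 * np.exp(-0.5 * ((bins - 40) / 4.0) ** 2)
       + 0.8 * np.exp(-0.5 * ((bins - 120) / 5.0) ** 2))

candidates, ratio = find_candidate_regions(mag)
regions = [(s, e) for s, e in candidates if is_confident(ratio, s, e)]
emphasized = emphasize(mag, regions)
print(regions)
```

In this sketch the detector never estimates a formant frequency; it only marks bands that stand out from their spectral neighborhood, which mirrors the abstract's stated motivation for avoiding direct formant detection in noise.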
