首页> 外文期刊>Signal Processing, IET >Efficient harmonic peak detection of vowel sounds for enhanced voice activity detection
【24h】

Efficient harmonic peak detection of vowel sounds for enhanced voice activity detection

机译:有效的元音谐波峰值检测,增强语音活动检测能力

获取原文
获取原文并翻译 | 示例

摘要

Voice activity detection (VAD) involves discriminating speech segments from background noise and is a critical step in numerous speech-related applications. However, distinguishing speech from noise based on the properties of noise is fallible, because it is difficult to predict and characterise the noise occurring in real life. In this study, the authors instead focus on the intrinsic characteristics of speech. The harmonic peaks of vowel sounds have higher energies than the other spectral components of speech and are the speech features most likely to survive in most cases of severe noise. Therefore, the energy differences between harmonic peaks and other spectral features show promise for enabling robust VAD. To exploit this feature, the harmonic peaks must be accurately located. For this purpose, this study proposes an efficient harmonic peak location detection (HPD) method. Based on extensive experiments conducted in the presence of various noise types and signal-to-noise ratios, we found that VAD with the proposed HPD approach outperforms existing VAD methods and does so with reasonable computational cost and higher robustness.
机译:语音活动检测(VAD)涉及将语音段与背景噪声区分开来,并且是众多语音相关应用程序中的关键步骤。然而,基于噪声的性质将语音与噪声区分开是容易犯错的,因为很难预测和表征现实生活中出现的噪声。在这项研究中,作者将注意力集中在语音的内在特征上。元音的谐波峰值具有比语音的其他频谱分量更高的能量,并且是在大多数严重噪声情况下最有可能幸存的语音特征。因此,谐波峰值与其他频谱特征之间的能量差显示了实现强大VAD的希望。要利用此功能,必须准确定位谐波峰值。为此,本研究提出了一种有效的谐波峰值位置检测(HPD)方法。基于在各种噪声类型和信噪比存在下进行的广泛实验,我们发现采用建议的HPD方法的VAD优于现有的VAD方法,并且这样做具有合理的计算成本和更高的鲁棒性。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号