首页> 外文会议> >Bispectra Analysis-Based VAD for Robust Speech Recognition
【24h】

Bispectra Analysis-Based VAD for Robust Speech Recognition

机译:基于双谱分析的VAD用于可靠的语音识别

获取原文
获取原文并翻译 | 示例

摘要

A robust and effective voice activity detection (VAD) algorithm is proposed for improving speech recognition performance in noisy environments. The approach is based on filtering the input channel to avoid high energy noisy components and then the determination of the speech/non-speech bispectra by means of third order auto-cumulants. This algorithm differs from many others in the way the decision rule is formulated (detection tests) and the domain used in this approach. Clear improvements in speech/non-speech discrimination accuracy demonstrate the effectiveness of the proposed VAD. It is shown that application of statistical detection test leads to a better separation of the speech and noise distributions, thus allowing a more effective discrimination and a tradeoff between complexity and performance. The algorithm also incorporates a previous noise reduction block improving the accuracy in detecting speech and non-speech. The experimental analysis carried out on the AURORA databases and tasks provides an extensive performance evaluation together with an exhaustive comparison to the standard VADs such as ITU G.729, GSM AMR and ETSI AFE for distributed speech recognition (DSR), and other recently reported VADs.
机译:提出了一种鲁棒有效的语音活动检测(VAD)算法,以提高嘈杂环境中的语音识别性能。该方法基于对输入通道进行滤波以避免高能量噪声成分,然后基于三阶自动累积量确定语音/非语音双谱。该算法与其他许多算法的不同之处在于制定决策规则(检测测试)的方式以及此方法中使用的域。语音/非语音辨别准确性的明显提高证明了所建议的VAD的有效性。结果表明,统计检测测试的应用可以更好地分离语音和噪声分布,从而可以更有效地进行区分,并在复杂度和性能之间进行权衡。该算法还结合了先前的降噪模块,从而提高了检测语音和非语音的准确性。在AURORA数据库和任务上进行的实验分析提供了广泛的性能评估,并且与标准VAD(例如用于分布式语音识别(DSR)的ITU G.729,GSM AMR和ETSI AFE和其他最近报告的VAD)进行了详尽的比较。 。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号