ETRI Journal

Statistical Model-Based Voice Activity Detection Based on Second-Order Conditional MAP with Soft Decision

Abstract

In this paper, we propose a novel approach to statistical model-based voice activity detection (VAD) that incorporates a second-order conditional maximum a posteriori (CMAP) criterion. As a technical improvement over the first-order CMAP criterion in [1], we condition on both the current observation and the voice activity decisions in the previous two frames, so that the interframe correlation of voice activity is taken into account more fully. Unlike the previous approach [1], which conditions on the decision in the single previous frame, the second-order CMAP uses the decisions in the previous two frames and therefore has four thresholds, giving an additional degree of freedom. A soft-decision scheme is also incorporated, resulting in time-varying thresholds that further improve performance. Experimental results show that the proposed algorithm outperforms the conventional CMAP-based VAD technique under various experimental conditions.
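
The decision rule described in the abstract can be illustrated with a minimal sketch. It assumes that each frame already has a scalar score (for example, a log-likelihood ratio from the statistical model) and that the score is compared with one of four thresholds selected by the decisions in the previous two frames. The function name `second_order_vad`, the threshold values, and the initial state are hypothetical and are not taken from the paper. The soft-decision scheme is not sketched, because the abstract does not specify how the time-varying thresholds are computed.

```python
from typing import Dict, List, Sequence, Tuple

# Hypothetical thresholds, one per pair of previous decisions (d[n-2], d[n-1]).
# The values are illustrative assumptions, not results from the paper.
THRESHOLDS: Dict[Tuple[int, int], float] = {
    (0, 0): 1.5,  # two non-speech frames: require strong evidence for speech
    (0, 1): 0.8,
    (1, 0): 0.6,
    (1, 1): 0.2,  # two speech frames: weaker evidence is enough
}


def second_order_vad(
    scores: Sequence[float],
    thresholds: Dict[Tuple[int, int], float] = THRESHOLDS,
    initial: Tuple[int, int] = (0, 0),
) -> List[int]:
    """Return per-frame decisions (1 = speech, 0 = non-speech).

    Each frame's score is compared with the threshold indexed by the
    decisions made in the previous two frames.
    """
    prev2, prev1 = initial  # assumed start: two non-speech frames
    decisions: List[int] = []
    for score in scores:
        decision = 1 if score > thresholds[(prev2, prev1)] else 0
        decisions.append(decision)
        prev2, prev1 = prev1, decision  # slide the two-frame history
    return decisions


frame_scores = [0.1, 2.0, 1.0, 0.5, 0.3, 0.1, 0.1]
print(second_order_vad(frame_scores))  # [0, 1, 1, 1, 1, 0, 0]
```

With these assumed thresholds, the scores 0.5 and 0.3 are still labelled as speech because the two preceding frames were speech, whereas the same scores after two non-speech frames would be rejected. This is the interframe correlation that conditioning on the previous two decisions is meant to exploit.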
