首页> 外文会议>2013 Constantinides International Workshop on Signal Processing >Empirical Mode Decomposition VAD based on multiple sensor LRT
【24h】

Empirical Mode Decomposition VAD based on multiple sensor LRT

机译:基于多传感器LRT的经验模式分解VAD

获取原文
获取原文并翻译 | 示例

摘要

Voice Activity Detection (VAD) remains a challenging task given its dependence on adverse noise and reverberation conditions. The problem becomes even more difficult when the microphones used to detect speech reside far from the speaker. In this paper, an unsupervised VAD scheme is presented, based on the Empirical Mode Decomposition (EMD) analysis framework and a multiple input likelihood ratio test (LRT). The highly efficient method of EMD relies on local characteristics of time scale of the data to analyse and decompose non-stationary signals into a set of so called intrinsic mode functions (IMF). These functions are injected to the multiple input LRT scheme in order to decide upon speech presence or absence. To minimize mis-detections and enhance the performance of the hypothesis test, a computationally efficient forgetting scheme along with an adaptive threshold are also employed. Simulations, conducted in several artificial environments, illustrate that significant improvements can be expected, in terms of performance, from the proposed scheme when compared to similar VAD systems.
机译:语音活动检测(VAD)依赖于不利的噪音和混响条件,因此仍然是一项具有挑战性的任务。当用于检测语音的麦克风离扬声器较远时,此问题将变得更加困难。本文基于经验模态分解(EMD)分析框架和多输入似然比检验(LRT),提出了一种无监督的VAD方案。 EMD的高效方法依赖于数据时标的局部特征,以将非平稳信号分析和分解为一组所谓的固有模式函数(IMF)。这些功能被注入到多输入LRT方案中,以便确定语音是否存在。为了最大程度地减少误检测并提高假设检验的性能,还采用了计算有效的遗忘方案以及自适应阈值。在几种人工环境中进行的仿真表明,与相似的VAD系统相比,可以从提议的方案的性能方面实现预期的重大改进。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号