首页> 外文会议>2012 8th International Symposium on Chinese Spoken Language Processing. >Robust voice activity detection using empirical mode decomposition and modulation spectrum analysis
【24h】

Robust voice activity detection using empirical mode decomposition and modulation spectrum analysis

机译:使用经验模态分解和调制频谱分析的可靠语音活动检测

获取原文
获取原文并翻译 | 示例

摘要

Voice activity detection (VAD) is used to detect speechon-speech periods in observed signals. However, the current VAD technique has a serious problem in that the accuracy of detection of speech periods drastically reduces if it is used for noisy speech and/or for mixtures of speechon-speech such as those in music and environmental sounds. Thus, VAD needs to be robust to enable speech periods to be accurately detected in these situations. This paper proposes an approach to robust VAD using empirical mode decomposition (EMD) and modulation spectrum analysis (MSA) to resolve these problems. This is proposed to reducing background noise by using EMD without estimating SNR (noise conditions), and then to determining speechon-speech periods by using MSA. Three experiments on VAD in real environments were conducted to evaluate the proposed method by comparing it with typical methods (Otsu's and G.729B). The results demonstrated that the proposed method could accurately detect speech periods more accurately than the typical methods.
机译:语音活动检测(VAD)用于检测观察到的信号中的语音/非语音时段。但是,当前的VAD技术存在严重的问题,即如果将其用于嘈杂的语音和/或用于语音/非语音的混合(例如音乐和环境声音中的混合),则语音时段的检测精度会大大降低。因此,在这些情况下,VAD必须健壮才能使语音时段能够被准确检测。本文提出了一种使用经验模式分解(EMD)和调制频谱分析(MSA)的鲁棒VAD方法来解决这些问题。提出这是为了通过使用EMD来减少背景噪声而不估计SNR(噪声条件),然后通过使用MSA来确定语音/非语音周期。通过在真实环境中对VAD进行了三个实验,通过与典型方法(Otsu和G.729B)进行比较来评估该方法。结果表明,与传统方法相比,该方法能够更准确地检测语音时段。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号