【24h】

Voice Activity Detection using Microphone Array

机译:使用麦克风阵列进行语音活动检测

获取原文
获取原文并翻译 | 示例

摘要

It is useful to decide whether the microphone signal includes a target speech or not at a temporal moment because the process called the voice activity detection (VAD) can reduce any redundant efforts made for the speech coding or the speech recognition, or it can help provide more accurate noise estimation for the speech enhancement. The detection of speech or non-speech in a frame has been simply done by observing the variance of its energy level, zero crossing rate or periodicity. In this occasion, however, the detection error increases exponentially as much as the background noise is added up. Unvoiced fricative sounds which have low energy with being distributed over widebands are more vulnerable to the background noise than any other phonemes are. It is proposed in this literature that voice activity can be detected more robustly in noisy environment by observing the subband power ratio of the noisy speech and its beamformed signal. Also, it is shown to be effective in the fricatives than in the vowels. Whatsoever, this method guarantees much better performance than single microphone VADs when the noise is obviously reduced by beamforming.
机译:决定麦克风信号在某个瞬间是否包含目标语音很有用,因为称为语音活动检测(VAD)的过程可以减少语音编码或语音识别上的任何多余工作,或者可以帮助提供用于语音增强的更准确的噪声估计。通过观察其能量水平,过零率或周期性的变化,可以简单地完成对帧中语音或非语音的检测。然而,在这种情况下,检测误差与背景噪声的增加成指数增长。与其他音素相比,能量低且分布在宽带上的清音摩擦音更容易受到背景噪声的干扰。在该文献中提出,通过观察噪声语音及其波束形成信号的子带功率比,可以在噪声环境中更鲁棒地检测语音活动。同样,它在摩擦音中也比在元音中更有效。无论如何,当通过波束成形明显降低噪声时,此方法可保证比单个麦克风VAD更好的性能。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号