首页> 外文期刊>WSEAS Transactions on Computers >A Wavelet-Based Voice Activity Detection Algorithm in Variable-Level Noise Environment
【24h】

A Wavelet-Based Voice Activity Detection Algorithm in Variable-Level Noise Environment

机译:可变噪声环境下基于小波的语音活动检测算法

获取原文
获取原文并翻译 | 示例
           

摘要

In this paper, a novel entropy-based voice activity detection (VAD) algorithm is presented in variable-level noise environment. Since the frequency energy of different types of noise focuses on different frequency subband, the effect of corrupted noise on each frequency subband is different. It is found that the seriously obscured frequency subbands have little word signal information left, and are harmful for detecting voice activity segment (VAS). First, we use bark-scale wavelet decomposition (BSWD) to split the input speech into 24 critical subbands. In order to discard the seriously corrupted frequency subband, a method of adaptive frequency subband extraction (AFSE) is then applied to only use the frequency subband. Next, we propose a measure of entropy defined on the spectrum domain of selected frequency subband to form a robust voice feature parameter. In addition, unvoiced is usually eliminated. An unvoiced detection is also integrated into the system to improve the intelligibility of voice. Experimental results show that the performance of this algorithm is superior to the G729B and other entropy-based VAD especially for variable-level background noise.
机译:本文提出了一种在变噪声环境下基于熵的语音活动检测算法。由于不同类型噪声的频率能量集中在不同的子频带上,因此损坏的噪声对每个子频带的影响是不同的。发现严重模糊的频率子带几乎没有单词信号信息,并且对于检测语音活动片段(VAS)是有害的。首先,我们使用树皮尺度小波分解(BSWD)将输入语音分为24个关键子带。为了丢弃严重损坏的频率子带,然后将自适应频率子带提取(AFSE)方法应用于仅使用频率子带。接下来,我们提出一种在所选频率子带的频谱域上定义的熵的度量,以形成鲁棒的语音特征参数。另外,通常消除清音。清音检测也集成到系统中,以提高语音的清晰度。实验结果表明,该算法的性能优于G729B和其他基于熵的VAD,特别是对于可变水平的背景噪声。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号