首页> 外文期刊>Pattern recognition letters >An improved noise-robust voice activity detector based on hidden semi-Markov models
【24h】

An improved noise-robust voice activity detector based on hidden semi-Markov models

机译:基于隐藏半马尔可夫模型的改进型鲁棒语音活动检测器

获取原文
获取原文并翻译 | 示例
           

摘要

To improve the performance of voice activity detector (VAD) in noisy environments, this paper concentrates on three critical aspects related to noise robustness including speech features, feature distributions and temporal dependence. Based on the statistic on TIMIT and NOIZEUS, Mel-frequency cepstrum coefficients (MFCCs) are selected as speech features, Gaussian Mixture distributions (GMD) are applied to associate the observations in MFCC domain with both speech and non-speech states, and Weibull and Gamma distributions are used to explicitly model noise and speech durations, respectively. To integrate these aspects into VAD, the hidden semi-Markov model (HSMM) as a generalized hidden Markov model (HMM) is introduced first. Then the VAD decision is made according to the likelihood ratio test (LRT) incorporating state prior knowledge and modified forward variables of HSMM. We design a recursive way to efficiently calculate modified forward variables. Finally a series of experiments demonstrate: (1) the positive effect of different robustness-related schemes adopted in the proposed VAD; (2) better performance against the standard ITU-T G.729B, Adaptive MultiRate VAD phase 2 (AMR2), Advanced Front-end (AFE), HMM-based VAD and VAD using Laplacian-Gaussian model (LD-GD based VAD).
机译:为了提高在嘈杂环境中的语音活动检测器(VAD)的性能,本文集中在与噪声鲁棒性相关的三个关键方面,包括语音特征,特征分布和时间依赖性。根据TIMIT和NOIZEUS的统计数据,选择梅尔频率倒谱系数(MFCC)作为语音特征,应用高斯混合分布(GMD)将MFCC域中的观测与语音和非语音状态相关联,并将Weibull和伽玛分布分别用于显式建模噪声和语音持续时间。为了将这些方面集成到VAD中,首先引入隐式半马尔可夫模型(HSMM)作为广义隐式马尔可夫模型(HMM)。然后根据包含状态先验知识和HSMM修改后的前向变量的似然比检验(LRT)做出VAD决策。我们设计一种递归方法来有效地计算修改后的前向变量。最后一系列实验证明:(1)在拟议的VAD中采用了与健壮性相关的不同方案的积极效果; (2)相对于标准ITU-T G.729B,自适应多速率VAD阶段2(AMR2),高级前端(AFE),基于HMM的VAD和使用拉普拉斯高斯模型(基于LD-GD的VAD)的VAD具有更好的性能。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号