...
首页> 外文期刊>IEEE Transactions on Signal Processing >Voice Activity Detection Based on Multiple Statistical Models
【24h】

Voice Activity Detection Based on Multiple Statistical Models

机译:基于多种统计模型的语音活动检测

获取原文
获取原文并翻译 | 示例

摘要

One of the key issues in practical speech processing is to achieve robust voice activity detection (VAD) against the background noise. Most of the statistical model-based approaches have tried to employ the Gaussian assumption in the discrete Fourier transform (DFT) domain, which, however, deviates from the real observation. In this paper, we propose a class of VAD algorithms based on several statistical models. In addition to the Gaussian model, we also incorporate the complex Laplacian and Gamma probability density functions to our analysis of statistical properties. With a goodness-of-fit tests, we analyze the statistical properties of the DFT spectra of the noisy speech under various noise conditions. Based on the statistical analysis, the likelihood ratio test under the given statistical models is established for the purpose of VAD. Since the statistical characteristics of the speech signal are differently affected by the noise types and levels, to cope with the time-varying environments, our approach is aimed at finding adaptively an appropriate statistical model in an online fashion. The performance of the proposed VAD approaches in both the stationary and nonstationary noise environments is evaluated with the aid of an objective measure.
机译:实际语音处理中的关键问题之一是针对背景噪声实现鲁棒的语音活动检测(VAD)。大多数基于统计模型的方法都试图在离散傅立叶变换(DFT)域中采用高斯假设,但是这与实际观察结果有所不同。在本文中,我们基于几种统计模型提出了一类VAD算法。除高斯模型外,我们还将复杂的拉普拉斯算子和伽马概率密度函数合并到我们的统计属性分析中。通过拟合优度测试,我们分析了在各种噪声条件下嘈杂语音的DFT频谱的统计特性。在统计分析的基础上,针对VAD的目的,在给定的统计模型下建立了似然比检验。由于语音信号的统计特性受噪声类型和级别的影响不同,为了应对时变环境,我们的方法旨在以在线方式自适应地找到合适的统计模型。借助客观措施评估了所建议的VAD方法在固定和非固定噪声环境中的性能。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号