首页> 外文期刊>Audio, Speech, and Language Processing, IEEE Transactions on >Noise Estimation Using a Constrained Sequential Hidden Markov Model in the Log-Spectral Domain
【24h】

Noise Estimation Using a Constrained Sequential Hidden Markov Model in the Log-Spectral Domain

机译:对数谱域中使用约束顺序隐马尔可夫模型的噪声估计

获取原文
获取原文并翻译 | 示例
           

摘要

The temporal correlation of speech presence/absence is widely used in noise estimation. The most popular technique for exploiting temporal correlation is the smoothing of noisy spectra using a time-recursive filter, in which the forgetting factor is controlled by speech presence probability. However, this technique is not unified into a theoretical framework that enables optimal noise estimation. In theory, hidden Markov models (HMMs) are superior to this technique in modeling temporal correlation. HMMs can model a time sequence of presence/absence of speech signal as a dynamic process of the transition between speech and non-speech states. Moreover, a number of methods, such as maximum likelihood, are available for optimal estimation of HMM parameters. This paper presents a constrained sequential HMM for modeling the log-power sequence on each frequency band. The emission probability of each HMM state is represented by a Gaussian model. The Gaussian mean of the non-speech state is considered as the optimal estimate of noise logarithmic power. The HMM parameter set is sequentially estimated from one frame to another on the basis of maximum likelihood. The proposed method is compared with well-established algorithms through various experiments. Our method delivers more accurate results and does not rely on the assumption of the “non-speech signal onset” as do most algorithms.
机译:语音存在/不存在的时间相关性被广泛用于噪声估计中。利用时间相关性的最流行技术是使用时间递归滤波器对噪声频谱进行平滑,其中遗忘因子由语音存在概率控制。但是,该技术并未统一到能够进行最佳噪声估计的理论框架中。从理论上讲,隐马尔可夫模型(HMM)在建模时间相关性方面优于该技术。 HMM可以将语音信号存在/不存在的时间序列建模为语音和非语音状态之间转换的动态过程。此外,许多方法,例如最大似然性,可用于HMM参数的最佳估计。本文提出了一种约束顺序HMM,用于对每个频带上的对数功率序列进行建模。每个HMM状态的发射概率由高斯模型表示。非语音状态的高斯平均值被认为是噪声对数功率的最佳估计。基于最大似然,从一帧到另一帧顺序地估计HMM参数集。通过各种实验,将该方法与公认的算法进行了比较。我们的方法提供了更准确的结果,并且不像大多数算法那样依赖于“非语音信号开始”的假设。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号