首页> 外文会议>International conference on latent variable analysis and signal separation >Multichannel Audio Modeling with Elliptically Stable Tensor Decomposition
【24h】

Multichannel Audio Modeling with Elliptically Stable Tensor Decomposition

机译:椭圆稳定张量分解的多通道音频建模

获取原文

摘要

This paper introduces a new method for multichannel speech enhancement based on a versatile modeling of the residual noise spectrogram. Such a model has already been presented before in the single channel case where the noise component is assumed to follow an alpha-stable distribution for each time-frequency bin, whereas the speech spectrogram, supposed to be more regular, is modeled as Gaussian. In this paper, we describe a multichannel extension of this model, as well as a Monte Carlo Expectation - Maximisation algorithm for parameter estimation. In particular, a multichannel extension of the Itakura-Saito nonnegative matrix factorization is exploited to estimate the spectral parameters for speech, and a Metropolis-Hastings algorithm is proposed to estimate the noise contribution. We evaluate the proposed method in a challenging multichannel denoising application and compare it to other state-of-the-art algorithms.
机译:本文介绍了一种基于残噪频谱图通用建模的多通道语音增强新方法。以前已经在单通道情况下提出了这样的模型,在该情况下,假定噪声分量对于每个时频点遵循α稳定分布,而语音频谱图则被认为是更规则的,被建模为高斯模型。在本文中,我们描述了该模型的多通道扩展,以及用于参数估计的蒙特卡罗期望-最大化算法。特别是,利用Itakura-Saito非负矩阵分解的多通道扩展来估计语音的频谱参数,并提出了Metropolis-Hastings算法来估计噪声贡献。我们在具有挑战性的多通道降噪应用中评估了该方法,并将其与其他最新算法进行了比较。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号