Multichannel Audio Modeling with Elliptically Stable Tensor Decomposition

Abstract

This paper introduces a new method for multichannel speech enhancement based on a versatile model of the residual noise spectrogram. Such a model was previously presented for the single-channel case, where the noise component is assumed to follow an alpha-stable distribution in each time-frequency bin, whereas the speech spectrogram, assumed to be more regular, is modeled as Gaussian. In this paper, we describe a multichannel extension of this model, as well as a Monte Carlo Expectation-Maximisation algorithm for parameter estimation. In particular, a multichannel extension of Itakura-Saito nonnegative matrix factorization is exploited to estimate the spectral parameters for speech, and a Metropolis-Hastings algorithm is proposed to estimate the noise contribution. We evaluate the proposed method in a challenging multichannel denoising application and compare it to other state-of-the-art algorithms.
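For orientation, the following is a minimal sketch of plain single-channel Itakura-Saito NMF, the building block that the abstract says is extended to the multichannel case. It is not the paper's algorithm: the multichannel extension, the alpha-stable noise model, and the Monte Carlo EM loop are not reproduced here. The function names `is_divergence` and `is_nmf`, the toy data, and the square-root (majorization-minimization) form of the multiplicative updates are assumptions chosen for illustration.

```python
import numpy as np


def is_divergence(V, V_hat):
    """Itakura-Saito divergence summed over all time-frequency bins."""
    R = V / V_hat
    return float(np.sum(R - np.log(R) - 1.0))


def is_nmf(V, n_components, n_iter=100, eps=1e-12, seed=0):
    """Factor a positive power spectrogram V (F x N) as W @ H.

    Hypothetical helper: uses multiplicative updates with a square-root
    exponent, which keeps W and H nonnegative at every step.
    """
    rng = np.random.default_rng(seed)
    F, N = V.shape
    W = rng.random((F, n_components)) + eps  # spectral templates
    H = rng.random((n_components, N)) + eps  # temporal activations
    losses = []
    for _ in range(n_iter):
        # Update activations with the templates held fixed.
        V_hat = W @ H + eps
        H *= np.sqrt((W.T @ (V / V_hat**2)) / (W.T @ (1.0 / V_hat)))
        # Update templates with the activations held fixed.
        V_hat = W @ H + eps
        W *= np.sqrt(((V / V_hat**2) @ H.T) / ((1.0 / V_hat) @ H.T))
        losses.append(is_divergence(V, W @ H + eps))
    return W, H, losses


# Toy data (an assumption, not from the paper): a rank-3 variance model
# multiplied by exponential noise, mimicking a Gaussian-source power spectrogram.
rng = np.random.default_rng(1)
W0 = rng.random((64, 3)) + 0.1
H0 = rng.random((3, 40)) + 0.1
V = (W0 @ H0) * rng.exponential(size=(64, 40)) + 1e-9

W, H, losses = is_nmf(V, n_components=3, n_iter=100)
print(f"IS divergence: first iteration {losses[0]:.2f}, last iteration {losses[-1]:.2f}")
```

The Itakura-Saito divergence is scale-invariant, which is why it is commonly paired with Gaussian variance models of audio power spectrograms; the sketch only illustrates that pairing on synthetic data.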

Bibliographic details

Source: International Conference on Latent Variable Analysis and Signal Separation, 2018, pp. 13-23 (11 pages)
Authors: Mathieu Fontaine; Fabian-Robert Stoeter; Antoine Liutkus; Umut Simsekli; Romain Serizel; Roland Badeau
Original format: PDF
