Multi-Channel Linear Prediction-Based Speech Dereverberation With Sparse Priors

Jukic Ante; van Waterschoot Toon; Gerkmann Timo; Doclo Simon

首页> 外文期刊>Audio, Speech, and Language Processing, IEEE/ACM Transactions on >Multi-Channel Linear Prediction-Based Speech Dereverberation With Sparse Priors

【24h】

Multi-Channel Linear Prediction-Based Speech Dereverberation With Sparse Priors

机译：具有稀疏先验的基于多通道线性预测的语音混响

获取原文

获取原文并翻译 | 示例

开具论文收录证明 >>

页面导航

摘要
著录项
引文网络
相似文献
相关主题

摘要

The quality of speech signals recorded in an enclosure can be severely degraded by room reverberation. In this paper, we focus on a class of blind batch methods for speech dereverberation in a noiseless scenario with a single source, which are based on multi-channel linear prediction in the short-time Fourier transform domain. Dereverberation is performed by maximum-likelihood estimation of the model parameters that are subsequently used to recover the desired speech signal. Contrary to the conventional method, we propose to model the desired speech signal using a general sparse prior that can be represented in a convex form as a maximization over scaled complex Gaussian distributions. The proposed model can be interpreted as a generalization of the commonly used time-varying Gaussian model. Furthermore, we reformulate both the conventional and the proposed method as an optimization problem with an -norm cost function, emphasizing the role of sparsity in the considered speech dereverberation methods. Experimental evaluation in different acoustic scenarios show that the proposed approach results in an improved performance compared to the conventional approach in terms of instrumental measures for speech quality.

机译：房间混响会严重降低外壳中记录的语音信号的质量。在本文中，我们集中在基于短时傅立叶变换域中的多通道线性预测的一类无噪声场景下，具有单一来源的语音去混响的盲处理方法。去混响是通过模型参数的最大似然估计执行的，模型参数随后用于恢复所需的语音信号。与传统方法相反，我们建议使用通用稀疏先验对所需语音信号进行建模，该稀疏先验可以凸面形式表示为缩放后的复杂高斯分布的最大化。所提出的模型可以解释为常用的时变高斯模型的推广。此外，我们将常规方法和拟议方法都重新设计为具有-norm成本函数的优化问题，强调了稀疏性在考虑的语音去混响方法中的作用。在不同声学场景下的实验评估表明，与传统方法相比，该方法在语音质量的仪器测量方面具有更高的性能。

著录项

来源
《Audio, Speech, and Language Processing, IEEE/ACM Transactions on》 |2015年第9期|1509-1520|共12页
作者
Jukic Ante; van Waterschoot Toon; Gerkmann Timo; Doclo Simon;
展开▼
作者单位

Department of Medical Physics and Acoustics, University of Oldenburg, Oldenburg, Germany;

展开▼
收录信息
原文格式 PDF
正文语种 eng
中图分类
关键词
Multi-channel linear prediction; sparse priors; speech dereverberation; speech enhancement;

机译：多通道线性预测;稀疏先验;语音去混响;语音增强;

相似文献

外文文献
中文文献
专利

1. Blind speech dereverberation using sparse decomposition and multi-channel linear prediction [J] . Leila Mousavi, Farbod Razzazi, Afrooz Haghbin International journal of speech technology . 2019,第3期

机译：稀疏分解和多通道线性预测的盲语音去混响
2. On a Blind Speech Dereverberation Algorithm Using Multi-Channel Linear Prediction [J] . Marc DELCROIX, Takafumi HIKICHI, Masato MIYOSHI IEICE Transactions on Fundamentals of Electronics, Communications and Computer Sciences . 2006,第10期

机译：基于多通道线性预测的盲语音去混响算法
3. Blind dereverberation algorithm for speech signals based on multi-channel linear prediction [J] . Marc Delcroix, Masato Miyoshi, Takafumi Hikichi Acoustical science and technology . 2005,第5期

机译：基于多通道线性预测的语音信号盲去混响算法
4. Speech dereverberation with multi-channel linear prediction and sparse priors for the desired signal [C] . Jukic Ante, van Waterschoot Toon, Gerkmann Timo, . 2014

机译：具有多通道线性预测和稀疏先验的语音去混响
5. Development of dereverberation algorithms for improved speech intelligibility by cochlear implant users. [D] . Hazrati, Oldooz. 2012

机译：开发去混响算法，以改善人工耳蜗使用者的语音清晰度。
6. Iterative hard thresholding in genome-wide association studies: Generalized linear models prior weights and double sparsity [O] . Benjamin B Chu, Kevin L Keys, Christopher A German, 2020

机译：全基因组关联研究中的迭代硬阈值：广义线性模型先验权重和双稀疏性
7. Partitioned Block Frequency Domain Kalman Filter for Multi-Channel Linear Prediction based Blind Speech Dereverberation [O] . Dietzen Thomas, Spriet A, Tirry W, 2016

机译：基于分块频域卡尔曼滤波器的多通道线性语音盲混响预测

Multi-Channel Linear Prediction-Based Speech Dereverberation With Sparse Priors

摘要

著录项

引文网络

相似文献

相关主题

期刊订阅