The Potential for Speech Intelligibility Improvement Using the Ideal Binary Mask and the Ideal Wiener Filter in Single Channel Noise Reduction Systems: Application to Auditory Prostheses

Madhu N.; Spriet A.; Jansen S.; Koning R.; Wouters J.

首页> 外文期刊>Audio, Speech, and Language Processing, IEEE Transactions on >The Potential for Speech Intelligibility Improvement Using the Ideal Binary Mask and the Ideal Wiener Filter in Single Channel Noise Reduction Systems: Application to Auditory Prostheses

【24h】

The Potential for Speech Intelligibility Improvement Using the Ideal Binary Mask and the Ideal Wiener Filter in Single Channel Noise Reduction Systems: Application to Auditory Prostheses

机译：在单通道降噪系统中使用理想二进制掩码和理想维纳滤波器改善语音清晰度的潜力：在听觉假体中的应用

获取原文

获取原文并翻译 | 示例

开具论文收录证明 >>

页面导航

摘要
著录项
引文网络
相似文献
相关主题

摘要

Whereas state-of-the-art single-channel noise reduction algorithms for auditory prostheses demonstrate an appreciable suppression of the noise and improved speech quality, they are unable, thus far, to improve the intelligibility of noise-degraded speech signals. Alternative approaches to speech enhancement using a binary time-frequency mask have demonstrated substantial intelligibility improvements in low signal-to-noise-ratio (SNR) conditions under ideal settings, making this a promising research direction for auditory prostheses. These approaches exploit the sparsity and disjoint-ness of speech spectra in their short-time—frequency representation to preserve only the target-dominant time-frequency regions in the processed output. State-of-the-art noise reduction algorithms in contrast are soft-decision approaches which weight each time-frequency region in proportion to the prevailing SNR. However, the potential for intelligibility improvement using these approaches has not been examined systematically vis-à-vis the binary mask alternative. This contribution compares the performance of an ideal soft-decision system, exemplified by the ideal Wiener filter (IWF), and the ideal binary mask (IBM) for single-channel speech enhancement for auditory prostheses. To obtain results relevant to this application area, a (relatively) low spectral resolution, modelled using the Bark-spectrum scale, is used for both the IWF and the IBM. This spectral resolution is comparable to that being used in commercial hearing instruments. The comparison is in terms of potential for intelligibility improvement and resulting signal quality. Intelligibility tests carried out under various noise conditions and SNRs show that the IWF leads to higher intelligibility scores than the IBM in low SNR conditions. Under non-ideal parameter estimates, it is demonstrated that the IWF approach is also much less sensitive to estimation errors. Quality-wise, a preference for the IWF exists- This was evaluated using a two-stage, pair-wise preference-rating test.

机译：尽管用于听觉假肢的最新单通道降噪算法显示出对噪声的明显抑制，并改善了语音质量，但迄今为止，它们仍无法提高降噪语音信号的清晰度。使用二进制时频掩模进行语音增强的替代方法已经证明，在理想设置下，在低信噪比（SNR）条件下，语言的清晰度有了显着提高，这使其成为听觉假体的有希望的研究方向。这些方法在其短时频率表示中利用了语音频谱的稀疏性和不连续性，以仅在处理后的输出中保留目标主导的时频区域。相比之下，最新的降噪算法是软决策方法，该方法将每个时频区域与主流SNR成比例地加权。但是，相对于二进制掩码替代方案，尚未系统地检查使用这些方法改善清晰度的可能性。此贡献比较了理想的软决策系统的性能，该系统以理想的维纳滤波器（IWF）和理想的二进制掩码（IBM）为例，用于听觉假体的单通道语音增强。为了获得与该应用领域相关的结果，IWF和IBM均使用（相对）较低的光谱分辨率（使用树皮光谱标度建模）。该光谱分辨率可与商用助听器中使用的光谱分辨率相媲美。比较是在提高清晰度和产生信号质量的潜力方面。在各种噪声条件和SNR下进行的可懂度测试表明，在低SNR条件下，IWF的清晰度比IBM高。在非理想参数估计下，证明了IWF方法对估计误差的敏感度也低得多。在质量方面，存在对IWF的偏好-这是通过两阶段，成对的偏好评级测试进行评估的。

著录项

来源
《Audio, Speech, and Language Processing, IEEE Transactions on》 |2013年第1期|p.61-70|共10页
作者
Madhu N.; Spriet A.; Jansen S.; Koning R.; Wouters J.;
展开▼
作者单位

Division of Experimental Otorhinolaryngology, Dept. Neurosciences, Katholieke Universiteit Leuven, Belgium;

展开▼
收录信息
原文格式 PDF
正文语种 eng
中图分类
关键词
Binary masks; hearing-aids; soft-decision; speech enhancement; speech intelligibility in noise;

机译：二进制口罩;助听器;软判决;语音增强;噪声中的语音清晰度;

相似文献

外文文献
中文文献
专利

1. Speech intelligibility improvements with hearing aids using bilateral and binaural adaptive multichannel Wiener filtering based noise reduction a [J] . Bram Cornelis, Marc Moonen, Jan Wouters The Journal of the Acoustical Society of America . 2012,第6期

机译：使用双边和双耳自适应多通道维纳滤波的降噪技术，借助助听器提高语音清晰度
2. SINGLE CHANNEL SPEECH ENHANCEMENT USING IDEAL BINARY MASK TECHNIQUE BASED ON COMPUTATIONAL AUDITORY SCENE ANALYSIS [J] . ABRAR HUSSAIN, KALAIVANI CHELLAPPAN, SITI ZAMRATOL M Journal of Theoretical and Applied Information Technology . 2016,第1期

机译：基于计算音频场景分析的理想二元掩膜技术的单通道语音增强
3. Speech intelligibility in reverberation with ideal binary masking: Effects of early reflections and signal-to-noise ratio threshold [J] . Roman N., Woodruff J. The Journal of the Acoustical Society of America . 2013,第3期

机译：具有理想二进制掩蔽的混响中的语音清晰度：早期反射和信噪比阈值的影响
4. EVALUATION OF STOI FOR SPEECH AT LOW SIGNAL-TO-NOISE RATIOS AFTER ENHANCEMENT WITH IDEAL BINARY MASKS [C] . Simone Graetzer, Carl Hopkins International Congress on Sound and Vibration . 2018

机译：用理想二元面罩在增强后，在低信噪比下评估STOI
5. Speech enhancement algorithms using Kalman filtering and masking properties of human auditory systems. [D] . Ma, Ning. 2005

机译：使用卡尔曼滤波和人类听觉系统掩蔽属性的语音增强算法。
6. Factors influencing intelligibility of ideal binary-masked speech: Implications for noise reduction [O] . Ning Li, Philipos C. Loizou -1

机译：影响理想二进制掩蔽语音清晰度的因素：降噪的含义
7. Factors influencing intelligibility of ideal binary-masked speech: Implications for noise reduction [O] . Ning Li, Philipos C. Loizou 2008

机译：影响理想二进制语音的可理解性的因素：降噪引起的影响
8. Application of Active Noise Reduction for Hearing Protection and Speech Intelligibility Improvement [R] . Steeneken, H. J. M., Langhout, G. 1985

机译：有源降噪在听力保护和语音清晰度改善中的应用

The Potential for Speech Intelligibility Improvement Using the Ideal Binary Mask and the Ideal Wiener Filter in Single Channel Noise Reduction Systems: Application to Auditory Prostheses

摘要

著录项

引文网络

相似文献

相关主题

期刊订阅