Single-channel speech enhancement based on improved frame-iterative spectral subtraction in the modulation domain

Li Chao; Jiang Ting; Wu Sheng

Abstract

To address the musical noise introduced by classical spectral subtraction, spectral subtraction in the short-time modulation (STM) domain has been successfully applied to single-channel speech enhancement. However, because voice activity detection (VAD) is inaccurate, residual musical noise remains and enhancement performance still needs improvement, especially in low signal-to-noise ratio (SNR) scenarios. To address this issue, an improved frame-iterative spectral subtraction in the STM domain (IMModSSub) is proposed. Specifically, exploiting inter-frame correlation, noise subtraction is applied directly to each frame of the noisy signal in the STM domain. The noisy signal is then classified into speech or silence frames based on a predefined segmental SNR threshold. Using these classification results, a corresponding mask function is developed for the noisy speech after noise subtraction. Finally, exploiting the increased sparsity of the speech signal in the modulation domain, the orthogonal matching pursuit (OMP) technique is applied to the speech frames to improve speech quality and intelligibility. The effectiveness of the proposed method is evaluated with three types of noise: white noise, pink noise, and hfchannel noise. The results show that the proposed method outperforms some established baselines at low SNRs (-5 to +5 dB).
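
The subtraction, frame classification, and masking steps described in the abstract can be illustrated with a minimal sketch. This is not the authors' IMModSSub implementation: it works on plain per-frame magnitude lists rather than a true STM-domain representation, it omits the OMP stage, and the function names, the spectral floor, the 0 dB threshold, and the fixed-gain mask are all assumptions chosen for illustration.

```python
import math


def spectral_subtract(noisy_mag, noise_mag, floor=0.01):
    """Subtract a noise magnitude estimate bin by bin.

    A spectral floor (assumed value) keeps each result positive.
    """
    return [max(y - n, floor * y) for y, n in zip(noisy_mag, noise_mag)]


def frame_snr_db(signal_mag, noise_mag, eps=1e-12):
    """Per-frame SNR in dB: signal energy over noise-estimate energy."""
    signal_energy = sum(s * s for s in signal_mag)
    noise_energy = sum(n * n for n in noise_mag)
    return 10.0 * math.log10((signal_energy + eps) / (noise_energy + eps))


def classify_and_mask(frames, noise_mag, threshold_db=0.0, attenuation=0.1):
    """Subtract noise per frame, label frames, and apply a simple mask.

    Frames whose post-subtraction SNR exceeds threshold_db are treated
    as speech and kept; the rest are treated as silence and attenuated.
    The threshold and the fixed-gain mask are hypothetical choices.
    """
    enhanced, labels = [], []
    for frame in frames:
        subtracted = spectral_subtract(frame, noise_mag)
        is_speech = frame_snr_db(subtracted, noise_mag) > threshold_db
        gain = 1.0 if is_speech else attenuation
        enhanced.append([gain * s for s in subtracted])
        labels.append(is_speech)
    return enhanced, labels


# Toy input: one strong (speech-like) frame and one noise-only frame.
frames = [[5.0, 4.0, 6.0], [1.1, 0.9, 1.0]]
noise_estimate = [1.0, 1.0, 1.0]
enhanced, labels = classify_and_mask(frames, noise_estimate)
```

In this toy input the first frame is labelled speech and passes through after subtraction, while the second is labelled silence and attenuated. In a fuller pipeline, only the speech-labelled frames would go on to a sparse-recovery step such as OMP.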

Bibliographic details

Source
Communications, China | 2021, Issue 9 | pp. 100-115 | 16 pages
Authors
Li Chao; Jiang Ting; Wu Sheng
Author affiliations

Beijing Univ Posts & Telecommun, Sch Informat & Commun Engn, Beijing 100876, Peoples R China (all three authors)

Original format: PDF
Language: English
Keywords
short-time modulation domain; single-channel speech enhancement; modulation improved frame iterative spectral subtraction; low SNRs;

