Unsupervised monaural speech enhancement using robust NMF with low-rank and sparse constraints

机译：使用具有低秩和稀疏约束的健壮NMF进行无监督单声道语音增强

获取原文

获取原文并翻译 | 示例

页面导航

摘要
著录项
相似文献
相关主题

摘要

Non-negative spectrogram decomposition and its variants have been extensively investigated for speech enhancement due to their efficiency in extracting perceptually meaningful components from mixtures. Usually, these approaches are implemented on the condition that training samples for one or more sources are available beforehand. However, in many real-world scenarios, it is always impossible for conducting any prior training. To solve this problem, we proposed an approach which directly extracts the representations of background noises from the noisy speech via imposing non-negative constraints on the low-rank and sparse decomposition of the noisy spectrogram. The noise representations are subsequently utilized when estimating the clean speech. In this technique, potential spectral structural regularity could be discovered for better reconstruction of clean speech. Evaluations on the Noisex-92 and TIMIT database showed that the proposed method achieves significant improvements over the state-of-the-art methods in unsupervised speech enhancement.

机译：非负声谱图分解及其变体已被广泛研究用于语音增强，因为它们有效地从混合物中提取了感知上有意义的成分。通常，这些方法是在事先获得一个或多个源的训练样本的条件下实施的。但是，在许多实际情况下，始终不可能进行任何事先培训。为了解决这个问题，我们提出了一种通过对噪声频谱图的低秩和稀疏分解施加非负约束来直接从噪声语音中提取背景噪声表示的方法。随后在估计干净语音时利用噪声表示。在这种技术中，可以发现潜在的频谱结构规律性，以更好地重建干净的语音。对Noisex-92和TIMIT数据库的评估表明，与无监督语音增强的最新技术相比，该方法取得了显着改进。

著录项

来源
《2015 IEEE China Summit amp; International Conference on Signal and Information Processing》|2015年|1-4|共4页
会议地点 Chengdu(CN)
作者
Yinan Li; Xiongwei Zhang; Meng Sun; Gang Min;
展开▼
作者单位

Lab. of Intell. Inf. Process., PLA Univ. of Sci. Technol., Nanjing, China;

展开▼
会议组织
原文格式 PDF
正文语种 eng
中图分类
关键词
low-rank and sparse decomposition; non-negative matrix factorization; speech enhancement;

机译：低秩稀疏分解;非负矩阵分解;语音增强;

相似文献

外文文献
中文文献
专利

1. Unsupervised Speech Enhancement Based on Multichannel NMF-Informed Beamforming for Noise-Robust Automatic Speech Recognition [J] . Shimada Kazuki, Bando Yoshiaki, Mimura Masato, Audio, Speech, and Language Processing, IEEE/ACM Transactions on . 2019,第5期

机译：基于多通道NMF信息波束形成的无监督语音增强技术，用于强噪声自动语音识别
2. Unsupervised Speech Enhancement Based on Multichannel NMF-Informed Beamforming for Noise-Robust Automatic Speech Recognition [J] . Shimada Kazuki, Bando Yoshiaki, Mimura Masato, Audio, Speech, and Language Processing, IEEE/ACM Transactions on . 2019,第5期

机译：基于多通道NMF的噪声强度自动语音识别的无监督语音增强
3. Unsupervised feature selection and NMF de-noising for robust Speech Emotion Recognition [J] . Bandela Surekha Reddy, Kumar T. Kishore Applied Acoustics . 2021,第Jana期

机译：无监督的功能选择和NMF用于强大语音情感识别的脱模
4. Unsupervised monaural speech enhancement using robust NMF with low-rank and sparse constraints [C] . Yinan Li, Xiongwei Zhang, Meng Sun, IEEE China Summit and International Conference on Signal and Information Processing . 2015

机译：使用具有低级别和稀疏约束的鲁棒NMF无监督的单声道语音增强
5. Sparse and Low-Rank Techniques for the Efficient Restoration of Images =Sparse and Low-Rank Techniques for the Efficient Restoration of Images [D] . Zhang, Mingli. 2017

机译：高效的图像稀疏和低秩技术=高效的图像稀疏和低秩技术
6. High-Resolution Dynamic Speech Imaging with Joint Low-Rank and Sparsity Constraints [O] . Maojing Fu, Bo Zhao, Christopher Carignan, -1

机译：联合低秩和稀疏约束的高分辨率动态语音成像
7. NMF based speech and music separation in monaural speech recordings with sparseness and temporal continuity constraints [O] . Tu Ming, Xie Xiang, Jiao Yishan 2013

机译：基于NMF的语音和音乐分离在单声道语音记录中，具有稀疏性和时间连续性约束

Unsupervised monaural speech enhancement using robust NMF with low-rank and sparse constraints

摘要

著录项

相似文献

相关主题

期刊订阅