Speech Enhancement With Deep Neural Networks Using MoG Based Labels

机译：使用基于MoG的标签的深度神经网络进行语音增强

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

In this paper we present a mixture of Gaussians-deep neural network (MoG-DNN) algorithm for single-microphone speech enhancement. We combine between the generative mixture of Gaussians (MoG) model and the discriminative deep neural network (DNN). The proposed algorithm consists of two phases, the training phase and the test phase. In the training phase, the clean speech power spectral density (PSD) is modeled as a MoG representing an unsupervised assortment of the speech signal. Following, the database is labeled to fit the given MoG. DNN is then trained to classify noisy time-frame features to one of the Gaussians from the already inferred MoG. Given the classification results, a speech presence probability (SPP) is obtained in the test phase. Using the SPP, soft spectral subtraction is then applied, while, simultaneously updating the noise statistics. The generative unsupervised MoG can be applied to any unknown database, in addition to preserving the speech spectral structure. Furthermore, the discriminative DNN maintains the continuity of the speech. Experimental study shows that the proposed algorithm produces higher objective measurements scores compared to other speech enhancement algorithms.

机译：在本文中，我们提出了一种用于单麦克风语音增强的混合高斯深层神经网络（MoG-DNN）算法。我们将高斯模型（MoG）的生成混合与判别性深度神经网络（DNN）结合在一起。所提出的算法包括两个阶段，训练阶段和测试阶段。在训练阶段，将干净语音功率谱密度（PSD）建模为MoG，表示语音信号的无监督分类。接下来，将数据库标记为适合给定的MoG。然后，对DNN进行训练，以根据已经推断出的MoG将嘈杂的时间范围特征分类为高斯之一。给定分类结果，可以在测试阶段获得语音存在概率（SPP）。然后，使用SPP进行软频谱减法，同时更新噪声统计信息。生成的无监督MoG除了可以保留语音频谱结构之外，还可以应用于任何未知数据库。此外，具有区别性的DNN可以保持语音的连续性。实验研究表明，与其他语音增强算法相比，该算法产生了更高的客观测量分数。

著录项

来源
《IEEE International Conference on Rebooting Computing》|2018年|1-5|共5页
会议地点
作者
Hodaya Hammer; Gilad Rath; Shlomo E. Chazan; Jacob Goldberger; Sharon Gannot;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类
关键词
Speech enhancement; Noise measurement; Databases; Training; Task analysis; Neural networks; Electrical engineering;

机译：语音增强;噪声测量;数据库;培训;任务分析;神经网络;电气工程;

相似文献

外文文献
中文文献
专利

1. Deep convolutional neural network-based speech enhancement to improve speech intelligibility and quality for hearing-impaired listeners (Retraction of 2018) [J] . Rahiman P. F. Khaleelur, Jayanthi V. S., Jayanthi A. N. Medical and Biological Engineering and Computing: Journal of the International Federation for Medical and Biological Engineering . 2019,第3期

机译：基于深度卷积神经网络的语言增强，提高听力障碍听众的语音清晰度和质量（2018年撤回）
2. Speech Intelligibility Potential of General and Specialized Deep Neural Network Based Speech Enhancement Systems [J] . Morten Kolbæk, Zheng-Hua Tan, Jesper Jensen Audio, Speech, and Language Processing, IEEE/ACM Transactions on . 2017,第1期

机译：基于通用和专用深度神经网络的语音增强系统的语音清晰度潜力
3. Spectral Phase Estimation Based on Deep Neural Networks for Single Channel Speech Enhancement [J] . Saleem N., Khattak M. I., Perez E. V. NTT R&D . 2019,第12期

机译：基于深神经网络的单频语语音增强的光谱相位估计
4. Speech Enhancement With Deep Neural Networks Using MoG Based Labels [C] . Hodaya Hammer, Gilad Rath, Shlomo E. Chazan, IEEE International Conference on Rebooting Computing . 2018

机译：基于MOG的标签与深神经网络的语音增强
5. Robust Training Methods for Deep Neural Networks with a Variety of Label Noise [D] . Kamabattula, Sree Ram. 2021

机译：具有各种标签噪声的深神经网络的强大培训方法
6. Speech enhancement based on neural networks improves speech intelligibility in noise for cochlear implant users [O] . Tobias Goehring, Federico Bolner, Jessica J.M. Monaghan, -1

机译：基于神经网络的语音增强功能可提高人工耳蜗用户的语音清晰度
7. EXEMPLAR-BASED SPEECH ENHANCEMENT FOR DEEP NEURAL NETWORK BASED AUTOMATIC SPEECH RECOGNITION [O] . Deepak Baby, Jort F. Gemmeke, Tuomas Virtanen, 2016

机译：基于示例的语音增强在深度神经网络自动语音识别中的应用

Speech Enhancement With Deep Neural Networks Using MoG Based Labels

摘要

著录项

相似文献

相关主题

期刊订阅