Low-SNR, Speaker-Dependent Speech Enhancement using GMMs and MFCCs

Abstract

In this paper, we propose a two-stage speech enhancement technique. In the training stage, a Gaussian Mixture Model (GMM) of the mel-frequency cepstral coefficients (MFCCs) of a user's clean speech is computed, wherein the component densities of the GMM serve to model the user's "acoustic classes." In the enhancement stage, MFCCs from a noisy speech signal are computed and the underlying clean acoustic class is identified via a maximum a posteriori (MAP) decision and a novel mapping matrix. The associated GMM parameters are then used to estimate the MFCCs of the clean speech from the MFCCs of the noisy speech. Finally, the estimated MFCCs are transformed back to a time-domain waveform. Our results show that we can improve PESQ in environments as low as -10 dB SNR.
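
As a rough illustration of the MAP decision mentioned in the abstract, the following sketch picks, for each feature vector, the GMM component with the highest posterior probability. It is not the authors' implementation: it assumes diagonal covariances, uses toy two-dimensional vectors in place of real MFCCs, and omits the mapping matrix and the clean-MFCC estimator, which the abstract does not specify. The names log_gaussian_diag and map_component are hypothetical.

```python
import math

def log_gaussian_diag(x, mean, var):
    """Log density of a Gaussian with diagonal covariance (assumed form)."""
    return -0.5 * sum(
        (xi - mi) ** 2 / vi + math.log(2 * math.pi * vi)
        for xi, mi, vi in zip(x, mean, var)
    )

def map_component(x, weights, means, variances):
    """Index of the component maximizing log(weight) + log likelihood.

    The normalizing term of the posterior is the same for every
    component, so it can be dropped from the argmax.
    """
    scores = [
        math.log(w) + log_gaussian_diag(x, m, v)
        for w, m, v in zip(weights, means, variances)
    ]
    return max(range(len(scores)), key=scores.__getitem__)

# Toy two-component model standing in for a GMM trained on clean speech.
weights = [0.5, 0.5]
means = [[0.0, 0.0], [5.0, 5.0]]
variances = [[1.0, 1.0], [1.0, 1.0]]

# Toy feature vectors standing in for MFCC frames.
frames = [[0.2, -0.1], [4.8, 5.3]]
labels = [map_component(f, weights, means, variances) for f in frames]
print(labels)
```

Each frame is assigned to the component whose mean it lies closest to here, because the weights and variances are equal; with unequal weights or variances the decision boundary shifts accordingly.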

Bibliographic details

Source
INTERSPEECH 2012 | 2012 | 4 pages
Conference location
Authors
Laura E. Boucheron; Phillip L. De Leon
Author affiliations

Conference organizer
Original format: PDF
Language
CLC classification: 73.4136083
Keywords
Speech enhancement; MFCC; GMM

Date indexed: 2022-08-20 22:09:20

Similar documents

1. Speaker Recognition for Hindi Speech Signal using MFCC-GMM Approach [J]. Ankur Maurya, Divya Kumar, R.K. Agarwal. Procedia Computer Science. 2018, Issue 5.

2. MFCC-GMM based accent recognition system for Telugu speech signals [J]. Kasiprasad Mannepalli, Panyam Narahari Sastry, Maloji Suman. International Journal of Speech Technology. 2016, Issue 1.

3. Design of an Automatic Speaker Recognition System Based on Adapted MFCC and GMM Methods for Arabic Speech [J]. El Bachir TAZI, Abderrahim BENABBOU, Mostafa HARTI. International Journal of Computer Science and Network Security. 2010, Issue 1.

4. Low-SNR, Speaker-Dependent Speech Enhancement using GMMs and MFCCs [C]. Laura E. Boucheron, Phillip L. De Leon. Annual Conference of the International Speech Communication Association. 2012.

5. High-rate optimized quantization structures and speaker-dependent wideband speech coding [D]. Duni, Ethan Robert. 2007.

6. One-against-All Weighted Dynamic Time Warping for Language-Independent and Speaker-Dependent Speech Recognition in Adverse Conditions [O]. Xianglilan Zhang, Jiping Sun, Zhigang Luo. 2010.

7. Combining missing-feature theory, speech enhancement, and speaker-dependent/-independent modeling for speech separation [O]. Ji Ming, Timothy J. Hazen, James R. Glass. 2013.
