A reliable data selection for model-based noise suppression using unsupervised joint speaker adaptation and noise model estimation

机译：使用无监督的联合说话人自适应和噪声模型估计的基于模型的噪声抑制的可靠数据选择

获取原文

获取原文并翻译 | 示例

页面导航

摘要
著录项
相似文献
相关主题

摘要

The performance of model-based noise suppression is significantly affected by variations in speaker characteristics and the modeling accuracy of the noise. As regards this problem, the joint processing of speaker adaptation and accurate noise model estimation are crucial factors for improving model-based noise suppression. However, this joint processing is computationally intractable due to the direct unobservability of clean speech and noise signals in the conventional approach, which incorporates a vector Taylor series-based approach. To overcome this problem, we investigate a way of achieving joint processing by utilizing minimum mean squared error (MMSE) estimates of clean speech and noise. The MMSE estimates allow the flexible estimation of accurate parameters for the joint processing without intractable computation or any approximation. Here, since the MMSE estimates of clean speech and noise include some estimation errors, the estimation errors often degrade the accuracy of parameter estimation. Thus, we also employ a reliable data selection technique based on voice activity detection to estimate the joint processing parameters. The evaluation result reveals that the proposed reliable data selection method successfully improves both parameter estimation and speech recognition accuracy.

机译：基于模型的噪声抑制的性能受扬声器特性和噪声建模精度的变化影响很大。关于此问题，说话人自适应和准确的噪声模型估计的联合处理是改善基于模型的噪声抑制的关键因素。但是，由于在常规方法中采用了基于矢量泰勒级数的方法，因此干净语音和噪声信号的直接不可观察性，因此这种联合处理在计算上难以解决。为克服此问题，我们研究了一种利用干净语音和噪声的最小均方误差（MMSE）估计来实现联合处理的方法。 MMSE估算允许灵活估算联合处理的准确参数，而无需进行复杂的计算或任何近似计算。在此，由于干净语音和噪声的MMSE估计包括一些估计误差，所以估计误差通常会降低参数估计的准确性。因此，我们还采用了基于语音活动检测的可靠数据选择技术来估计联合处理参数。评估结果表明，所提出的可靠的数据选择方法成功地提高了参数估计和语音识别的准确性。

著录项

来源
《2012 IEEE International Conference on Signal Processing, Communications and Computing.》|2012年|p.148- 153|共6页
会议地点 Hong Kong(CN);Hong Kong(CN)
作者
Masakiyo Fujimoto; Tomohiro Nakatani;
展开▼
作者单位

NTT Communication Science Laboratories, NTT Corporation, Japan;

展开▼
会议组织
原文格式 PDF
正文语种 eng
中图分类信号处理;信号处理;
关键词

相似文献

外文文献
中文文献
专利

1. Adaptation of Acoustic Models in Joint Speaker and Noise Space Using Bilinear Models [J] . Yongwon JEONG, Hyung Soon KIM IEICE transactions on information and systems . 2014,第8期

机译：使用双线性模型的联合说话人和噪声空间中的声学模型的适应
2. Noise robust speech recognition applied to unsupervised speaker adaptation [J] . Shingo Yamade, Akinobu Lee, Hiroshi Saruwatari, 電子情報通信学会技術研究報告. 言語理解とコミュニケーション. Natural Language Understanding and Models of Communication . 2002,第527期

机译：适用于无监督说话者适应的抗噪语音识别
3. Noise robust speech recognition applied to unsupervised speaker adaptation [J] . Shingo Yamade, Akinobu Lee, Hiroshi Saruwatari, 電子情報通信学会技術研究報告. 音声. Speech . 2002,第529期

机译：适用于无监督说话者适应的抗噪语音识别
4. A reliable data selection for model-based noise suppression using unsupervised joint speaker adaptation and noise model estimation [C] . Masakiyo Fujimoto, Tomohiro Nakatani IEEE International Conference on Signal Processing, Communications and Computing . 2012

机译：使用无监督的联合扬声器适应和噪声模型估计的基于模型的噪声抑制的可靠数据选择
5. Phase noise estimation and suppression for a SDMA DFT-precoded OFDM system. [D] . Wang, Yu. 2009

机译：SDMA DFT预编码OFDM系统的相位噪声估计和抑制。
6. Model Selection and Estimation of Multi-Compartment Models in Diffusion MRI with a Rician Noise Model [O] . Xinghua Zhu, Yaniv Gur, Wenping Wang, -1

机译：选型和扩散mRI多室模型的估计与莱斯噪声模型
7. Adaptation of Acoustic Models in Joint Speaker and Noise Space Using Bilinear Models [O] . Yongwon JEONG, Hyung Soon KIM 2014

机译：双线性模型对扬声器和噪声空间中的声学模型的适应

A reliable data selection for model-based noise suppression using unsupervised joint speaker adaptation and noise model estimation

摘要

著录项

相似文献

相关主题

期刊订阅