首页> 外文会议>2015 IEEE China Summit amp; International Conference on Signal and Information Processing >Unsupervised monaural speech enhancement using robust NMF with low-rank and sparse constraints
【24h】

Unsupervised monaural speech enhancement using robust NMF with low-rank and sparse constraints

机译:使用具有低秩和稀疏约束的健壮NMF进行无监督单声道语音增强

获取原文
获取原文并翻译 | 示例

摘要

Non-negative spectrogram decomposition and its variants have been extensively investigated for speech enhancement due to their efficiency in extracting perceptually meaningful components from mixtures. Usually, these approaches are implemented on the condition that training samples for one or more sources are available beforehand. However, in many real-world scenarios, it is always impossible for conducting any prior training. To solve this problem, we proposed an approach which directly extracts the representations of background noises from the noisy speech via imposing non-negative constraints on the low-rank and sparse decomposition of the noisy spectrogram. The noise representations are subsequently utilized when estimating the clean speech. In this technique, potential spectral structural regularity could be discovered for better reconstruction of clean speech. Evaluations on the Noisex-92 and TIMIT database showed that the proposed method achieves significant improvements over the state-of-the-art methods in unsupervised speech enhancement.
机译:非负声谱图分解及其变体已被广泛研究用于语音增强,因为它们有效地从混合物中提取了感知上有意义的成分。通常,这些方法是在事先获得一个或多个源的训练样本的条件下实施的。但是,在许多实际情况下,始终不可能进行任何事先培训。为了解决这个问题,我们提出了一种通过对噪声频谱图的低秩和稀疏分解施加非负约束来直接从噪声语音中提取背景噪声表示的方法。随后在估计干净语音时利用噪声表示。在这种技术中,可以发现潜在的频谱结构规律性,以更好地重建干净的语音。对Noisex-92和TIMIT数据库的评估表明,与无监督语音增强的最新技术相比,该方法取得了显着改进。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号