首页> 外文会议>IEEE International Conference on Acoustics, Speech, and Signal Processing >PREDICTION BASED FILTERING AND SMOOTHING TO EXPLOIT TEMPORAL DEPENDENCIES IN NMF
【24h】

PREDICTION BASED FILTERING AND SMOOTHING TO EXPLOIT TEMPORAL DEPENDENCIES IN NMF

机译:基于预测的滤波和平滑,以利用NMF中的时间依赖性

获取原文

摘要

Nonnegative matrix factorization is an appealing technique for many audio applications. However, in it's basic form it does not use temporal structure, which is an important source of information in speech processing. In this paper, we propose NMF-based filtering and smoothing algorithms that are related to Kalman filtering and smoothing. While our prediction step is similar to that of Kalman filtering, we develop a multiplicative update step which is more convenient for nonnegative data analysis and in line with existing NMF literature. The proposed smoothing approach introduces an unavoidable processing delay, but the filtering algorithm does not and can be readily used for on-line applications. Our experiments using the proposed algorithms show a significant improvement over the baseline NMF approaches. In the case of speech denoising with factory noise at 0 dB input SNR, the smoothing algorithm outperforms NMF with 3.2 dB in SDR and around 0.5 MOS in PESQ, likewise source separation experiments result in improved performance due to taking advantage of the temporal regularities in speech.
机译:非负矩阵分解是许多音频应用的吸引力技术。但是,在它的基本形式中,它不使用时间结构,这是语音处理中的重要信息来源。在本文中,我们提出了基于NMF的过滤和平滑算法,与卡尔曼滤波和平滑有关。虽然我们的预测步骤类似于卡尔曼滤波的步骤,但我们开发了一种乘法更新步骤,这更方便了非环境数据分析和与现有的NMF文献一致。所提出的平滑方法引入了不可避免的处理延迟,但滤波算法没有并且可以容易地用于在线应用程序。我们使用所提出的算法的实验显示出对基线NMF方法的显着改进。在0 dB输入SNR的出厂噪声的语音噪声的情况下,平滑算法优于3.2 dB的SDR中的NMF,PESQ中约为0.5 MOS,同样源分离实验导致由于利用演讲中的时间规律而提高了性能。 。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号