首页> 外文期刊>Audio, Speech, and Language Processing, IEEE/ACM Transactions on >Separation of Moving Sound Sources Using Multichannel NMF and Acoustic Tracking
【24h】

Separation of Moving Sound Sources Using Multichannel NMF and Acoustic Tracking

机译:使用多通道NMF和声学跟踪分离运动声源

获取原文
获取原文并翻译 | 示例
获取外文期刊封面目录资料

摘要

In this paper, we propose a method for separation of moving sound sources. The method is based on first tracking the sources and then estimation of source spectrograms using multichannel nonnegative matrix factorization (NMF) and extracting the sources from the mixture by single-channel Wiener filtering. We propose a novel multichannel NMF model with time-varying mixing of the sources denoted by spatial covariance matrices (SCM) and provide update equations for optimizing model parameters minimizing squared Frobenius norm. The SCMs of the model are obtained based on estimated directions of arrival of tracked sources at each time frame. The evaluation is based on established objective separation criteria and using real recordings of two and three simultaneous moving sound sources. The compared methods include conventional beamforming and ideal ratio mask separation. The proposed method is shown to exceed the separation quality of other evaluated blind approaches according to all measured quantities. Additionally, we evaluate the method's susceptibility toward tracking errors by comparing the separation quality achieved using annotated ground truth source trajectories.
机译:在本文中,我们提出了一种分离运动声源的方法。该方法基于以下步骤:首先跟踪源,然后使用多通道非负矩阵分解(NMF)估算源频谱图,并通过单通道Wiener滤波从混合物中提取源。我们提出了一种新颖的多通道NMF模型,其中时变混合了由空间协方差矩阵(SCM)表示的源,并提供了更新方程,用于优化模型参数,从而最小化平方Frobenius范数。基于在每个时间框架内跟踪源的估计到达方向,可以获得模型的SCM。评估基于既定的客观分离标准,并使用两个和三个同时移动声源的真实录音。比较的方法包括传统的波束成形和理想比率的掩模分离。结果表明,根据所有测得的量,所提出的方法均超过了其他评估盲法的分离质量。此外,我们通过比较使用带注释的地面真源轨迹实现的分离质量,评估了该方法对跟踪误差的敏感性。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号