首页> 外文会议>European Signal Processing Conference >Stereophonic music separation based on non-negative tensor factorization with cepstrum regularization
【24h】

Stereophonic music separation based on non-negative tensor factorization with cepstrum regularization

机译:基于倒谱正则化的非负张量分解的立体声音乐分离

获取原文

摘要

This paper presents a novel approach to stereophonic music separation based on Non-negative Tensor Factorization (NTF). Stereophonic music is roughly divided into two types; recorded music or synthesized music, which we focus on synthesized one in this paper. Synthesized music signals are often generated as linear combinations of many individual source signals with their mixing gains (i.e., time-invariant amplitude scaling) to each channel signal. Therefore, the synthesized stereophonic music separation is the underdetermined source separation problem where phase components are not helpful for the separation. NTF is one of the effective techniques to handle this problem, decomposing amplitude spectrograms of the stereo channel music signal into basis vectors and activations of individual music source signals and their corresponding mixing gains. However, it is essentially difficult to obtain sufficient separation performance in this separation problem as available acoustic cues for separation are limited. To address this issue, we propose a cepstrum regularization method for NTF-based stereo channel separation. The proposed method makes the separated music source signals follow the corresponding Gaussian mixture models of individual music source signals, which are trained in advance using their available samples. An experimental evaluation using real music signals is conducted to investigate the effectiveness of the proposed method in both supervised and unsupervised separation frameworks. The experimental results demonstrate that the proposed method yields significant improvements in separation performance in both frameworks.
机译:本文提出了一种基于非负张量因子分解(NTF)的立体声音乐分离新方法。立体声音乐大致分为两种类型。录制音乐或合成音乐,在本文中我们将重点放在合成音乐上。合成的音乐信号通常以许多单独的源信号的线性组合及其对每个声道信号的混合增益(即,时不变幅度缩放)产生。因此,合成立体声音乐分离是不确定的源分离问题,其中相位分量对分离没有帮助。 NTF是处理此问题的有效技术之一,它可以将立体声通道音乐信号的振幅频谱图分解为基本向量,并激活各个音乐源信号及其相应的混合增益。然而,由于可用的分离声音提示受到限制,因此在该分离问题中基本上难以获得足够的分离性能。为解决此问题,我们提出了一种基于NTF的立体声通道分离的倒谱正则化方法。所提出的方法使分离的音乐源信号遵循各个音乐源信号的相应的高斯混合模型,这些模型使用它们的可用样本预先进行训练。进行了使用真实音乐信号的实验评估,以研究该方法在有监督和无监督分离框架中的有效性。实验结果表明,所提出的方法在两种框架下的分离性能均得到了显着改善。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号