首页> 外文会议>European Signal Processing Conference >Stereophonic music separation based on non-negative tensor factorization with cepstrum regularization

【24h】

Stereophonic music separation based on non-negative tensor factorization with cepstrum regularization

机译：基于倒谱正则化的非负张量分解的立体声音乐分离

获取原文

页面导航

摘要
著录项
引文网络
相似文献
相关主题

摘要

This paper presents a novel approach to stereophonic music separation based on Non-negative Tensor Factorization (NTF). Stereophonic music is roughly divided into two types; recorded music or synthesized music, which we focus on synthesized one in this paper. Synthesized music signals are often generated as linear combinations of many individual source signals with their mixing gains (i.e., time-invariant amplitude scaling) to each channel signal. Therefore, the synthesized stereophonic music separation is the underdetermined source separation problem where phase components are not helpful for the separation. NTF is one of the effective techniques to handle this problem, decomposing amplitude spectrograms of the stereo channel music signal into basis vectors and activations of individual music source signals and their corresponding mixing gains. However, it is essentially difficult to obtain sufficient separation performance in this separation problem as available acoustic cues for separation are limited. To address this issue, we propose a cepstrum regularization method for NTF-based stereo channel separation. The proposed method makes the separated music source signals follow the corresponding Gaussian mixture models of individual music source signals, which are trained in advance using their available samples. An experimental evaluation using real music signals is conducted to investigate the effectiveness of the proposed method in both supervised and unsupervised separation frameworks. The experimental results demonstrate that the proposed method yields significant improvements in separation performance in both frameworks.

机译：本文提出了一种基于非负张量因子分解（NTF）的立体声音乐分离新方法。立体声音乐大致分为两种类型。录制音乐或合成音乐，在本文中我们将重点放在合成音乐上。合成的音乐信号通常以许多单独的源信号的线性组合及其对每个声道信号的混合增益（即，时不变幅度缩放）产生。因此，合成立体声音乐分离是不确定的源分离问题，其中相位分量对分离没有帮助。 NTF是处理此问题的有效技术之一，它可以将立体声通道音乐信号的振幅频谱图分解为基本向量，并激活各个音乐源信号及其相应的混合增益。然而，由于可用的分离声音提示受到限制，因此在该分离问题中基本上难以获得足够的分离性能。为解决此问题，我们提出了一种基于NTF的立体声通道分离的倒谱正则化方法。所提出的方法使分离的音乐源信号遵循各个音乐源信号的相应的高斯混合模型，这些模型使用它们的可用样本预先进行训练。进行了使用真实音乐信号的实验评估，以研究该方法在有监督和无监督分离框架中的有效性。实验结果表明，所提出的方法在两种框架下的分离性能均得到了显着改善。

著录项

来源
《European Signal Processing Conference》|2017年|981-985|共5页
会议地点
作者
Shogo Seki; Tomoki Toda; Kazuya Takeda;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类
关键词
Multiple signal classification; Cepstrum; Music; Source separation; Spectrogram; Upper bound; Europe;

机译：多信号分类;倒谱;音乐;源分离;频谱图;上限;欧洲;

相似文献

外文文献
中文文献
专利

1. Clustering Algorithm for Unsupervised Monaural Musical Sound Separation Based on Non-negative Matrix Factorization [J] . Sang Ha PARK, Seokjin LEE, Koeng-Mo SUNG IEICE Transactions on Fundamentals of Electronics, Communications and Computer Sciences . 2012,第4期

机译：基于非负矩阵分解的无监督单声道音乐分离的聚类算法
2. On the use of a spatial cue as prior information for stereo sound source separation based on spatially weighted non-negative tensor factorization [J] . Yuki Mitsufuji, Axel Roebel EURASIP journal on advances in signal processing . 2014,第1期

机译：基于空间加权非负张量分解的空间提示作为先验信息用于立体声声源分离的研究
3. Non-Negative Tensor Factorization Applied to Music Genre Classification [J] . Benetos E., Kotropoulos C. Audio, Speech, and Language Processing, IEEE Transactions on . 2010,第8期

机译：非负张量分解应用于音乐流派分类
4. Stereophonic Music Separation Based on Non-negative Tensor Factorization with Cepstrum Regularization [C] . Shogo Seki, Tomoki Toda, Kazuya Takeda European Signal Processing Conference . 2017

机译：基于非负张量分解的立体声音乐分离综合规范化
5. On the separation of T Tauri star spectra using non-negative matrix factorization and Bayesian positive source separation. [D] . Kenney, Colleen. 2010

机译：关于使用非负矩阵分解和贝叶斯正源分离的T Tauri星光谱的分离。
6. Indicator Regularized Non-Negative Matrix Factorization Method-Based Drug Repurposing for COVID-19 [O] . Xianfang Tang, Lijun Cai, Yajie Meng, 2020

机译：指示器规则化非负矩阵分解方法的Covid-19药物重新施用
7. On the use of a spatial cue as prior information for stereo sound source separation based on spatially weighted non-negative tensor factorization [O] . Yuki Mitsufuji, Axel Roebel 2014

机译：基于空间加权非负张量分解的空间提示作为先验信息用于立体声声源分离的研究

Stereophonic music separation based on non-negative tensor factorization with cepstrum regularization

摘要

著录项

引文网络

相似文献

相关主题

期刊订阅