Multi-Stage Non-Negative Matrix Factorization for Monaural Singing Voice Separation

Zhu; B.; Li; W.; Li; R.; Xue; X.

首页> 外文期刊>Audio, Speech, and Language Processing, IEEE Transactions on >Multi-Stage Non-Negative Matrix Factorization for Monaural Singing Voice Separation

【24h】

Multi-Stage Non-Negative Matrix Factorization for Monaural Singing Voice Separation

机译：单声道歌唱声音分离的多阶段非负矩阵分解

获取原文

获取原文并翻译 | 示例

获取外文期刊封面封底 >>

开具论文收录证明 >>

文献代查 >>

页面导航

摘要
著录项
相似文献
相关主题

摘要

Separating singing voice from music accompaniment can be of interest for many applications such as melody extraction, singer identification, lyrics alignment and recognition, and content-based music retrieval. In this paper, a novel algorithm for singing voice separation in monaural mixtures is proposed. The algorithm consists of two stages, where non-negative matrix factorization (NMF) is applied to decompose the mixture spectrograms with long and short windows respectively. A spectral discontinuity thresholding method is devised for the long-window NMF to select out NMF components originating from pitched instrumental sounds, and a temporal discontinuity thresholding method is designed for the short-window NMF to pick out NMF components that are from percussive sounds. By eliminating the selected components, most pitched and percussive elements of the music accompaniment are filtered out from the input sound mixture, with little effect on the singing voice. Extensive testing on the MIR-1K public dataset of 1000 short audio clips and the Beach-Boys dataset of 14 full-track real-world songs showed that the proposed algorithm is both effective and efficient.

机译：将歌声与音乐伴奏分开可能对许多应用感兴趣，例如旋律提取，歌手识别，歌词对齐和识别以及基于内容的音乐检索。本文提出了一种新的单声道混合语音分离算法。该算法包括两个阶段，其中应用非负矩阵分解（NMF）分别分解具有长窗和短窗的混合频谱图。针对长窗NMF设计了一种频谱不连续性阈值方法，以从音高的乐器声音中选择出NMF分量；为短窗NMF设计了一种时间不连续性阈值方法，以从打击乐中挑选出NMF分量。通过消除选定的成分，音乐伴奏中大多数音高和打击乐元素会从输入混音中滤除，而对演唱声音的影响很小。对1000个短音频片段的MIR-1K公共数据集和14首真实曲目的Beach-Boys数据集进行了广泛测试，结果表明，该算法既有效又有效。

著录项

来源
《Audio, Speech, and Language Processing, IEEE Transactions on》 |2013年第10期|2096-2107|共12页
作者
Zhu; B.; Li; W.; Li; R.; Xue; X.;
展开▼
作者单位

School of Computer Science, Fudan University, Shanghai, P. R. China|c|;

展开▼
收录信息
原文格式 PDF
正文语种 eng
中图分类
关键词
Multi-stage method; non-negative matrix factorization (NMF); singing voice separation; spectral discontinuity; temporal discontinuity;

机译：多阶段方法;非负矩阵分解;语音分离;谱间断;时间间断;

相似文献

外文文献
中文文献
专利

1. Clustering Algorithm for Unsupervised Monaural Musical Sound Separation Based on Non-negative Matrix Factorization [J] . Sang Ha PARK, Seokjin LEE, Koeng-Mo SUNG IEICE Transactions on Fundamentals of Electronics, Communications and Computer Sciences . 2012,第4期

机译：基于非负矩阵分解的无监督单声道音乐分离的聚类算法
2. Separation of Singing Voice Using Nonnegative Matrix Partial Co-Factorization for Singer Identification [J] . Hu Ying, Liu Guizhong Audio, Speech, and Language Processing, IEEE/ACM Transactions on . 2015,第4期

机译：使用非负矩阵部分协因子分离歌手识别的歌声
3. A Skip Attention Mechanism for Monaural Singing Voice Separation [J] . Weitao Yuan, Shengbei Wang, Xiangrui Li, IEEE signal processing letters . 2019,第10期

机译：单声道歌声分离的跳跃注意机制
4. A Local Discontinuity Based Approach for Monaural Singing Voice Separation from Accompanying Music with Multi-stage Non-negative Matrix Factorization [C] . Hatem Deif, Wenwu Wang, Lu Gan, IEEE Global Conference on Signal and Information Processing . 2015

机译：一种基于局部不连续的伴随音乐与多阶段非负矩阵分解的语音分离的方法
5. On the separation of T Tauri star spectra using non-negative matrix factorization and Bayesian positive source separation. [D] . Kenney, Colleen. 2010

机译：关于使用非负矩阵分解和贝叶斯正源分离的T Tauri星光谱的分离。
6. Wheezing Sound Separation Based on Informed Inter-Segment Non-Negative Matrix Partial Co-Factorization [O] . Juan De La Torre Cruz, Francisco Jesús Cañadas Quesada, Nicolás Ruiz Reyes, 2020

机译：基于信息间非负矩阵部分协同因子的喘息声分离
7. Evolving Multi-Resolution Pooling CNN for Monaural Singing Voice Separation [O] . Weitao Yuan, Bofei Dong, Shengbei Wang, 2021

机译：演化多分辨率汇集CNN用于单声道歌唱语音分离

Multi-Stage Non-Negative Matrix Factorization for Monaural Singing Voice Separation

摘要

著录项

相似文献

相关主题

期刊订阅