Informed monaural source separation of music based on convolutional sparse coding



Abstract

Monaural source separation is a challenging problem with many important applications in music information retrieval. In this paper, we focus on the score-informed variant of this problem. While non-negative matrix factorization and some other approaches have been shown to be effective, few existing approaches properly take phase information into account. Because the phase of each source signal is assumed to equal the phase of the mixed signal, the separation result contains unnatural sounds. To remedy this, we propose to perform source separation directly in the time domain using a convolutional sparse coding (CSC) approach. Evaluation on the Bach10 dataset shows that, when the instrument, pitch, and onset/offset times are informed, the source-to-distortion ratio of the separation result reaches 8.59 dB, which is 2.02 dB higher than that of a state-of-the-art system called Soundprism.
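The time-domain idea in the abstract can be illustrated with a small sketch. It assumes a commonly used CSC objective, 0.5 * ||sum_m d_m * x_m - s||^2 + lambda * sum_m ||x_m||_1, where s is the mixture, d_m are dictionary atoms and x_m are sparse activations; the abstract does not state the paper's exact formulation. The atoms here are hand-made decaying sinusoids (hypothetical stand-ins for instrument- and pitch-specific atoms), the solver is plain ISTA chosen for brevity (not claimed to be the authors' solver), and all function names are illustrative.

```python
import numpy as np

def synthesize(atoms, acts):
    """Return one row per atom: the full linear convolution d_m * x_m."""
    return np.stack([np.convolve(x, d) for d, x in zip(atoms, acts)])

def objective(atoms, acts, mix, lam):
    """Assumed CSC cost: squared reconstruction error plus an L1 penalty."""
    resid = synthesize(atoms, acts).sum(axis=0) - mix
    return 0.5 * resid @ resid + lam * np.abs(acts).sum()

def csc_ista(mix, atoms, lam=0.05, n_iter=300):
    """Estimate sparse activations with ISTA (a stand-in solver)."""
    n_act = len(mix) - atoms.shape[1] + 1
    acts = np.zeros((len(atoms), n_act))
    # Upper bound on the Lipschitz constant of the gradient:
    # the sum over atoms of the squared L1 norm of each atom.
    step = 1.0 / np.sum(np.abs(atoms).sum(axis=1) ** 2)
    for _ in range(n_iter):
        resid = synthesize(atoms, acts).sum(axis=0) - mix
        # Adjoint of full convolution is "valid" cross-correlation.
        grad = np.stack([np.correlate(resid, d, mode="valid") for d in atoms])
        acts = acts - step * grad
        # Soft-thresholding enforces sparsity of the activations.
        acts = np.sign(acts) * np.maximum(np.abs(acts) - step * lam, 0.0)
    return acts

# Toy mixture of two "instruments", each a single hypothetical atom.
t = np.arange(64)
atoms = np.stack([np.sin(2 * np.pi * 0.05 * t) * np.exp(-t / 20),
                  np.sin(2 * np.pi * 0.13 * t) * np.exp(-t / 20)])
atoms /= np.linalg.norm(atoms, axis=1, keepdims=True)

true_acts = np.zeros((2, 200))
true_acts[0, [10, 90]] = 1.0   # note onsets of source 0
true_acts[1, [40, 150]] = 0.8  # note onsets of source 1
sources = synthesize(atoms, true_acts)
mix = sources.sum(axis=0)

est_acts = csc_ista(mix, atoms)
# Each source is rebuilt as a waveform, so no mixture phase is reused.
est_sources = synthesize(atoms, est_acts)
```

Each estimated source is synthesized directly as a waveform from its own atoms and activations, which is why this family of methods avoids borrowing the mixture's phase. In a score-informed setting, the known pitches and onset/offset times could be used to restrict which activations are allowed to be non-zero; that constraint is not implemented in this sketch.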


Bibliographic record

Source
IEEE International Conference on Acoustics, Speech and Signal Processing | 2015 | pp. 236-240 | 5 pages
Authors
Jao Ping-Keng; Yang Yi-Hsuan; Wohlberg Brendt;
Original format: PDF
Keywords
Convolutional sparse coding; dictionary learning; score-informed monaural source separation;


Similar literature


1. Blind Monaural Source Separation on Heart and Lung Sounds Based on Periodic-Coded Deep Autoencoder [J]. Tsai Kun-Hsi, Wang Wei-Chien, Cheng Chui-Hsuan. IEEE Journal of Biomedical and Health Informatics. 2020, No. 11
2. Demixing Jazz-Music: Sparse Coding Neural Gas for the Separation of Noisy Overcomplete Sources [J]. Kai Labusch, Erhardt Barth, Thomas Martinetz. Neural network world journal. 2009, No. 5
3. Monaural speech/music source separation using discrete energy separation algorithm [J]. Yevgeni Litvin, Israel Cohen, Dan Chazan. Signal processing. 2010, No. 12
4. Informed monaural source separation of music based on convolutional sparse coding [C]. Jao Ping-Keng, Yang Yi-Hsuan, Wohlberg Brendt. IEEE International Conference on Acoustics, Speech and Signal Processing. 2015
5. Score-informed musical source separation and reconstruction [D]. Han, Yushen. 2013
6. Automated chest screening based on a hybrid model of transfer learning and convolutional sparse denoising autoencoder [O]. Changmiao Wang, Ahmed Elazab, Fucang Jia. 2018
7. NMF based speech and music separation in monaural speech recordings with sparseness and temporal continuity constraints [O]. Tu Ming, Xie Xiang, Jiao Yishan. 2013
