首页> 外文会议>IEEE International Conference on Acoustics, Speech and Signal Processing >Informed monaural source separation of music based on convolutional sparse coding
【24h】

Informed monaural source separation of music based on convolutional sparse coding

机译:基于卷积稀疏编码的音乐通知单源分离

获取原文

摘要

Monaural source separation is a challenging problem that has many important applications in music information retrieval. In this paper, we focus on the score-informed variant of this problem. While non-negative matrix factorization and some other approaches have been shown effective, few existing approaches have properly taken the phase information into account. There are unnatural sound in the separation result, as the phase of each source signal is considered equivalent to the phase of the mixed signal. To remedy this, we propose to perform source separation directly in the time domain using a convolutional sparse coding (CSC) approach. Evaluation on the Bach10 dataset shows that, when the instrument, pitch and onset/offset time are informed, the source to distortion ratio of the separation result reaches 8.59 dB, which is 2.02 dB higher than a state-of-the-art system called Soundprism.
机译:单声道源分离是一个具有挑战性的问题,在音乐信息检索中具有许多重要的应用。在本文中,我们专注于这个问题的得分信息。虽然非负数矩阵分解和一些其他方法已经有效地显示出来,但很少有现有方法已经适当考虑相位信息。分离结果中存在不自然声音,因为每个源信号的相位被认为等于混合信号的相位。要解决此问题,我们建议使用卷积稀疏编码(CSC)方法直接在时域中执行源分离。 Bach10 DataSet上的评估显示,当通知仪器,俯仰和开始/偏移时间时,分离结果的失真率源达到8.59 dB,比称为最先进的系统为2.02 dB声音。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号