首页> 外文会议>IEEE International Conference on Acoustics, Speech and Signal Processing >Informed monaural source separation of music based on convolutional sparse coding
【24h】

Informed monaural source separation of music based on convolutional sparse coding

机译:基于卷积稀疏编码的音乐单声道声源分离

获取原文

摘要

Monaural source separation is a challenging problem that has many important applications in music information retrieval. In this paper, we focus on the score-informed variant of this problem. While non-negative matrix factorization and some other approaches have been shown effective, few existing approaches have properly taken the phase information into account. There are unnatural sound in the separation result, as the phase of each source signal is considered equivalent to the phase of the mixed signal. To remedy this, we propose to perform source separation directly in the time domain using a convolutional sparse coding (CSC) approach. Evaluation on the Bach10 dataset shows that, when the instrument, pitch and onset/offset time are informed, the source to distortion ratio of the separation result reaches 8.59 dB, which is 2.02 dB higher than a state-of-the-art system called Soundprism.
机译:单声道音源分离是一个具有挑战性的问题,在音乐信息检索中具有许多重要的应用。在本文中,我们重点讨论此问题的分数告知变体。尽管非负矩阵分解和其他一些方法已被证明是有效的,但很少有现有方法适当地考虑了相位信息。分离结果中会有不自然的声音,因为每个源信号的相位都被认为等同于混合信号的相位。为了解决这个问题,我们建议使用卷积稀疏编码(CSC)方法在时域中直接执行源分离。对Bach10数据集的评估表明,在告知仪器,音高和开始/偏移时间后,分离结果的源失真比达到8.59 dB,这比称为的最新系统高2.02 dB。声音棱镜。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号