首页> 外文期刊>Neural computing & applications >Singing voice separation with pre-learned dictionary and reconstructed voice spectrogram
【24h】

Singing voice separation with pre-learned dictionary and reconstructed voice spectrogram

机译:使用预先学习的字典和重建语音谱图来唱歌语音分离

获取原文
获取原文并翻译 | 示例
           

摘要

Recently the mixture spectrogram of a song is usually considered as a superposition of a sparse spectrogram and a low-rank spectrogram, which correspond to the vocal part and the accompaniment part of the song, respectively. Based on this observation, one can separate singing voice from the background music. However, the quality of such separation might be limited, since the vocal part may be not described very well by low rank, and moreover its more prior information, such as annotation, should be considered when designing separation algorithm. Based on these considerations, in this paper, we present two categories, time-frequency-based source separation algorithms. Specifically, one incorporates both the vocal and instrumental spectrograms as sparse matrix and low-rank matrix, meanwhile combines some side information of vocal part, i.e., the reconstructed voice spectrogram from the annotation. The others further consider both the vocal and instrumental spectrograms as sparse matrix and group-sparse matrix, respectively. Evaluations on the iKala dataset show that the proposed methods are effective and efficient for both the separated singing voice and music accompaniment.
机译:最近,歌曲的混合谱图通常被认为是稀疏谱图的叠加和低秩谱图,其分别对应于歌曲的声音部分和伴奏部分。基于这种观察,可以将歌声与背景音乐分开。然而,这种分离的质量可能受到限制,因为声音部分可以通过低等级进行非常好,而且在设计分离算法时,应该考虑其更高的信息,例如注释。在本文的基础上,我们提出了两类,基于时频的源分离算法。具体地,一种将声乐和仪器谱图与稀疏矩阵和低秩矩阵结合在一起,同时组合了声乐部分的某些侧面信息,即,来自注释的重建语音谱图。其他人进一步考虑声乐和乐谱分别作为稀疏矩阵和组稀疏矩阵。 IKALA数据集的评估表明,所提出的方法对于分离的歌唱语音和音乐伴奏是有效和有效的。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号