首页> 外文期刊>Journal of the Audio Engineering Society >Singing Voice Separation by Low-Rank and Sparse Spectrogram Decomposition with Pre-learned Dictionaries
【24h】

Singing Voice Separation by Low-Rank and Sparse Spectrogram Decomposition with Pre-learned Dictionaries

机译:用预先学习的词典拼接低级和稀疏频谱图分解的语音分离

获取原文
获取原文并翻译 | 示例
获取外文期刊封面目录资料

摘要

Unsupervised spectrogram decomposition has shown promising results for singing voice separation in recent years. Its basic idea is to decompose the mixture spectrogram into a sparse spectrogram for the singing voice and a low-rank spectrogram for the background music. This approach, however, has two limitations. First, the unsupervised nature prevents the pre-learning of voice and background music dictionaries from widely available, although not related to the song being separated, isolated singing voice and background music recordings. Second, some components of the singing voice (e.g., fricatives) and the background music (e.g., less repetitive background) may not show the preferred sparse and low-rank properties, respectively. In this paper we propose to decompose the mixture spectrogram into three parts: a sparse spectrogram representing the singing voice, a low-rank spectrogram representing the background music, and a residual spectrogram for the components that are not identified by either the sparse or the low-rank spectrogram. Besides, we learn universal voice and music dictionaries from isolated singing voice and background music training data. Finally, we propose an approach named Low-rank and Sparse representation with Pre-learned Dictionaries under the framework of Alternating Direction Method of Multiplier. Experiments on the MIR-1K dataset and iKala dataset show its better performance.
机译:无监督的谱图分解显示近年来歌唱语音分离的有希望的结果。其基本思想是将混合谱图分解为唱歌语音的稀疏频谱图和背景音乐的低秩谱图。然而,这种方法有两个限制。首先,无监督的性质可以防止语音和背景音乐词典的预测从广泛使用,虽然与被分开的歌曲无关,隔离歌唱语音和背景乐谱。其次,歌唱语音的一些组件(例如,摩擦)和背景音乐(例如,较少的重复背景)分别不显示优选的稀疏和低秩属性。在本文中,我们建议将混合体谱图分解为三个部分:表示唱歌语音的稀疏频谱图,表示背景音乐的低级谱图,以及由稀疏或低的组件的组件的残余频谱图-rank频谱图。此外,我们学习来自孤立的歌唱语音和背景音乐训练数据的通用语音和音乐词典。最后,我们提出了一种在乘法器的交替方向方法的框架下命名为低级别和稀疏表示的方法。 MIR-1K DataSet和Ikala DataSet上的实验显示了更好的性能。

著录项

  • 来源
    《Journal of the Audio Engineering Society》 |2017年第5期|377-388|共12页
  • 作者单位

    Department of Mathematics Shanghai University Shanghai 200444 P R China;

    Department of Mathematics Shanghai University Shanghai 200444 P R China;

    Department of Electrical and Computer Engineering University of Rochester Rochester NY 14627 USA;

  • 收录信息
  • 原文格式 PDF
  • 正文语种 eng
  • 中图分类
  • 关键词

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号