首页> 外文会议>IEEE International Conference on Acoustics, Speech and Signal Processing >A score-informed shift-invariant extension of complex matrix factorization for improving the separation of overlapped partials in music recordings
【24h】

A score-informed shift-invariant extension of complex matrix factorization for improving the separation of overlapped partials in music recordings

机译:分数信息的复杂矩阵因式分解不变扩展,用于改善音乐录音中重叠部分的分离

获取原文

摘要

Similar to non-negative matrix factorization (NMF), complex matrix factorization (CMF) can be used to decompose a given music recording into individual sound sources. In contrast to NMF, CMF models both the magnitude and phase of a source, which can improve the separation of overlapped partials. However, the shift-invariance for spectral templates enabling NMF-based methods to efficiently model vibrato in music is not available with CMF. Further, the estimation of an entire phase matrix for each source results in a high number of parameters in CMF, which often leads to poor local minima. In this paper we show that score information provides a source of prior knowledge rich enough to stabilize the CMF parameter estimation, without sacrificing its expressive power. As a second contribution, we present a shift-invariant extension to CMF bringing the vibrato-modeling capabilities of NMF to CMF. As our experiments demonstrate our proposed method consistently improves the separation quality for overlapped partials compared to score-informed NMF.
机译:与非负矩阵分解(NMF)相似,复数矩阵分解(CMF)可用于将给定的音乐录制分解为单独的声源。与NMF相比,CMF可以对源的大小和相位进行建模,从而可以改善重叠部分的分离。但是,CMF无法使用频谱模板的不变性,该频谱不变性使基于NMF的方法可以有效地对音乐中的颤音进行建模。此外,对每个源的整个相位矩阵的估计会导致CMF中有大量参数,这通常会导致较差的局部最小值。在本文中,我们表明分数信息提供了足够丰富的先验知识,可以稳定CMF参数估计,而又不牺牲其表达能力。作为第二个贡献,我们提出了对CMF的不变位移扩展,将NMF的颤音建模功能引入了CMF。正如我们的实验所证明的,与分数通知的NMF相比,我们提出的方法可不断提高重叠部分的分离质量。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号