首页> 外文期刊>Audio, Speech, and Language Processing, IEEE Transactions on >Non-Negative Group Sparsity with Subspace Note Modelling for Polyphonic Transcription
【24h】

Non-Negative Group Sparsity with Subspace Note Modelling for Polyphonic Transcription

机译:带有子空间音符建模的非负群稀疏性用于复音转录

获取原文
获取原文并翻译 | 示例
           

摘要

Automatic music transcription (AMT) can be performed by deriving a pitch-time representation through decomposition of a spectrogram with a dictionary of pitch-labelled atoms. Typically, non-negative matrix factorisation (NMF) methods are used to decompose magnitude spectrograms. One atom is often used to represent each note. However, the spectrum of a note may change over time. Previous research considered this variability using different atoms to model specific parts of a note, or large dictionaries comprised of datapoints from the spectrograms of full notes. In this paper, the use of subspace modelling of note spectra is explored, with group sparsity employed as a means of coupling activations of related atoms into a pitched subspace. Stepwise and gradient-based methods for non-negative group sparse decompositions are proposed. Finally, a group sparse NMF approach is used to tune a generic harmonic subspace dictionary, leading to improved NMF-based AMT results.
机译:自动音乐转录(AMT)可以通过分解带有音高标记原子字典的声谱图得出音高时间表示来执行。通常,非负矩阵分解(NMF)方法用于分解幅度谱图。通常使用一个原子来表示每个音符。但是,音符的频谱可能会随时间变化。以前的研究考虑到这种可变性,它使用不同的原子来建模音符的特定部分,或者是由完整音符的频谱图中的数据点组成的大型词典。在本文中,探索了音符谱图的子空间建模的用途,其中组稀疏性被用作将相关原子的激活耦合到倾斜的子空间中的一种手段。提出了基于步长和梯度的非负群稀疏分解方法。最后,使用组稀疏NMF方法调整通用谐波子空间字典,从而改善了基于NMF的AMT结果。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号