Non-Negative Group Sparsity with Subspace Note Modelling for Polyphonic Transcription

O'Hanlon, Ken; Nagano, Hidehisa; Keriven, Nicolas; Plumbley, Mark D.

Abstract

Automatic music transcription (AMT) can be performed by deriving a pitch-time representation through decomposition of a spectrogram with a dictionary of pitch-labelled atoms. Typically, non-negative matrix factorisation (NMF) methods are used to decompose magnitude spectrograms. One atom is often used to represent each note. However, the spectrum of a note may change over time. Previous research considered this variability using different atoms to model specific parts of a note, or large dictionaries comprised of datapoints from the spectrograms of full notes. In this paper, the use of subspace modelling of note spectra is explored, with group sparsity employed as a means of coupling activations of related atoms into a pitched subspace. Stepwise and gradient-based methods for non-negative group sparse decompositions are proposed. Finally, a group sparse NMF approach is used to tune a generic harmonic subspace dictionary, leading to improved NMF-based AMT results.
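
The idea of coupling related atoms into a pitched subspace can be illustrated with a minimal sketch. It is not the stepwise or gradient-based algorithm proposed in the paper; it swaps in a generic multiplicative update for a Euclidean cost with an L2,1-style group penalty, applied to a fixed dictionary. The function names, the penalty weight `lam`, and the toy dictionary (each pitch confined to its own frequency band, far simpler than real note spectra) are all assumptions made for illustration.

```python
import numpy as np


def group_sparse_activations(V, D, groups, lam=0.1, n_iter=200, eps=1e-12):
    """Estimate non-negative H with V ~= D @ H under a group penalty.

    Illustrative objective (an assumption, not the paper's formulation):
        0.5 * ||V - D H||_F^2 + lam * sum over groups g and frames t of ||H[g, t]||_2
    `groups` is a list of index lists that must partition the atoms (columns of D).
    """
    rng = np.random.default_rng(0)
    H = rng.random((D.shape[1], V.shape[1])) + eps
    DtV = D.T @ V
    DtD = D.T @ D
    for _ in range(n_iter):
        # L2 norm of each group in each frame, broadcast to every atom of the group.
        norms = np.zeros_like(H)
        for idx in groups:
            norms[idx, :] = np.sqrt((H[idx, :] ** 2).sum(axis=0, keepdims=True))
        # Multiplicative update: the penalty gradient lam * H / norms joins the denominator.
        H *= DtV / (DtD @ H + lam * H / (norms + eps) + eps)
    return H


def pitch_salience(H, groups):
    """Collapse atom activations to one row per pitch via the group L2 norm."""
    return np.stack([np.linalg.norm(H[idx, :], axis=0) for idx in groups])


def toy_example():
    """Hypothetical toy data: 3 pitches, 2 atoms per pitch, 12 frequency bins."""
    rng = np.random.default_rng(1)
    groups = [[0, 1], [2, 3], [4, 5]]
    D = np.zeros((12, 6))
    for g, idx in enumerate(groups):
        # Each pitch occupies its own band of 4 bins (a deliberate simplification).
        D[4 * g:4 * g + 4, idx] = rng.uniform(0.5, 1.5, size=(4, 2))
    H_true = np.zeros((6, 10))
    H_true[0:2, :5] = 1.0   # pitch 0 sounds in frames 0-4
    H_true[4:6, 5:] = 1.0   # pitch 2 sounds in frames 5-9
    V = D @ H_true
    H = group_sparse_activations(V, D, groups)
    return pitch_salience(H, groups), H
```

Calling `toy_example()` returns a pitch-by-time salience matrix and the underlying atom activations. In a real transcription setting the dictionary would hold several spectral atoms per note, and the salience matrix would be thresholded to obtain a piano roll.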

Bibliographic details

Source
Audio, Speech, and Language Processing, IEEE Transactions on, 2016, Issue 3, pp. 530-542 (13 pages)
Authors
O'Hanlon, Ken; Nagano, Hidehisa; Keriven, Nicolas; Plumbley, Mark D.
Author affiliation

Centre for Digital Music, Queen Mary University of London, London, U.K.

Record information
Original format: PDF
Language: English
Keywords
Automatic music transcription; group sparsity; non-negative matrix factorisation; stepwise optimal

