Polyphonic piano note transcription with non-negative matrix factorization of differential spectrogram

机译：非差分矩阵图的非负矩阵分解的复音钢琴音符转录

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

Automatic music transcription is usually approached by using a time-frequency (TF) representation such as the short-time Fourier transform (STFT) spectrogram or the constant-Q transform. In this paper, we propose a novel yet simple TF representation that capitalizes the effectiveness of spectral flux features in highlighting note onset times. We refer to this representation as the differential spectrogram and investigate its usefulness for note-level piano transcription using two different non-negative matrix factorization (NMF) algorithms. Experiments on the MAPS ENSTDkCl dataset validate the advantages of the differential spectrogram over the STFT spectrogram for this task. Moreover, by adapting a state-of-the-art convolutional NMF algorithm with the differential spectrogram, we can achieve even better accuracy than the state-of-the-art on this dataset. Our analysis shows that the new representation suppresses unwanted TF patterns and performs particularly well in improving the recall rate.

机译：通常，通过使用时频（TF）表示（例如短时傅立叶变换（STFT）频谱图或常量Q变换）来实现自动音乐转录。在本文中，我们提出了一种新颖而又简单的TF表示形式，该表示形式充分利用了频谱通量特征在突出音符发作时间方面的有效性。我们将此表示形式称为差分频谱图，并使用两种不同的非负矩阵分解（NMF）算法研究其对音符级钢琴转录的有用性。在MAPS ENSTDkCl数据集上进行的实验验证了差分频谱图相对于STFT频谱图在此任务上的优势。此外，通过将最新的卷积NMF算法与差分频谱图配合使用，我们可以获得比该数据集上最新技术更高的准确性。我们的分析表明，新的表示形式可以抑制不必要的TF模式，并且在提高召回率方面表现特别出色。

著录项

来源
《IEEE International Conference on Acoustics, Speech and Signal Processing》|2017年|291-295|共5页
会议地点
作者
Lufei Gao; Li Su; Yi-Hsuan Yang; Tan Lee;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类
关键词
Spectrogram; Adaptation models; Music; Standards; Convolution; Algorithm design and analysis;

机译：频谱图;适应模型;音乐;标准;卷积;算法设计与分析;

相似文献

外文文献
中文文献
专利

1. Generative Spectrogram Factorization Models for Polyphonic Piano Transcription [J] . Audio, Speech, and Language Processing, IEEE Transactions on . 2010,第3期

机译：和弦钢琴转录的生成频谱图分解模型
2. Enforcing Harmonicity and Smoothness in Bayesian Non-Negative Matrix Factorization Applied to Polyphonic Music Transcription [J] . Audio, Speech, and Language Processing, IEEE Transactions on . 2010,第3期

机译：贝叶斯非负矩阵因式分解在和弦音乐转录中的增强谐和性
3. Constrained non-negative matrix factorization for score-informed piano music restoration [J] . Canadas-Quesada F. J., Vera-Candeas P., Martinez-Munoz D., Digital Signal Processing . 2016,第Null期

机译：约束非负矩阵因式分解用于乐谱信息还原的钢琴音乐
4. Polyphonic piano note transcription with non-negative matrix factorization of differential spectrogram [C] . Lufei Gao, Li Su, Yi-Hsuan Yang, IEEE International Conference on Acoustics, Speech and Signal Processing . 2017

机译：具有差分谱图非负矩阵分解的复音钢琴注意转录
5. Group Convex Orthogonal Non-negative Matrix Tri-Factorization with Applications in FC Fingerprinting [D] . ?Li, Kendrick 2020

机译：集团凸正交非负矩阵三分解与 FC 指纹应用
6. Monophonic and Polyphonic Wheezing Classification Based on Constrained Low-Rank Non-Negative Matrix Factorization [O] . Juan De La Torre Cruz, Francisco Jesús Cañadas Quesada, Nicolás Ruiz Reyes, 2021

机译：基于受约束低级别非负矩阵分子的单声道和多关喘息分类
7. A discriminative approach to polyphonic piano note transcription using supervised non-negative matrix factorization [O] . Felix Weninger, Christian Kirst, Hans-joachim Bungartz 2013

机译：使用有监督的非负矩阵分解对复调钢琴音符转录的判别方法

Polyphonic piano note transcription with non-negative matrix factorization of differential spectrogram

摘要

著录项

相似文献

相关主题

期刊订阅