Piano music transcription with fast convolutional sparse coding

机译：钢琴音乐转录快速卷积稀疏编码

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

Automatic music transcription (AMT) is the process of converting an acoustic musical signal into a symbolic musical representation, such as a MIDI file, which contains the pitches, the onsets and offsets of the notes and, possibly, their dynamics and sources (i.e., instruments). Most existing algorithms for AMT operate in the frequency domain, which introduces the well known time/frequency resolution trade-off of the Short Time Fourier Transform and its variants. In this paper, we propose a time-domain transcription algorithm based on an efficient convolutional sparse coding algorithm in an instrument-specific scenario, i.e., the dictionary is trained and tested on the same piano. The proposed method outperforms a current state-of-the-art AMT method by over 26% in F-measure, achieving a median F-measure of 93.6%, and drastically increases both time and frequency resolutions, especially for the lowest octaves of the piano keyboard.

机译：自动音乐转录（AMT）是将声音音乐信号转换为符号音乐表示的过程，例如MIDI文件，其中包含音符的音高，持续的持续和偏移，可能是它们的动态和源（即，仪器）。用于AMT的大多数现有算法在频域中操作，介绍了短时间傅里叶变换及其变体的众所周知的时间/频率分辨率折衷。在本文中，我们提出了一种基于仪器特定方案中有效卷积稀疏编码算法的时域转录算法，即，在同一钢琴上训练和测试字典。所提出的方法在F测量中以超过26％的方式优于最新的最先进的AMT方法，实现了93.6％的中位数，并且大大增加了时间和频率分辨率，特别是对于最低八位八个钢琴键盘。

著录项

来源
《IEEE International Workshop on Machine Learning for Signal Processing》|2015年||共6页
会议地点
作者
Cogliati Andrea; Zhiyao Duan; Wohlberg Brendt;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类信号处理;
关键词
Fourier transforms; audio coding; music; time-frequency analysis; AMT; MIDI file; automatic music transcription; fast convolutional sparse coding; frequency domain; median F-measure; musical signal; piano keyboard; piano music transcription; short time Fourier transform; symbolic musical representation; time-domain transcription algorithm; time-frequency resolution trade-off; Convolution; Convolutional codes; Dictionaries; Heuristic algorithms; Time-domain analysis; Time-frequency analysis; Automatic Music Transcription; Convolutional Sparse Coding; Shift Invariant; Sparse Representation;

机译：傅里叶变换;音频编码;音乐;时间频率分析;amt;MIDI文件;自动音乐转录;快速卷积稀疏编码;频域;中位数F测量;音乐信号;钢琴键盘;钢琴音乐转录;短时间傅里叶变换;符号音乐表示;时域转录算法;时间频分辨率折衷;卷积;卷积码;语言;时域分析;时间频率分析;自动音乐转录;卷积稀疏编码;扭转不变;稀疏表示;

相似文献

外文文献
中文文献
专利

1. Context-Dependent Piano Music Transcription With Convolutional Sparse Coding [J] . Andrea Cogliati, Zhiyao Duan, Brendt Wohlberg Audio, Speech, and Language Processing, IEEE/ACM Transactions on . 2016,第12期

机译：卷积稀疏编码的上下文相关钢琴音乐转录
2. Constrained non-negative sparse coding using learnt instrument templates for realtime music transcription [J] . J.J. Carabias-Orti, F.J. Rodriguez-Serrano, P. Vera-Candeas, Engineering Applications of Artificial Intelligence . 2013,第7期

机译：使用学习的乐器模板进行约束的非负稀疏编码以进行实时音乐转录
3. Fast convolutional sparse coding using matrix inversion lemma [J] . Sorel Michal, Sroubek Filip Digital Signal Processing . 2016,第Null期

机译：使用矩阵求逆引理的快速卷积稀疏编码
4. Piano music transcription with fast convolutional sparse coding [C] . Cogliati Andrea, Zhiyao Duan, Wohlberg Brendt IEEE International Workshop on Machine Learning for Signal Processing . 2015

机译：快速卷积稀疏编码的钢琴音乐转录
5. Fast space-varying convolution in stray light reduction, fast matrix vector multiplication using the sparse matrix transform, and activation detection in fMRI data analysis. [D] . Wei, Jianing. 2010

机译：快速减少杂散光的空间变化卷积，使用稀疏矩阵变换的快速矩阵向量乘法以及fMRI数据分析中的激活检测。
6. Fast Sparse Coding for Range Data Denoising with Sparse Ridges Constraint [O] . Zhi Gao, Mingjie Lao, Yongsheng Sang, 2018

机译：具有稀疏脊线约束的距离数据去噪的快速稀疏编码
7. Fast and Flexible Convolutional Sparse Coding [O] . Heide Felix, Heidrich Wolfgang, Wetzstein Gordon 2015

机译：快速灵活的卷积稀疏编码

Piano music transcription with fast convolutional sparse coding

摘要

著录项

相似文献

相关主题

期刊订阅