Piano music transcription with fast convolutional sparse coding

机译：快速卷积稀疏编码的钢琴音乐转录

获取原文

获取外文期刊封面目录资料

页面导航

摘要
著录项
引文网络
相似文献
相关主题

摘要

Automatic music transcription (AMT) is the process of converting an acoustic musical signal into a symbolic musical representation, such as a MIDI file, which contains the pitches, the onsets and offsets of the notes and, possibly, their dynamics and sources (i.e., instruments). Most existing algorithms for AMT operate in the frequency domain, which introduces the well known time/frequency resolution trade-off of the Short Time Fourier Transform and its variants. In this paper, we propose a time-domain transcription algorithm based on an efficient convolutional sparse coding algorithm in an instrument-specific scenario, i.e., the dictionary is trained and tested on the same piano. The proposed method outperforms a current state-of-the-art AMT method by over 26% in F-measure, achieving a median F-measure of 93.6%, and drastically increases both time and frequency resolutions, especially for the lowest octaves of the piano keyboard.

机译：自动音乐转录（AMT）是将声音音乐信号转换为符号音乐表示形式（例如MIDI文件）的过程，其中包含音高，音符的起音和偏移以及音符的动态和来源（例如，仪器）。现有的大多数AMT算法都在频域中运行，这引入了短时傅立叶变换及其变体的众所周知的时间/频率分辨率权衡。在本文中，我们提出了一种在特定乐器场景下基于高效卷积稀疏编码算法的时域转录算法，即该词典是在同一架钢琴上进行训练和测试的。所提出的方法在F值方面比当前最新的AMT方法高出26％以上，中值F值达到93.6％，并显着提高了时间和频率分辨率，尤其是对于最低的八度音阶而言。钢琴键盘。

著录项

来源
《IEEE International Workshop on Machine Learning for Signal Processing》|2015年|1-6|共6页
会议地点
作者
Cogliati Andrea; Zhiyao Duan; Wohlberg Brendt;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类
关键词
Fourier transforms; audio coding; music; time-frequency analysis; AMT; MIDI file; automatic music transcription; fast convolutional sparse coding; frequency domain; median F-measure; musical signal; piano keyboard; piano music transcription; short time Fourier transform; symbolic musical representation; time-domain transcription algorithm; time-frequency resolution trade-off; Convolution; Convolutional codes; Dictionaries; Heuristic algorithms; Time-domain analysis; Time-frequency analysis; Automatic Music Transcription; Convolutional Sparse Coding; Shift Invariant; Sparse Representation;

机译：傅里叶变换;音频编码;音乐;时频分析; AMT; MIDI文件;自动音乐转录;快速卷积稀疏编码;频域;中值F测度;音乐信号;钢琴键盘;钢琴音乐转录;短时傅里叶变换;符号音乐表示法;时域转录算法;时频分辨率的权衡;卷积;卷积码;字典;启发式算法;时域分析;时频分析;自动音乐转录;卷积稀疏编码;不变移位;稀疏表示;

相似文献

外文文献
中文文献
专利

1. Context-Dependent Piano Music Transcription With Convolutional Sparse Coding [J] . Andrea Cogliati, Zhiyao Duan, Brendt Wohlberg Audio, Speech, and Language Processing, IEEE/ACM Transactions on . 2016,第12期

机译：卷积稀疏编码的上下文相关钢琴音乐转录
2. Constrained non-negative sparse coding using learnt instrument templates for realtime music transcription [J] . J.J. Carabias-Orti, F.J. Rodriguez-Serrano, P. Vera-Candeas, Engineering Applications of Artificial Intelligence . 2013,第7期

机译：使用学习的乐器模板进行约束的非负稀疏编码以进行实时音乐转录
3. Fast convolutional sparse coding using matrix inversion lemma [J] . Sorel Michal, Sroubek Filip Digital Signal Processing . 2016,第Null期

机译：使用矩阵求逆引理的快速卷积稀疏编码
4. Piano music transcription with fast convolutional sparse coding [C] . Cogliati Andrea, Zhiyao Duan, Wohlberg Brendt IEEE International Workshop on Machine Learning for Signal Processing . 2015

机译：钢琴音乐转录快速卷积稀疏编码
5. Fast space-varying convolution in stray light reduction, fast matrix vector multiplication using the sparse matrix transform, and activation detection in fMRI data analysis. [D] . Wei, Jianing. 2010

机译：快速减少杂散光的空间变化卷积，使用稀疏矩阵变换的快速矩阵向量乘法以及fMRI数据分析中的激活检测。
6. Fast Sparse Coding for Range Data Denoising with Sparse Ridges Constraint [O] . Zhi Gao, Mingjie Lao, Yongsheng Sang, 2018

机译：具有稀疏脊线约束的距离数据去噪的快速稀疏编码
7. Fast and Flexible Convolutional Sparse Coding [O] . Heide Felix, Heidrich Wolfgang, Wetzstein Gordon 2015

机译：快速灵活的卷积稀疏编码

Piano music transcription with fast convolutional sparse coding

摘要

著录项

引文网络

相似文献

相关主题

期刊订阅