首页> 外文会议>IEEE International Workshop on Machine Learning for Signal Processing >Piano music transcription with fast convolutional sparse coding
【24h】

Piano music transcription with fast convolutional sparse coding

机译:钢琴音乐转录快速卷积稀疏编码

获取原文

摘要

Automatic music transcription (AMT) is the process of converting an acoustic musical signal into a symbolic musical representation, such as a MIDI file, which contains the pitches, the onsets and offsets of the notes and, possibly, their dynamics and sources (i.e., instruments). Most existing algorithms for AMT operate in the frequency domain, which introduces the well known time/frequency resolution trade-off of the Short Time Fourier Transform and its variants. In this paper, we propose a time-domain transcription algorithm based on an efficient convolutional sparse coding algorithm in an instrument-specific scenario, i.e., the dictionary is trained and tested on the same piano. The proposed method outperforms a current state-of-the-art AMT method by over 26% in F-measure, achieving a median F-measure of 93.6%, and drastically increases both time and frequency resolutions, especially for the lowest octaves of the piano keyboard.
机译:自动音乐转录(AMT)是将声音音乐信号转换为符号音乐表示的过程,例如MIDI文件,其中包含音符的音高,持续的持续和偏移,可能是它们的动态和源(即,仪器)。用于AMT的大多数现有算法在频域中操作,介绍了短时间傅里叶变换及其变体的众所周知的时间/频率分辨率折衷。在本文中,我们提出了一种基于仪器特定方案中有效卷积稀疏编码算法的时域转录算法,即,在同一钢琴上训练和测试字典。所提出的方法在F测量中以超过26%的方式优于最新的最先进的AMT方法,实现了93.6%的中位数,并且大大增加了时间和频率分辨率,特别是对于最低八位八个钢琴键盘。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号