首页> 外文会议>IEEE International Workshop on Machine Learning for Signal Processing >Piano music transcription with fast convolutional sparse coding
【24h】

Piano music transcription with fast convolutional sparse coding

机译:快速卷积稀疏编码的钢琴音乐转录

获取原文
获取外文期刊封面目录资料

摘要

Automatic music transcription (AMT) is the process of converting an acoustic musical signal into a symbolic musical representation, such as a MIDI file, which contains the pitches, the onsets and offsets of the notes and, possibly, their dynamics and sources (i.e., instruments). Most existing algorithms for AMT operate in the frequency domain, which introduces the well known time/frequency resolution trade-off of the Short Time Fourier Transform and its variants. In this paper, we propose a time-domain transcription algorithm based on an efficient convolutional sparse coding algorithm in an instrument-specific scenario, i.e., the dictionary is trained and tested on the same piano. The proposed method outperforms a current state-of-the-art AMT method by over 26% in F-measure, achieving a median F-measure of 93.6%, and drastically increases both time and frequency resolutions, especially for the lowest octaves of the piano keyboard.
机译:自动音乐转录(AMT)是将声音音乐信号转换为符号音乐表示形式(例如MIDI文件)的过程,其中包含音高,音符的起音和偏移以及音符的动态和来源(例如,仪器)。现有的大多数AMT算法都在频域中运行,这引入了短时傅立叶变换及其变体的众所周知的时间/频率分辨率权衡。在本文中,我们提出了一种在特定乐器场景下基于高效卷积稀疏编码算法的时域转录算法,即该词典是在同一架钢琴上进行训练和测试的。所提出的方法在F值方面比当前最新的AMT方法高出26%以上,中值F值达到93.6%,并显着提高了时间和频率分辨率,尤其是对于最低的八度音阶而言。钢琴键盘。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号