首页> 外国专利> Context-dependent piano music transcription with convolutional sparse coding

Context-dependent piano music transcription with convolutional sparse coding

机译:卷积稀疏编码的上下文相关钢琴音乐转录

摘要

The present disclosure presents a novel approach to automatic transcription of piano music in a context-dependent setting. Embodiments described herein may employ an efficient algorithm for convolutional sparse coding to approximate a music waveform as a summation of piano note waveforms convolved with associated temporal activations. The piano note waveforms may be pre-recorded for a particular piano that is to be transcribed and may optionally be pre-recorded in the specific environment where the piano performance is to be performed. During transcription, the note waveforms may be fixed and associated temporal activations may be estimated and post-processed to obtain the pitch and onset transcription. Experiments have shown that embodiments of the disclosure significantly outperform state-of-the-art music transcription methods trained in the same context-dependent setting, in both transcription accuracy and time precision, in various scenarios including synthetic, anechoic, noisy, and reverberant environments.
机译:本公开提出了一种在上下文相关的环境中自动复制钢琴音乐的新颖方法。本文描述的实施例可以采用用于卷积稀疏编码的有效算法,以将音乐波形近似为与相关的时间激活卷积的钢琴音符波形的总和。可以为要转录的特定钢琴预先记录钢琴音符波形,并且可以可选地在要执行钢琴演奏的特定环境中预先记录钢琴音符波形。在转录期间,音符波形可以是固定的,并且相关的时间激活可以被估计和后处理以获得音高和开始转录。实验表明,本公开的实施例在包括合成,无回声,嘈杂和混响环境在内的各种情况下,在转录准确度和时间精确度方面均显着优于在相同的上下文相关设置中训练的最新音乐转录方法。 。

著录项

相似文献

  • 专利
  • 外文文献
  • 中文文献
获取专利

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号