首页> 外文期刊>ACM transactions on intelligent systems >Tempo Driven Audio-to-Score Alignment Using Spectral Decomposition and Online Dynamic Time Warping
【24h】

Tempo Driven Audio-to-Score Alignment Using Spectral Decomposition and Online Dynamic Time Warping

机译:使用频谱分解和在线动态时间规整的速度驱动音频到分数对齐

获取原文
获取原文并翻译 | 示例
           

摘要

In this article, we present an online score following framework designed to deal with automatic accompaniment. The proposed framework is based on spectral factorization and online Dynamic Time Warping (DTW) and has two separated stages: preprocessing and alignment. In the first one, we convert the score into a reference audio signal using a MIDI synthesizer software and we analyze the provided information in order to obtain the spectral patterns (i.e., basis functions) associated to each score unit. In this work, a score unit represents the occurrence of concurrent or isolated notes in the score. These spectral patterns are learned from the synthetic MIDI signal using a method based on Non-negative Matrix Factorization (NMF) with Beta-divergence, where the gains are initialized as the ground-truth transcription inferred from the MIDI. On the second stage, a non-iterative signal decomposition method with fixed spectral patterns per score unit is used over the magnitude spectrogram of the input signal resulting in a distortion matrix that can be interpreted as the cost of the matching for each score unit at each frame. Finally, the relation between the performance and the musical score times is obtained using a strategy based on online DTW, where the optimal path is biased by the speed of interpretation. Our system has been evaluated and compared to other systems, yielding reliable results and performance.
机译:在本文中,我们提出了一个旨在处理自动伴奏的在线乐谱跟踪框架。所提出的框架基于频谱分解和在线动态时间规整(DTW),并具有两个分离的阶段:预处理和对齐。在第一个中,我们使用MIDI合成器软件将乐谱转换为参考音频信号,并分析提供的信息,以获得与每个乐谱单元相关的频谱模式(即基本函数)。在这项工作中,分数单位表示分数中并发或孤立音符的出现。使用基于具有Beta散度的非负矩阵分解(NMF)的方法从合成MIDI信号中学习这些频谱模式,其中,将增益初始化为从MIDI推断出的真实转录。在第二阶段,在输入信号的幅度谱图上使用每个得分单位具有固定频谱模式的非迭代信号分解方法,从而生成失真矩阵,该失真矩阵可以解释为每个得分单位的匹配成本。帧。最后,使用基于在线DTW的策略获得演奏和乐谱时间之间的关系,其中最佳路径受解释速度的影响。我们的系统已经过评估,并与其他系统进行比较,得出了可靠的结果和性能。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号