首页> 外文期刊>Multimedia Tools and Applications >An effective method for audio-to-score alignment using onsets and modified constant Q spectra
【24h】

An effective method for audio-to-score alignment using onsets and modified constant Q spectra

机译:使用起点和修正的恒定Q谱来进行音频与乐谱对齐的有效方法

获取原文
获取原文并翻译 | 示例
           

摘要

This paper proposes an effective algorithm for polyphonic audio-to-score alignment that aligns a polyphonic music performance to its corresponding score. The proposed framework consists of three steps: onset detection, note matching, and dynamic programming. In the first step, onsets are detected and then onset features are extracted by applying the constant Q transform around each onset. A similarity matrix is computed using a note-matching function to evaluate the similarity between concurrent notes in the music score and onsets in the audio recording. Finally, dynamic programming is used to extract the optimal alignment path in the similarity matrix. We compared five onset detectors and three spectrum difference vectors at selected audio onsets. The experimental results revealed that our method achieved higher precision than did the other algorithms included for comparison. This paper also proposes an online approach based on onset detection that can detect most notes within only 10ms. Based on our experimental results, this online approach outperforms all methods included for comparison when the tolerance window is 50ms.
机译:本文提出了一种有效的复音音频至乐谱对齐算法,该算法可将复音音乐演奏与其对应的乐谱对齐。提议的框架包括三个步骤:发作检测,音符匹配和动态编程。第一步,检测起点,然后通过在每个起点周围应用恒定Q变换来提取起点特征。使用音符匹配功能计算相似度矩阵,以评估乐谱中的并发音符和录音中的开始音之间的相似度。最后,使用动态规划来提取相似矩阵中的最佳对齐路径。我们在选定的音频开始位置比较了五个开始检测器和三个频谱差异向量。实验结果表明,与用于比较的其他算法相比,我们的方法具有更高的精度。本文还提出了一种基于发作检测的在线方法,该方法可以在10ms内检测到大多数音符。根据我们的实验结果,当公差窗口为50ms时,此在线方法优于所有用于比较的方法。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号