首页> 外文会议>IEEE International Conference on Acoustics, Speech and Signal Processing >An efficient method for polyphonic audio-to-score alignment using onset detection and constant Q transform
【24h】

An efficient method for polyphonic audio-to-score alignment using onset detection and constant Q transform

机译:一种使用开始检测和恒定Q变换进行复音音频与音高对齐的有效方法

获取原文
获取外文期刊封面目录资料

摘要

This paper proposes an innovative method that aligns a polyphonic audio recording of music to its corresponding symbolic score. In the first step, we perform onset detection and then apply constant Q transform around each onset. A similarity matrix is computed by using a scoring function which evaluates the similarity between notes in the music score and onsets in the audio recording. At last, we use dynamic programming to extract the best alignment path in the similarity matrix. We compared two onset detectors and two note matching methods. Our method is more efficient and has higher precision than the traditional chroma-based DTW method. Our algorithm achieved the best precision, which are 10% higher than the compared traditional algorithm when the tolerance window is 50 ms.
机译:本文提出了一种创新的方法,可以将音乐的复音录音与其相应的符号乐谱对齐。第一步,我们执行发作检测,然后在每次发作周围应用恒定的Q变换。通过使用评分函数计算相似度矩阵,该评分函数评估乐谱中的音符与音频记录中的开始之间的相似度。最后,我们使用动态编程来提取相似矩阵中的最佳对齐路径。我们比较了两个发作检测器和两种音符匹配方法。与传统的基于色度的DTW方法相比,我们的方法效率更高且具有更高的精度。当容差窗口为50 ms时,我们的算法达到了最佳精度,比传统算法高出10%。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号