首页> 外文会议>2011 IEEE International Conference on Acoustics, Speech and Signal Processing >Polyphonic audio-to-score alignment based on Bayesian Latent Harmonic Allocation Hidden Markov Model
【24h】

Polyphonic audio-to-score alignment based on Bayesian Latent Harmonic Allocation Hidden Markov Model

机译:基于贝叶斯隐性谐波分配隐马尔可夫模型的复音音频比分对准

获取原文

摘要

This paper presents a Bayesian method for temporally aligning a music score and an audio rendition. A critical problem in audio-to-score alignment is in dealing with the wide variety of timbre and volume of the audio rendition. In contrast with existing works that achieve this through ad-hoc feature design or careful training of tone models, we propose a Bayesian audio-to-score alignment method by modeling music performance as a Bayesian Hidden Markov Model, each state of which emits a Bayesian signal model based on Latent Harmonic Allocation. After attenuating reverberation, variational Bayes method is used to iteratively adapt the alignment, instrument tone model and the volume balance at each position of the score. The method is evaluated using sixty works of classical music of a variety of instrumentation ranging from solo piano to full orchestra. We verify that our method improves the alignment accuracy compared to dynamic time warping based on chroma vector for orchestral music, or our method employed in a maximum likelihood setting.
机译:本文提出了一种贝叶斯方法,用于在时间上对齐乐谱和音频再现。音频与片段的对齐方式中的一个关键问题是如何处理多种不同的音色和音量。与通过特设特征设计或对音调模型进行仔细训练来实现此目的的现有作品相比,我们提出了一种贝叶斯音频到分数对齐方法,该方法通过将音乐性能建模为贝叶斯隐马尔可夫模型来建模,每种状态都发出贝叶斯基于潜在谐波分配的信号模型。减弱混响后,使用变分贝叶斯方法迭代调整比分,乐器音调模型和乐谱每个位置的音量平衡。该方法是使用60种古典音乐作品进行评估的,这些作品包括从独奏钢琴到完整管弦乐队的各种乐器。我们验证了与基于管弦音乐的色度矢量的动态时间规整相比,或在最大似然设置中采用的方法,我们的方法与动态时间扭曲相比具有更高的对齐精度。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号