首页> 外文会议>IEEE International Conference on Acoustics, Speech and Signal Processing >Automatic Transcription of Diatonic Harmonica Recordings
【24h】

Automatic Transcription of Diatonic Harmonica Recordings

机译:谐音口琴记录的自动转录

获取原文

摘要

This paper presents a method for automatic transcription of the diatonic Harmonica instrument. It estimates the multi-pitch activations through a spectrogram factorisation framework. This framework is based on Probabilistic Latent Component Analysis (PLCA) and uses a fixed 4-dimensional dictionary with spectral templates extracted from Harmonica's instrument timbre. Methods based on spectrogram factorisation may suffer from local-optima issues in the presence of harmonic overlap or considerable timbre variability. To alleviate this issue, we propose a set of harmonic constraints that are inherent to the Harmonica instrument note layout or are caused by specific diatonic Harmonica playing techniques. These constraints help to guide the factorisation process until convergence into meaningful multi-pitch activations is achieved. This work also builds a new audio dataset containing solo recordings of diatonic Harmonica excerpts and the respective multi-pitch annotations. We compare our proposed approach against multiple baseline techniques for automatic music transcription on this dataset and report the results based on frame-based F-measure statistics.
机译:本文提出了一种全音速口琴乐器的自动转录方法。它通过频谱图分解框架来估计多音高激活。该框架基于概率潜在成分分析(PLCA),并使用固定的4维字典,其中包含从口琴乐器音色中提取的光谱模板。在存在谐波重叠或相当大的音色变化性的情况下,基于频谱图分解的方法可能会遇到局部最优问题。为缓解此问题,我们提出了一组谐波约束,这些约束是口琴乐器音符布局固有的或由特定的全音阶口琴演奏技术引起的。这些约束条件有助于指导分解过程,直到实现有意义的多音高激活的收敛为止。这项工作还建立了一个新的音频数据集,其中包含全音阶口琴摘录的录音以及相应的多音高注释。我们将我们提出的方法与该数据集上用于自动音乐转录的多种基线技术进行比较,并基于基于帧的F量度统计报告结果。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号