...
首页> 外文期刊>Applied Sciences >Polyphonic Piano Transcription with a Note-Based Music Language Model
【24h】

Polyphonic Piano Transcription with a Note-Based Music Language Model

机译:基于音符的音乐语言模型的复音钢琴转录

获取原文
           

摘要

This paper proposes a note-based music language model (MLM) for improving note-level polyphonic piano transcription. The MLM is based on the recurrent structure, which could model the temporal correlations between notes in music sequences. To combine the outputs of the note-based MLM and acoustic model directly, an integrated architecture is adopted in this paper. We also propose an inference algorithm, in which the note-based MLM is used to predict notes at the blank onsets in the thresholding transcription results. The experimental results show that the proposed inference algorithm improves the performance of note-level transcription. We also observe that the combination of the restricted Boltzmann machine (RBM) and recurrent structure outperforms a single recurrent neural network (RNN) or long short-term memory network (LSTM) in modeling the high-dimensional note sequences. Among all the MLMs, LSTM-RBM helps the system yield the best results on all evaluation metrics regardless of the performance of acoustic models.
机译:本文提出了一种基于音符的音乐语言模型(MLM),用于改善音符级和弦钢琴的转录。 MLM基于循环结构,可以对音乐序列中音符之间的时间相关性进行建模。为了将基于音符的传销和声学模型的输出直接结合,本文采用了集成架构。我们还提出了一种推理算法,其中基于笔记的MLM用于预测阈值转录结果中空白处的笔记。实验结果表明,所提出的推理算法提高了音符级转录的性能。我们还观察到,在建模高维音符序列时,受限的Boltzmann机(RBM)和递归结构的组合优于单个递归神经网络(RNN)或长短期记忆网络(LSTM)。在所有MLM中,无论声学模型的性能如何,LSTM-RBM都能帮助系统在所有评估指标上产生最佳结果。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号