首页> 外文会议>IEEE International Conference on Acoustics, Speech and Signal Processing >Polyphonic piano note transcription with non-negative matrix factorization of differential spectrogram
【24h】

Polyphonic piano note transcription with non-negative matrix factorization of differential spectrogram

机译:非差分矩阵图的非负矩阵分解的复音钢琴音符转录

获取原文

摘要

Automatic music transcription is usually approached by using a time-frequency (TF) representation such as the short-time Fourier transform (STFT) spectrogram or the constant-Q transform. In this paper, we propose a novel yet simple TF representation that capitalizes the effectiveness of spectral flux features in highlighting note onset times. We refer to this representation as the differential spectrogram and investigate its usefulness for note-level piano transcription using two different non-negative matrix factorization (NMF) algorithms. Experiments on the MAPS ENSTDkCl dataset validate the advantages of the differential spectrogram over the STFT spectrogram for this task. Moreover, by adapting a state-of-the-art convolutional NMF algorithm with the differential spectrogram, we can achieve even better accuracy than the state-of-the-art on this dataset. Our analysis shows that the new representation suppresses unwanted TF patterns and performs particularly well in improving the recall rate.
机译:通常,通过使用时频(TF)表示(例如短时傅立叶变换(STFT)频谱图或常量Q变换)来实现自动音乐转录。在本文中,我们提出了一种新颖而又简单的TF表示形式,该表示形式充分利用了频谱通量特征在突出音符发作时间方面的有效性。我们将此表示形式称为差分频谱图,并使用两种不同的非负矩阵分解(NMF)算法研究其对音符级钢琴转录的有用性。在MAPS ENSTDkCl数据集上进行的实验验证了差分频谱图相对于STFT频谱图在此任务上的优势。此外,通过将最新的卷积NMF算法与差分频谱图配合使用,我们可以获得比该数据集上最新技术更高的准确性。我们的分析表明,新的表示形式可以抑制不必要的TF模式,并且在提高召回率方面表现特别出色。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号