IEEE International Conference on Acoustics, Speech and Signal Processing

Piano music transcription modeling note temporal evolution



Abstract

Automatic music transcription (AMT) is the process of converting an acoustic musical signal into a symbolic musical representation such as a MIDI piano roll, which contains the pitches, the onsets and offsets of the notes and, possibly, their dynamic and source (i.e., instrument). Existing algorithms for AMT commonly identify pitches and their saliences in each frame and then form notes in a post-processing stage, which applies a combination of thresholding, pruning and smoothing operations. Very few existing methods consider the note temporal evolution over multiple frames during the pitch identification stage. In this work we propose a note-based spectrogram factorization method that uses the entire temporal evolution of piano notes as a template dictionary. The method uses an artificial neural network to detect note onsets from the audio spectral flux. Next, it estimates the notes present in each audio segment between two successive onsets with a greedy search algorithm. Finally, the spectrogram of each segment is factorized using a discrete combination of note templates comprised of full note spectrograms of individual piano notes sampled at different dynamic levels. We also propose a new psychoacoustically informed measure for spectrogram similarity.
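The following minimal Python sketch illustrates the segment-and-match idea described in the abstract. It is not the authors' implementation: the neural-network onset detector is replaced here by a simple spectral-flux peak picker, and the greedy search scores candidate note templates with a plain Euclidean distance rather than the paper's psychoacoustically informed similarity measure. All function names, template shapes, and parameters are illustrative assumptions.

```python
# Sketch of: onset-based segmentation + greedy selection of full-note templates.
# Assumptions: magnitude spectrograms as NumPy arrays, templates already
# cropped/padded to the segment length; Euclidean error stands in for the
# paper's psychoacoustic similarity measure.
import numpy as np


def spectral_flux_onsets(spec, threshold=0.5):
    """Pick onset frames where the positive spectral flux exceeds a threshold."""
    flux = np.maximum(np.diff(spec, axis=1), 0.0).sum(axis=0)
    flux = flux / (flux.max() + 1e-9)
    return [t + 1 for t in range(len(flux)) if flux[t] > threshold]


def greedy_note_search(segment, templates, max_notes=6):
    """Greedily add note templates whose sum best approximates the segment.

    segment   : (n_bins, n_frames) spectrogram between two successive onsets.
    templates : dict note name -> (n_bins, n_frames) full-note spectrogram.
    """
    residual = segment.copy()
    chosen = []
    for _ in range(max_notes):
        best_note, best_err = None, np.linalg.norm(residual)
        for note, tmpl in templates.items():
            if note in chosen:
                continue
            err = np.linalg.norm(residual - tmpl)
            if err < best_err:
                best_note, best_err = note, err
        if best_note is None:  # no remaining template reduces the error
            break
        chosen.append(best_note)
        residual = residual - templates[best_note]
    return chosen


if __name__ == "__main__":
    # Toy data: two synthetic "note" templates and their noisy mixture.
    rng = np.random.default_rng(0)
    n_bins, n_frames = 64, 40
    templates = {"C4": rng.random((n_bins, n_frames)),
                 "E4": rng.random((n_bins, n_frames))}
    spec = templates["C4"] + templates["E4"] + 0.05 * rng.random((n_bins, n_frames))
    print("onset frames:", spectral_flux_onsets(spec)[:5])
    print("notes in segment:", greedy_note_search(spec, templates))
```

In the paper's actual pipeline, the onsets come from an artificial neural network applied to the audio spectral flux, and the note templates are spectrograms of individual piano notes sampled at different dynamic levels; the sketch only mirrors the control flow of segmenting at onsets and greedily explaining each segment with a discrete combination of templates.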
