首页> 外文期刊>Selected Topics in Signal Processing, IEEE Journal of >Joint Multi-Pitch Detection Using Harmonic Envelope Estimation for Polyphonic Music Transcription
【24h】

Joint Multi-Pitch Detection Using Harmonic Envelope Estimation for Polyphonic Music Transcription

机译:使用和声包络估计的多音调联合音乐复音

获取原文
获取原文并翻译 | 示例

摘要

In this paper, a method for automatic transcription of music signals based on joint multiple-F0 estimation is proposed. As a time–frequency representation, the constant-Q resonator time–frequency image is employed, while a novel noise suppression technique based on pink noise assumption is applied in a preprocessing step. In the multiple-F0 estimation stage, the optimal tuning and inharmonicity parameters are computed and a salience function is proposed in order to select pitch candidates. For each pitch candidate combination, an overlapping partial treatment procedure is used, which is based on a novel spectral envelope estimation procedure for the log-frequency domain, in order to compute the harmonic envelope of candidate pitches. In order to select the optimal pitch combination for each time frame, a score function is proposed which combines spectral and temporal characteristics of the candidate pitches and also aims to suppress harmonic errors. For postprocessing, hidden Markov models (HMMs) and conditional random fields (CRFs) trained on MIDI data are employed, in order to boost transcription accuracy. The system was trained on isolated piano sounds from the MAPS database and was tested on classic and jazz recordings from the RWC database, as well as on recordings from a Disklavier piano. A comparison with several state-of-the-art systems is provided using a variety of error metrics, where encouraging results are indicated.
机译:本文提出了一种基于联合多重F0估计的音乐信号自动转录方法。作为时频表示,采用了恒定Q谐振器​​时频图像,而在预处理步骤中采用了基于粉红噪声假设的新颖噪声抑制技术。在多重F0估计阶段,计算最佳调谐和不谐度参数,并提出显着性函数以选择音调候选。对于每个音高候选组合,使用重叠的部分处理过程,该过程基于对数频域的新颖频谱包络估计过程,以便计算候选音高的谐波包络。为了为每个时间帧选择最佳音高组合,提出了一种得分函数,该函数结合了候选音高的频谱和时间特性,并且还旨在抑制谐波误差。对于后处理,采用了隐马尔可夫模型(HMM)和在MIDI数据上训练的条件随机字段(CRF),以提高转录准确性。该系统接受过MAPS数据库中孤立的钢琴声音的培训,并经过RWC数据库中的经典和爵士录音以及Disklavier钢琴的录音测试。使用各种错误度量标准与几种最新系统进行了比较,并指出了令人鼓舞的结果。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号