首页> 外文会议>IEEE International Workshop on Multimedia Signal Processing >Multi-pitch estimation based on sparse representation with pre-screened dictionary
【24h】

Multi-pitch estimation based on sparse representation with pre-screened dictionary

机译:带有预筛选字典的基于稀疏表示的多音高估计

获取原文

摘要

This paper presents a study on frame-level multi-pitch estimation (MPE) for polyphonic piano music based on sparse representation approach. In this approach, a multi-pitch input spectrum is represented by a sparse linear combination of a large number of spectrum exemplars in a given dictionary. By estimating the sparse weight vector and identifying its non-zero elements, a set of possible pitch candidates can be found. This study is focused mainly on the construction and optimization of the exemplar dictionary. A complete dictionary is first built from single-note piano music. We propose to perform prescreening on the dictionary by which the exemplars of the notes belonging to certain octaves and chromas are excluded during the subsequent estimation. Experimental results show that the pre-screening process not only helps in reducing the computational complexity, but also leads to more accurate pitch estimation. On the formulation of sparse estimation problem, we introduce a probabilistic assumption on the estimation error, such that the estimation is converted into a constrained convex quadratic programming problem. We also propose to use spectral combination as a new scheme of pitch determination from the estimated sparse weights.
机译:本文提出了一种基于稀疏表示方法的复音钢琴音乐帧级多音高估计(MPE)的研究。在这种方法中,多音高输入频谱由给定字典中大量频谱示例的稀疏线性组合表示。通过估计稀疏权重向量并识别其非零元素,可以找到一组可能的音高候选。这项研究主要集中在示例词典的构建和优化上。首先从单音符钢琴音乐构建完整的字典。我们建议对字典执行预筛选,通过该字典,可以在后续估计期间排除属于某些八度和色度的音符的样本。实验结果表明,预筛选过程不仅有助于降低计算复杂度,而且可以使音高估计更加准确。在稀疏估计问题的表述上,我们引入了一个关于估计误差的概率假设,从而将估计转化为约束凸二次规划问题。我们还建议使用频谱组合作为根据估计的稀疏权重确定音高的新方案。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号