首页> 外文会议>IEEE International Workshop on Multimedia Signal Processing >Multi-pitch estimation based on sparse representation with pre-screened dictionary
【24h】

Multi-pitch estimation based on sparse representation with pre-screened dictionary

机译:基于漏洞字典的稀疏表示的多音高估计

获取原文

摘要

This paper presents a study on frame-level multi-pitch estimation (MPE) for polyphonic piano music based on sparse representation approach. In this approach, a multi-pitch input spectrum is represented by a sparse linear combination of a large number of spectrum exemplars in a given dictionary. By estimating the sparse weight vector and identifying its non-zero elements, a set of possible pitch candidates can be found. This study is focused mainly on the construction and optimization of the exemplar dictionary. A complete dictionary is first built from single-note piano music. We propose to perform prescreening on the dictionary by which the exemplars of the notes belonging to certain octaves and chromas are excluded during the subsequent estimation. Experimental results show that the pre-screening process not only helps in reducing the computational complexity, but also leads to more accurate pitch estimation. On the formulation of sparse estimation problem, we introduce a probabilistic assumption on the estimation error, such that the estimation is converted into a constrained convex quadratic programming problem. We also propose to use spectral combination as a new scheme of pitch determination from the estimated sparse weights.
机译:本文介绍了基于稀疏表示方法的多关钢琴音乐的帧级多音高估计(MPE)研究。在这种方法中,多间距输入频谱由给定字典中大量频谱示例的稀疏线性组合表示。通过估计稀疏权重向量并识别其非零元素,可以找到一组可能的音高候选。本研究主要集中在示例性词典的构建和优化。完整的词典是由单票钢琴音乐建造的。我们建议在随后的估计期间排除属于某些八度曲线和核数的音符的示例,在字典上执行预先筛选。实验结果表明,预筛分过程不仅有助于降低计算复杂性,而且导致更准确的音高估计。在稀疏估计问题的制定上,我们对估计误差引入概率的假设,使得估计被转换为约束的凸二次编程问题。我们还建议使用光谱组合作为从估计的稀疏重量的音高测定的新方案。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号