首页> 外文会议>International Symposium on Computer Music Modeling and Retrieval(CMMR 2004); 20040526-29; Esbjerg(DK) >A New Probabilistic Spectral Pitch Estimator: Exact and MCMC-approximate Strategies
【24h】

A New Probabilistic Spectral Pitch Estimator: Exact and MCMC-approximate Strategies

机译:一种新的概率谱基音估计器:精确的MCMC近似策略

获取原文
获取原文并翻译 | 示例

摘要

We propose a robust probabilistic pitch (f_0) estimator in the presence of interference and low SNR conditions, without the computational requirements of optimal time-domain methods. Our analysis is driven by sinusoidal peaks extracted by a windowed STFT. Given f_0 and a reference amplitude (A_0), peak frequency/amplitude observations are modeled probabilistically in order to be robust to undetected harmonics, spurious peaks, skewed peak estimates, and inherent deviations from ideal or other assumed harmonic structure. Parameters f_0 and A_0 are estimated by maximizing the observations' likelihood (here A_0 is treated as a nuisance parameter). Some previous spectral pitch estimation methods, most notably the work of Goldstein, introduce a probabilistic framework with a corresponding maximum likelihood approach. However, our method significantly extends the latter in order to guarantee robustness under adverse conditions, facilitating possible extensions to the polyphonic context. For instance, our addressing of spurious as well as undetected peaks averts a sudden breakdown under low-SNR conditions. Furthermore, our assimilation of peak amplitudes facilitates the incorporation of timbral knowledge. Our method utilizes a hidden, discrete-valued descriptor variable identifying spurious/undetected peaks. The likelihood evaluation, requiring a computationally unwieldy summation over all descriptor states, is successfully approximated by a MCMC traversal chiefly amongst high-probability states. The MCMC traversal obtains virtually identical evaluations for the entire likelihood surface at a fraction of the computational cost.
机译:我们在存在干扰和低SNR条件的情况下提出了鲁棒的概率基音(f_0)估计器,而没有最佳时域方法的计算要求。我们的分析是由窗口STFT提取的正弦峰驱动的。在给定f_0和参考振幅(A_0)的情况下,概率模型化峰值频率/振幅观察值,以使其对未检测到的谐波,杂散峰,偏斜的峰值估计以及与理想或其他假定谐波结构的固有偏差具有鲁棒性。通过最大化观测值的可能性来估计参数f_0和A_0(此处将A_0视为令人讨厌的参数)。某些先前的频谱音高估计方法(最著名的是Goldstein的工作)引入了概率框架以及相应的最大似然方法。但是,我们的方法大大扩展了后者,以保证在不利条件下的鲁棒性,从而有利于复音上下文的可能扩展。例如,我们对杂散以及未检测到的峰的处理避免了在低SNR条件下的突然击穿。此外,我们对峰值幅度的吸收有助于整合音色知识。我们的方法利用一个隐藏的,离散值的描述符变量来识别虚假/未检测到的峰。需要在所有描述符状态上进行计算上笨拙的求和的似然性评估主要通过在高概率状态中进行的MCMC遍历成功地进行了近似。 MCMC遍历仅需计算量的一小部分,即可获得对整个似然面的几乎相同的评估。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号