High-resolution sinusoidal analysis for resolving harmonic collisions in music audio signal processing

Abstract

Many music signals can largely be considered an additive combination of multiple sources, such as musical instruments or voice. If the musical sources are pitched instruments, the spectra they produce are predominantly harmonic, and are thus well suited to an additive sinusoidal model. However, due to resolution limits inherent in time-frequency analyses, when the harmonics of multiple sources occupy equivalent time-frequency regions, their individual properties are additively combined in the time-frequency representation of the mixed signal. Any such time-frequency point in a mixture where multiple harmonics overlap produces a single observation from which the contributions owed to each of the individual harmonics cannot be trivially deduced. These overlaps are referred to as overlapping partials or harmonic collisions. If one wishes to infer some information about individual sources in music mixtures, the information carried in regions where collided harmonics exist becomes unreliable due to interference from other sources. This interference has ramifications in a variety of music signal processing applications such as multiple fundamental frequency estimation, source separation, and instrumentation identification.

This thesis addresses harmonic collisions in music signal processing applications. As a solution to the harmonic collision problem, a class of signal subspace-based high-resolution sinusoidal parameter estimators is explored. Specifically, the direct matrix pencil method, or equivalently, the Estimation of Signal Parameters via Rotational Invariance Techniques (ESPRIT) method, is used with the goal of producing estimates of the salient parameters of individual harmonics that occupy equivalent time-frequency regions. This estimation method is adapted here to be applicable to time-varying signals such as musical audio. While high-resolution methods have been previously explored in the context of music signal processing, previous work has not addressed whether or not such methods truly produce high-resolution sinusoidal parameter estimates in real-world music audio signals. Therefore, this thesis answers the question of whether high-resolution sinusoidal parameter estimators are really high-resolution for real music signals.

This work directly explores the capabilities of this form of sinusoidal parameter estimation to resolve collided harmonics. The capabilities of this analysis method are also explored in the context of music signal processing applications. Potential benefits of high-resolution sinusoidal analysis are examined in experiments involving multiple fundamental frequency estimation and audio source separation. This work shows that there are indeed benefits to high-resolution sinusoidal analysis in music signal processing applications, especially when compared to methods that produce sinusoidal parameter estimates based on more traditional time-frequency representations. The benefits of this form of sinusoidal analysis are made most evident in multiple fundamental frequency estimation applications, where substantial performance gains are seen. High-resolution analysis in the context of computational auditory scene analysis-based source separation shows similar performance to existing comparable methods.
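To make the idea of subspace-based estimation concrete, the following is a minimal sketch of textbook ESPRIT applied to a stationary, noise-free test signal. It is not the time-varying adaptation developed in the thesis, whose details the abstract does not give. The function name `esprit_frequencies`, the choice of a roughly square Hankel matrix, and the test signal (two partials at 440 Hz and 445 Hz in a 64 ms frame sampled at 8 kHz) are all assumptions made for illustration. At this frame length the DFT bin spacing is about 15.6 Hz, so the two partials would fall into the same spectral peak in a conventional short-time Fourier analysis.

```python
import numpy as np

def esprit_frequencies(x, n_components, fs):
    """Textbook ESPRIT: estimate the frequencies (Hz) of n_components
    complex exponentials in x. A real sinusoid counts as two components."""
    x = np.asarray(x)
    n = len(x)
    rows = n // 2
    cols = n - rows + 1
    # Hankel data matrix with H[i, j] = x[i + j].
    idx = np.arange(rows)[:, None] + np.arange(cols)[None, :]
    hankel = x[idx]
    # The leading left singular vectors span the signal subspace.
    u, _, _ = np.linalg.svd(hankel, full_matrices=False)
    signal_subspace = u[:, :n_components]
    # Rotational invariance: the subspace shifted down by one row equals
    # the unshifted subspace times a matrix whose eigenvalues are the poles.
    phi, *_ = np.linalg.lstsq(signal_subspace[:-1], signal_subspace[1:], rcond=None)
    poles = np.linalg.eigvals(phi)
    return np.angle(poles) * fs / (2 * np.pi)

# Hypothetical "collision": two partials only 5 Hz apart in one short frame.
fs = 8000.0
t = np.arange(512) / fs
x = np.cos(2 * np.pi * 440.0 * t) + 0.5 * np.cos(2 * np.pi * 445.0 * t + 0.3)

freqs = esprit_frequencies(x, n_components=4, fs=fs)
est = np.sort(freqs[freqs > 0])  # keep the positive-frequency half
print(np.round(est, 3))
```

In this idealized setting the two estimates should land very close to 440 Hz and 445 Hz. Whether that resolution carries over to real recordings with noise, amplitude and frequency modulation, and an unknown number of partials is the question the thesis sets out to answer.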

Bibliographic record

  • Author

    Ehmann Andreas;

  • Author affiliation
  • Year: 2011
  • Total pages
  • Original format: PDF
  • Language: English
  • Chinese Library Classification
