首页> 外文会议>IEEE International Conference on Acoustics, Speech and Signal Processing >An evaluation of score-informed methods for estimating fundamental frequency and power from polyphonic audio
【24h】

An evaluation of score-informed methods for estimating fundamental frequency and power from polyphonic audio

机译:对基于分数的方法评估和弦音频的基本频率和功率的评估

获取原文

摘要

Robust extraction of performance data from polyphonic musical performances requires precise frame-level estimation of fundamental frequency (f) and power. This paper evaluates a new score-guided approach to f and power estimation in polyphonic audio and compares the use of four different input features: the central bin frequencies of the spectrogram, the instantaneous frequency, and two variants of a high resolution spectral analysis. These four features were evaluated on four-part multi-track ensemble recordings, consisting of either four vocalists or bassoon, clarinet, saxophone, and violin (the Bach10 data set) created from polyphonic mixes of the monophonic tracks both with and without artificial reverberation. Score information was used to identify time-frequency regions of interest in the polyphonic mixes for each note in a corresponding aligned score, from which f and power estimates were made. The approach was able to recover ground truth f within 20 cents on average in reverberation and power within 5 dB for anechoic mixtures, but only within 10 dB for reverberant.
机译:从和弦音乐表演中提取表演数据需要对基本频率(f)和功率进行精确的帧级估计。本文评估了复音音频中f和功率估计的一种新的分数引导方法,并比较了四种不同输入功能的使用:频谱图的中心bin频率,瞬时频率以及高分辨率频谱分析的两个变体。这四个功能是在多部分的多声道合奏录音中进行评估的,该录音由四名主唱或巴松管,单簧管,萨克斯管和小提琴(Bach10数据集)组成,这些声音是由带有或不带有人工混响的单声道复音混合而成的。分数信息用于在相应的对齐分数中识别每个音符在复音混音中感兴趣的时频区域,并据此估算f和功率。该方法能够在混响中平均恢复20%的地面真值f,对于无回声混合物,其功率恢复在5 dB以内,而对于混响,则只能恢复10 dB以内。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号