首页> 外文会议>IEEE International Conference on Acoustics, Speech and Signal Processing;ICASSP 2009 >Reducing F0 Frame Error of F0 tracking algorithms under noisy conditions with an unvoiced/voiced classification frontend
【24h】

Reducing F0 Frame Error of F0 tracking algorithms under noisy conditions with an unvoiced/voiced classification frontend

机译:使用清晰/清晰分类前端减少嘈杂条件下F0跟踪算法的F0帧错误

获取原文

摘要

In this paper, we propose an F0 Frame Error (FFE) metric which combines Gross Pitch Error (GPE) and Voicing Decision Error (VDE) to objectively evaluate the performance of fundamental frequency (F0) tracking methods. A GPE-VDE curve is then developed to show the trade-off between GPE and VDE. In addition, we introduce a model-based Unvoiced/Voiced (U/V) classification frontend which can be used by any F0 tracking algorithm. In the U/V classification, we train speaker independent U/V models, and then adapt them to speaker dependent models in an unsupervised fashion. The U/V classification result is taken as a mask for F0 tracking. Experiments using the KEELE corpus with additive noise show that our statistically-based U/V classifier can reduce VDE and FFE for the pitch tracker TEMPO in both white and babble noise conditions, and that minimizing FFE instead of VDE results in a reduction in error rates for a number of F0 tracking algorithms, especially in babble noise.
机译:在本文中,我们提出了一种F0帧误差(FFE)指标,该指标结合了总音高误差(GPE)和发声决策误差(VDE)来客观地评估基本频率(F0)跟踪方法的性能。然后绘制一条GPE-VDE曲线以显示GPE和VDE之间的权衡。此外,我们介绍了基于模型的清音/清音(U / V)分类前端,该前端可以被任何F0跟踪算法使用。在U / V分类中,我们训练独立于扬声器的U / V模型,然后以不受监督的方式将它们适应于依赖扬声器的模型。 U / V分类结果用作F0跟踪的掩码。使用带有附加噪声的KEELE语料库进行的实验表明,基于统计的U / V分类器可以在白噪声和ba声噪声条件下降低音调跟踪器TEMPO的VDE和FFE,并且最小化FFE而不是VDE可以降低错误率适用于许多F0跟踪算法,尤其是在ba声中。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号