Reducing F0 Frame Error of F0 tracking algorithms under noisy conditions with an unvoiced/voiced classification frontend

机译：使用清晰/清晰分类前端减少嘈杂条件下F0跟踪算法的F0帧错误

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

In this paper, we propose an F0 Frame Error (FFE) metric which combines Gross Pitch Error (GPE) and Voicing Decision Error (VDE) to objectively evaluate the performance of fundamental frequency (F0) tracking methods. A GPE-VDE curve is then developed to show the trade-off between GPE and VDE. In addition, we introduce a model-based Unvoiced/Voiced (U/V) classification frontend which can be used by any F0 tracking algorithm. In the U/V classification, we train speaker independent U/V models, and then adapt them to speaker dependent models in an unsupervised fashion. The U/V classification result is taken as a mask for F0 tracking. Experiments using the KEELE corpus with additive noise show that our statistically-based U/V classifier can reduce VDE and FFE for the pitch tracker TEMPO in both white and babble noise conditions, and that minimizing FFE instead of VDE results in a reduction in error rates for a number of F0 tracking algorithms, especially in babble noise.

机译：在本文中，我们提出了一种F0帧误差（FFE）指标，该指标结合了总音高误差（GPE）和发声决策误差（VDE）来客观地评估基本频率（F0）跟踪方法的性能。然后绘制一条GPE-VDE曲线以显示GPE和VDE之间的权衡。此外，我们介绍了基于模型的清音/清音（U / V）分类前端，该前端可以被任何F0跟踪算法使用。在U / V分类中，我们训练独立于扬声器的U / V模型，然后以不受监督的方式将它们适应于依赖扬声器的模型。 U / V分类结果用作F0跟踪的掩码。使用带有附加噪声的KEELE语料库进行的实验表明，基于统计的U / V分类器可以在白噪声和ba声噪声条件下降低音调跟踪器TEMPO的VDE和FFE，并且最小化FFE而不是VDE可以降低错误率适用于许多F0跟踪算法，尤其是在ba声中。

著录项

来源
《IEEE International Conference on Acoustics, Speech and Signal Processing;ICASSP 2009》|2009年|3969-3972|共4页
会议地点 Taipei(CT);Taipei(CT)
作者
Wei Chu; Alwan, A.;
展开▼
作者单位

Dept. of Electr. Eng. Univ. of California Los Angeles CA;

展开▼
会议组织
原文格式 PDF
正文语种
中图分类
关键词
signal classification; speaker recognition; frame error metric; fundamental frequency tracking; gross pitch error; noisy conditions; pitch tracker; speaker dependent model; tracking algorithm; unvoiced/voiced classification frontend; voicing decision error; Evaluation Metrics; Fundamental Frequency; Noise Robustness; Pitch Tracking; Unvoiced/Voiced Classification;

机译：信号分类说话人识别；帧错误度量；基本频率跟踪；总螺距误差嘈杂的条件；音调跟踪器；说话者相关模型跟踪算法；清浊/清浊分类前端；语音决策错误；评估指标；基本频率；噪声鲁棒性；音调跟踪；清音/清音分类;

相似文献

外文文献
中文文献
专利

1. A Method for Voiced/Unvoiced Classification of Noisy Speech by Analyzing Time-Domain Features of Spectrogram Image [J] . Kazi Mahmudul Hassan, Ekramul Hamid, Khademul Islam Molla Science Journal of Circuits, Systems and Signal Processing . 2017,第2期

机译：分析频谱图图像时域特征的语音语音清浊分类方法
2. Development of an F0 control model based on F0 dynamic characteristics for singing-voice synthesis [J] . Takeshi Saitou, Masashi Unoki, Masato Akagi Speech Communication . 2005,第3a4期

机译：基于F0动态特性的F0控制模型的歌声合成
3. An exploration of the accentuation effect: errors in memory for voice fundamental frequency (F0) and speech rate [J] . Gous Georgina, Dunn Andrew K., Baguley Thom, Language, cognition and neuroscience . 2018,第1期

机译：探索突发效果：语音基频（F0）和语音率内存中的错误
4. REDUCING FO FRAME ERROR OF FO TRACKING ALGORITHMS UNDER NOISY CONDITIONS WITH AN UNVOICED/VOICED CLASSIFICATION FRONTEND [C] . Wei Chu, Abeer Alwan IEEE International Conference on Acoustics, Speech, and Signal Processing . 2009

机译：在嘈杂的条件下减少噪声跟踪算法的帧误差，具有清音/浊音前端
5. Cognition Modulates Neural Responsiveness During Voluntary Voice F0 Control [D] . Atkins, Christopher 2012

机译：自愿语音F0控制过程中的认知调节神经反应。
6. Comparison of voice F0 responses to pitch-shift onset and offset conditions (L) [O] . Charles R. Larson, Theresa A. Burnett, Jay J. Bauer, -1

机译：语音F0对音高开始和偏移条件的响应比较（L）
7. Reducing f0 frame error of f0 tracking algorithms under noisy conditions with an unvoiced/voiced classification frontend [O] . Wei Chu, Abeer Alwan 2015

机译：使用清音/浊音分类前端减少嘈杂条件下f0跟踪算法的f0帧误差

Reducing F0 Frame Error of F0 tracking algorithms under noisy conditions with an unvoiced/voiced classification frontend

摘要

著录项

相似文献

相关主题

期刊订阅