A pitch extraction algorithm tuned for automatic speech recognition

机译：调整音调提取算法以实现自动语音识别

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

In this paper we present an algorithm that produces pitch and probability-of-voicing estimates for use as features in automatic speech recognition systems. These features give large performance improvements on tonal languages for ASR systems, and even substantial improvements for non-tonal languages. Our method, which we are calling the Kaldi pitch tracker (because we are adding it to the Kaldi ASR toolkit), is a highly modified version of the getf0 (RAPT) algorithm. Unlike the original getf0 we do not make a hard decision whether any given frame is voiced or unvoiced; instead, we assign a pitch even to unvoiced frames while constraining the pitch trajectory to be continuous. Our algorithm also produces a quantity that can be used as a probability of voicing measure; it is based on the normalized autocorrelation measure that our pitch extractor uses. We present results on data from various languages in the BABEL project, and show a large improvement over systems without tonal features and systems where pitch and POV information was obtained from SAcC or getf0.

机译：在本文中，我们提出了一种算法，该算法可产生音高和发声概率估计值，以用作自动语音识别系统中的功能。这些功能为ASR系统的音调语言带来了很大的性能改进，甚至为非音调语言带来了实质性的改进。我们称为Kaldi音调跟踪器的方法（因为我们将其添加到Kaldi ASR工具箱中）是getf0（RAPT）算法的高度修改版本。与原始的getf0不同，我们不会对任何给定的帧是浊音还是清音做出艰难的决定。取而代之的是，我们在将音高轨迹限制为连续的同时，甚至将音高分配给未发声的帧。我们的算法还产生了可以用作发声测量概率的数量。它基于我们的音高提取器使用的归一化自相关度量。我们介绍了BABEL项目中来自各种语言的数据结果，并显示了对没有音调特征的系统和从SAcC或getf0获得音高和POV信息的系统的巨大改进。

著录项

来源
《IEEE International Conference on Acoustics, Speech and Signal Processing》|2014年|2494-2498|共5页
会议地点
作者
Ghahremani Pegah; BabaAli Bagher; Povey Daniel; Riedhammer Korbinian; Trmal Jan; Khudanpur Sanjeev;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类
关键词
Automatic Speech Recognition; Pitch; Probability Of Voicing; Tone;

机译：自动语音识别;沥青;发声的可能性;音;

相似文献

外文文献
中文文献
专利

1. Machine learning based sample extraction for automatic speech recognition using dialectal Assamese speech [J] . Agarwalla Swapna, Sarma Kandarpa Kumar Neural Networks: The Official Journal of the International Neural Network Society . 2016,第Null期

机译：基于机器学习的样本提取用于使用方言阿萨姆语语音进行自动语音识别
2. Machine learning based sample extraction for automatic speech recognition using dialectal Assamese speech [J] . Agarwalla Swapna, Sarma Kandarpa Kumar Neural Networks: The Official Journal of the International Neural Network Society . 2016,第Null期

机译：基于机器学习的样本提取自动语音识别使用方言issamese语言
3. Feature Extraction Based on Speech Attractors in the Reconstructed Phase Space for Automatic Speech Recognition Systems [J] . Yasser Shekofteh, Farshad Almasganj ETRI journal . 2013,第1期

机译：自动语音识别系统重构相空间中基于语音吸引子的特征提取
4. A PITCH EXTRACTION ALGORITHM TUNED FOR AUTOMATIC SPEECH RECOGNITION [C] . Pegah Ghahremani, Bagher BabaAli, Daniel Povey, IEEE International Conference on Acoustics, Speech and Signal Processing . 2014

机译：调整自动语音识别的俯仰提取算法
5. Advanced feature extraction algorithms for automatic fingerprint recognition systems. [D] . Wu, Chaohong. 2007

机译：用于自动指纹识别系统的高级特征提取算法。
6. Multiple Adaptive Neuro-Fuzzy Inference System with Automatic Features Extraction Algorithm for Cervical Cancer Recognition [O] . Mohammad Subhi Al-batah, Nor Ashidi Mat Isa, Mohammad Fadel Klaib, 2014

机译：具有特征提取算法的多自适应神经模糊推理系统对宫颈癌的识别
7. AUTOMATIC AND ACCUARTE PITCH MARKING OF SPEECH SIGNAL USING AN EXPERT SYSTEM BASED ON LOGICAL COMBINATIONS OF DIFFERENT ALGORITHMS OUTPUTS [O] . Ashouri Keyvan, Savoji Mohammad Hassan 2004

机译：基于不同算法输出逻辑组合的专家系统的语音信号自动准确标记

A pitch extraction algorithm tuned for automatic speech recognition

摘要

著录项

相似文献

相关主题

期刊订阅