首页> 外文学位 >Harmonic grouping pitch detection and application to speech recognition systems.

【24h】

Harmonic grouping pitch detection and application to speech recognition systems.

机译：谐波分组音调检测及其在语音识别系统中的应用。

获取原文

获取原文并翻译 | 示例

AI期刊论文写作 >>

页面导航

摘要
著录项
引文网络
相似文献
相关主题

摘要

This work has successfully achieved a robust, fast and accurate pitch detection system called the Harmonic Grouping Pitch Detection. This system is able to perform pitch detection on a wide variety of signals such as speech, signing, whistling and musical instruments. On a 1.5Ghz AMD processor, the running time is 10x faster than real time. Using the CSTR database and the GPE measure of accuracy we have shown that the accuracy of Harmonic Grouping pitch detection is higher than other common systems.; The front-end of Harmonic Grouping pitch detection has been designed to match the front-end of state of the art speech recognition systems. Therefore, the computation requirements such as windowing and FFT calculation can be shared if the two systems are combined into a single application. This feature makes Harmonic Grouping an ideal choice for utilizing pitch information in a speech recognition system. Finally, two methods for utilizing pitch information in speech front-ends are presented to improve the recognition accuracy. These methods are: "Pitch-dependent models" and "Harmonic Density Normalization (HDN)". These methods can be utilized together in a speech recognition system and are shown to improve the recognition accuracy.

机译：这项工作成功地实现了一种强大，快速且准确的音高检测系统，称为“谐波分组音高检测”。该系统能够对各种信号（例如语音，签名，口哨和乐器）执行音高检测。在1.5Ghz AMD处理器上，运行时间比实时速度快10倍。使用CSTR数据库和GPE精度度量，我们已经表明，谐波分组音高检测的精度高于其他常见系统。谐波分组音调检测的前端已被设计为与先进的语音识别系统的前端相匹配。因此，如果将两个系统组合为一个应用程序，则可以共享诸如开窗和FFT计算之类的计算要求。此功能使“谐波分组”成为在语音识别系统中利用音调信息的理想选择。最后，提出了两种在语音前端中利用音调信息的方法来提高识别精度。这些方法是：“与音高有关的模型”和“谐波密度归一化（HDN）”。这些方法可以在语音识别系统中一起使用，并且可以提高识别精度。

著录项

作者
Mohajer, Keyvan.;
展开▼
作者单位

Stanford University.;

展开▼
授予单位 Stanford University.;
学科 Engineering Electronics and Electrical.; Computer Science.
学位 Ph.D.
年度 2007
页码 101 p.
总页数 101
原文格式 PDF
正文语种 eng
中图分类无线电电子学、电信技术;自动化技术、计算机技术;
关键词

相似文献

外文文献
中文文献
专利

1. Pitch- and Formant-Based Order Adaptation of the Fractional Fourier Transform and Its Application to Speech Recognition [J] . Hui Yin, Climent Nadeu, Volker Hohmann EURASIP Journal on Audio, Speech, and Music Processing . 2010,第1期
2. Pitch- and Formant-Based Order Adaptation of the Fractional Fourier Transform and Its Application to Speech Recognition [J] . Hui Yin, Climent Nadeu, Volker Hohmann EURASIP journal on audio, speech, and music processing . 2009,第9期

机译：基于基音和共振峰的分数阶傅里叶变换的阶数自适应及其在语音识别中的应用
3. Pseudo pitch synchronous analysis of speech with applications to speaker recognition [J] . Zilca R.D., Kingsbury B., Navratil J., IEEE transactions on audio, speech and language processing . 2006,第2期

机译：语音的伪音高同步分析及其在说话人识别中的应用
4. INTEGRATED PITCH AND MFCC EXTRACTION FOR SPEECH RECONSTRUCTION AND SPEECH RECOGNITION APPLICATIONS [C] . Xu Shao, Ben Milner, Stephen Cox European Conference on Speech Communication and Technology - EUROSPEECH 2003(INTERSPEECH 2003) vol.3; 20030901-04; Geneva(CH) . 2003

机译：用于语音重构和语音识别应用程序的集成音高和MFCC提取

Harmonic grouping pitch detection and application to speech recognition systems.

摘要

著录项

引文网络

相似文献

相关主题

期刊订阅