首页> 外文学位 >Harmonic grouping pitch detection and application to speech recognition systems.
【24h】

Harmonic grouping pitch detection and application to speech recognition systems.

机译:谐波分组音调检测及其在语音识别系统中的应用。

获取原文
获取原文并翻译 | 示例

摘要

This work has successfully achieved a robust, fast and accurate pitch detection system called the Harmonic Grouping Pitch Detection. This system is able to perform pitch detection on a wide variety of signals such as speech, signing, whistling and musical instruments. On a 1.5Ghz AMD processor, the running time is 10x faster than real time. Using the CSTR database and the GPE measure of accuracy we have shown that the accuracy of Harmonic Grouping pitch detection is higher than other common systems.; The front-end of Harmonic Grouping pitch detection has been designed to match the front-end of state of the art speech recognition systems. Therefore, the computation requirements such as windowing and FFT calculation can be shared if the two systems are combined into a single application. This feature makes Harmonic Grouping an ideal choice for utilizing pitch information in a speech recognition system. Finally, two methods for utilizing pitch information in speech front-ends are presented to improve the recognition accuracy. These methods are: "Pitch-dependent models" and "Harmonic Density Normalization (HDN)". These methods can be utilized together in a speech recognition system and are shown to improve the recognition accuracy.
机译:这项工作成功地实现了一种强大,快速且准确的音高检测系统,称为“谐波分组音高检测”。该系统能够对各种信号(例如语音,签名,口哨和乐器)执行音高检测。在1.5Ghz AMD处理器上,运行时间比实时速度快10倍。使用CSTR数据库和GPE精度度量,我们已经表明,谐波分组音高检测的精度高于其他常见系统。谐波分组音调检测的前端已被设计为与先进的语音识别系统的前端相匹配。因此,如果将两个系统组合为一个应用程序,则可以共享诸如开窗和FFT计算之类的计算要求。此功能使“谐波分组”成为在语音识别系统中利用音调信息的理想选择。最后,提出了两种在语音前端中利用音调信息的方法来提高识别精度。这些方法是:“与音高有关的模型”和“谐波密度归一化(HDN)”。这些方法可以在语音识别系统中一起使用,并且可以提高识别精度。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号