【24h】

Improving Thai Spelling Recognition with Tone Features

机译:通过音调功能提高泰国拼写识别

获取原文
获取原文并翻译 | 示例

摘要

Spelling recognition has been used for several purposes, such as enhancing speech recognition systems and implementing name retrieval systems. Tone information is an important clue, in addition to phones, for recognizing speeches in tonal languages. In this paper, we present a method to improve accuracy of spelling recognition in Thai, a tonal language, by incorporating tone-related acoustic features to a well-known front-end feature named Perceptual Linear Prediction Coefficients (PLP). The proposed method makes use of three kinds of tone information: fundamental frequency (pitch), pitch delta and pitch acceleration, to enhance the original features. Compared to the baseline result gained from the original feature, our HMMs-based recognition model shows improvement of 1.73%, 2.85% and 3.16% of letter accuracy for close-type, mix-type and open-type language models, respectively.
机译:拼写识别已用于多种目的,例如增强语音识别系统和实现名称检索系统。除电话外,音调信息是识别音调语言语音的重要线索。在本文中,我们提出了一种方法,通过将与音有关的声学特征合并到称为“感知线性预测系数”(PLP)的前端功能中,从而提高了泰语(一种声调语言)的拼写识别的准确性。所提出的方法利用三种音调信息:基频(音调),音调增量和音调加速度来增强原始特征。与从原始功能获得的基线结果相比,我们的基于HMM的识别模型显示出封闭型,混合型和开放型语言模型的字母准确性分别提高了1.73%,2.85%和3.16%。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号