首页> 外文学位 >Whole word phonetic displays for speech articulation training.
【24h】

Whole word phonetic displays for speech articulation training.

机译:用于语音清晰度训练的全字语音显示。

获取原文
获取原文并翻译 | 示例

摘要

The main objective of this dissertation is to investigate and develop speech recognition technologies for speech training for people with hearing impairments. During the course of this work, a computer aided speech training system for articulation speech training was also designed and implemented. The speech training system places emphasis on displays to improve children's pronunciation of isolated Consonant-Vowel-Consonant (CVC) words, with displays at both the phonetic level and whole word level. This dissertation presents two hybrid methods for combining Hidden Markov Models (HMMs) and Neural Networks (NNs) for speech recognition. The first method uses NN outputs as posterior probability estimators for HMMs. The second method uses NNs to transform the original speech features to normalized features with reduced correlation. Based on experimental testing, both of the hybrid methods give higher accuracy than standard HMM methods. The second method, using the NN to create normalized features, outperforms the first method in terms of accuracy. Several graphical displays were developed to provide real time visual feedback to users, to help them to improve and correct their pronunciations.
机译:本文的主要目的是研究和开发语音识别技术,用于听觉障碍者的语音训练。在这项工作的过程中,还设计并实现了用于发音语音训练的计算机辅助语音训练系统。语音训练系统侧重于显示,以提高儿童对孤立的辅音,辅音和辅音(CVC)单词的发音,同时在语音级别和整个单词级别上进行显示。本文提出了两种结合隐马尔可夫模型和神经网络进行语音识别的混合方法。第一种方法使用NN输出作为HMM的后验概率估计器。第二种方法使用NN将原始语音特征转换为相关性降低的归一化特征。根据实验测试,两种混合方法均比标准HMM方法具有更高的准确性。使用NN创建归一化特征的第二种方法在准确性方面优于第一种方法。开发了几种图形显示以向用户提供实时视觉反馈,以帮助他们改善和纠正其发音。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号