Whole word phonetic displays for speech articulation training.

机译：用于语音清晰度训练的全字语音显示。

获取原文

获取原文并翻译 | 示例

页面导航

摘要
著录项
相似文献
相关主题

摘要

The main objective of this dissertation is to investigate and develop speech recognition technologies for speech training for people with hearing impairments. During the course of this work, a computer aided speech training system for articulation speech training was also designed and implemented. The speech training system places emphasis on displays to improve children's pronunciation of isolated Consonant-Vowel-Consonant (CVC) words, with displays at both the phonetic level and whole word level. This dissertation presents two hybrid methods for combining Hidden Markov Models (HMMs) and Neural Networks (NNs) for speech recognition. The first method uses NN outputs as posterior probability estimators for HMMs. The second method uses NNs to transform the original speech features to normalized features with reduced correlation. Based on experimental testing, both of the hybrid methods give higher accuracy than standard HMM methods. The second method, using the NN to create normalized features, outperforms the first method in terms of accuracy. Several graphical displays were developed to provide real time visual feedback to users, to help them to improve and correct their pronunciations.

机译：本文的主要目的是研究和开发语音识别技术，用于听觉障碍者的语音训练。在这项工作的过程中，还设计并实现了用于发音语音训练的计算机辅助语音训练系统。语音训练系统侧重于显示，以提高儿童对孤立的辅音，辅音和辅音（CVC）单词的发音，同时在语音级别和整个单词级别上进行显示。本文提出了两种结合隐马尔可夫模型和神经网络进行语音识别的混合方法。第一种方法使用NN输出作为HMM的后验概率估计器。第二种方法使用NN将原始语音特征转换为相关性降低的归一化特征。根据实验测试，两种混合方法均比标准HMM方法具有更高的准确性。使用NN创建归一化特征的第二种方法在准确性方面优于第一种方法。开发了几种图形显示以向用户提供实时视觉反馈，以帮助他们改善和纠正其发音。

著录项

作者
Meng, Fansheng.;
展开▼
作者单位

Old Dominion University.;

展开▼
授予单位 Old Dominion University.;
学科 Engineering Electronics and Electrical.; Computer Science.
学位 Ph.D.
年度 2006
页码 112 p.
总页数 112
原文格式 PDF
正文语种 eng
中图分类无线电电子学、电信技术;自动化技术、计算机技术;
关键词

相似文献

外文文献
中文文献
专利

1. Dual routes for verbal repetition: Articulation-based and acoustic-phonetic codes for pseudoword and word repetition, respectively [J] . YooS., ChungJ.-Y., JeonH.-A., Brain and language . 2012,第1期

机译：言语重复的双重途径：分别针对假单词和单词重复的基于发音的代码和语音代码
2. Speech motor planning in the context of phonetically similar words: Evidence from apraxia of speech and aphasia [J] . Mailend Marja-Liisa, Maas Edwin, Beeson Pelagie M., Neuropsychologia . 2019,第期

机译：语音相似的词语中的语音电机规划：来自言语和失语症的牺牲品的证据
3. Visibility of speech articulation enhances auditory phonetic convergence [J] . Dias James W., Rosenblum Lawrence D. Attention, perception & psychophysics . 2016,第1期

机译：语音清晰度可增强听觉语音的融合
4. Recognition of Speech Isolated Words Based on Pyramid Phonetic Bag of Words Model Display and Kernel-Based Support Vector Machine Classifier Model [C] . Sodabeh Salehi Rekavandi, Hamidreza Ghaffary, Maryam Davodpour International conference on fundamental research in electrical engineering . 2019

机译：基于金字塔语音袋的单词模型显示和基于内核支持向量机分类器模型的识别言语孤立的词汇
5. Effects of phonetic duration and phonological complexity on immediate recall and articulation rate of English and Chinese words. [D] . Li, Tao-yuan. 2000

机译：语音持续时间和语音复杂性对英语单词和中文单词的即时回想和发音率的影响。
6. Visual Phonetic Processing Localized Using Speech and Non-Speech Face Gestures in Video and Point-Light Displays [O] . Lynne E. Bernstein, Jintao Jiang, Dimitrios Pantazis, 2011

机译：视觉语音处理本地化使用语音和非语音面部手势视频和点光显示器
7. Viewing speech in action: speech articulation videos in the public domain that demonstrate the sounds of the International Phonetic Alphabet (IPA) [O] . Nakai, Satsuki., Beavan, D., Lawson, Eleanor., 2016

机译：查看行动中的语音：公共领域的语音表达视频，演示国际语音字母（IPA）的声音
8. A Phonetic-Context Controlled Strategy for Segmentation and Phonetic Labeling of Speech. [R] . Mermelstein, P. 1974

机译：一种语音 - 语境控制的语音分词和语音标注策略。

Whole word phonetic displays for speech articulation training.

摘要

著录项

相似文献

相关主题

期刊订阅