首页> 外文会议>Mexican international conference on artificial intelligence >Dynamic Estimation of Phoneme Confusion Patterns with a Genetic Algorithm to Improve the Performance of Metamodels for Recognition of Disordered Speech

【24h】

Dynamic Estimation of Phoneme Confusion Patterns with a Genetic Algorithm to Improve the Performance of Metamodels for Recognition of Disordered Speech

机译：用遗传算法动态估计音素混淆模式，以提高元模型对无序语音识别的性能

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

A field of research in Automatic Speech Recognition (ASR) is the development of assistive technology, particularly for people with speech disabilities. Diverse techniques have been proposed to accomplish accurately this task, among them the use of Metamodels. In this paper we present an approach to improve the performance of Metamodels which consists in using a speaker's phoneme confusion matrix to model the pronunciation patterns of this speaker. In contrast with previous confusion-matrix approaches, where the confusion-matrix is only estimated with fixed settings for language model, here we explore on the response of the ASR for different language model restrictions. A Genetic Algorithm (GA) was applied to further balance the contribution of each confusion-matrix estimation, and thus, to provide more reliable patterns. When incorporating these estimates into the ASR process with the Metamodels, consistent improvement in accuracy was accomplished when tested with speakers of mild to severe dysarthria which is a common speech disorder.

机译：自动语音识别（ASR）的研究领域是辅助技术的发展，特别是对有语言障碍的人。已经提出了多种技术来精确地完成此任务，其中包括使用元模型。在本文中，我们提出了一种改善元模型性能的方法，该方法包括使用说话人的音素混淆矩阵来建模该说话人的发音模式。与以前的混淆矩阵方法（仅在语言模型的固定设置下估计混淆矩阵）形成对比的情况下，这里我们探讨了ASR对不同语言模型限制的响应。应用遗传算法（GA）可以进一步平衡每个混淆矩阵估计的贡献，从而提供更可靠的模式。当将这些估计与元模型结合到ASR过程中时，使用轻度至重度构音障碍（一种常见的言语障碍）的说话者进行测试时，可以实现准确性的持续改善。

著录项

来源
《Mexican international conference on artificial intelligence》|2013年|175-187|共13页
会议地点
作者
Santiago Omar Caballero Morales; Felipe Trujillo Romero;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类
关键词
Genetic Algorithms; Disordered Speech Recognition; Metamodels;

机译：遗传算法;语音识别障碍;元模型;

相似文献

外文文献
中文文献
专利

1. Confusion analysis in phoneme based speech recognition in Hindi [J] . Bhatt Shobha, Dev Amita, Jain Anurag Journal of ambient intelligence and humanized computing . 2020,第10期

机译：印地文中音素语音识别的困惑分析
2. Phoneme recognition using zerocrossing interval distribution of speech patterns and ANN [J] . R.K. Sunil Kumar, V.L. Lajish International journal of speech technology . 2013,第1期

机译：使用语音模式和神经网络的零交叉间隔分布的音素识别
3. Phoneme recognition using zerocrossing interval distribution of speech patterns and ANN [J] . R. K. Sunil Kumar, V. L. Lajish International Journal of Speech Technology . 2013,第1期

机译：使用语音模式和神经网络的零交叉间隔分布的音素识别
4. Dynamic Estimation of Phoneme Confusion Patterns with a Genetic Algorithm to Improve the Performance of Metamodels for Recognition of Disordered Speech [C] . Santiago Omar Caballero Morales, Felipe Trujillo Romero Mexican international conference on artificial intelligence . 2013

机译：具有遗传算法的音素混淆模式的动态估计，提高元模型识别词性语音的性能
5. Improved phoneme-based myoelectric speech recognition. [D] . Zhou, Quan. 2008

机译：改进的基于音素的肌电语音识别。
6. Classifying acoustic signals into phoneme categories: average and dyslexic readers make use of complex dynamical patterns and multifractal scaling properties of the speech signal [O] . Fred Hasselman -1

机译：将声音信号分为音素类别：普通和阅读困难的读者利用语音信号的复杂动态模式和多重分形缩放特性
7. Predictability of the effects of phoneme merging on speech recognition performance by quantifying phoneme relations [O] . Bucar Shigemori Lia Saki, Reichel Uwe D., Schiel Florian 2013

机译：通过量化音素关系来预测音素合并对语音识别性能的影响

Dynamic Estimation of Phoneme Confusion Patterns with a Genetic Algorithm to Improve the Performance of Metamodels for Recognition of Disordered Speech

摘要

著录项

相似文献

相关主题

期刊订阅