首页> 外文会议>Mexican international conference on artificial intelligence >Dynamic Estimation of Phoneme Confusion Patterns with a Genetic Algorithm to Improve the Performance of Metamodels for Recognition of Disordered Speech
【24h】

Dynamic Estimation of Phoneme Confusion Patterns with a Genetic Algorithm to Improve the Performance of Metamodels for Recognition of Disordered Speech

机译:用遗传算法动态估计音素混淆模式,以提高元模型对无序语音识别的性能

获取原文

摘要

A field of research in Automatic Speech Recognition (ASR) is the development of assistive technology, particularly for people with speech disabilities. Diverse techniques have been proposed to accomplish accurately this task, among them the use of Metamodels. In this paper we present an approach to improve the performance of Metamodels which consists in using a speaker's phoneme confusion matrix to model the pronunciation patterns of this speaker. In contrast with previous confusion-matrix approaches, where the confusion-matrix is only estimated with fixed settings for language model, here we explore on the response of the ASR for different language model restrictions. A Genetic Algorithm (GA) was applied to further balance the contribution of each confusion-matrix estimation, and thus, to provide more reliable patterns. When incorporating these estimates into the ASR process with the Metamodels, consistent improvement in accuracy was accomplished when tested with speakers of mild to severe dysarthria which is a common speech disorder.
机译:自动语音识别(ASR)的研究领域是辅助技术的发展,特别是对有语言障碍的人。已经提出了多种技术来精确地完成此任务,其中包括使用元模型。在本文中,我们提出了一种改善元模型性能的方法,该方法包括使用说话人的音素混淆矩阵来建模该说话人的发音模式。与以前的混淆矩阵方法(仅在语言模型的固定设置下估计混淆矩阵)形成对比的情况下,这里我们探讨了ASR对不同语言模型限制的响应。应用遗传算法(GA)可以进一步平衡每个混淆矩阵估计的贡献,从而提供更可靠的模式。当将这些估计与元模型结合到ASR过程中时,使用轻度至重度构音障碍(一种常见的言语障碍)的说话者进行测试时,可以实现准确性的持续改善。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号