International Conference on Spoken Language Processing

Structured Redefinition of Sound Units by Merging and Splitting for Improved Speech Recognition

Abstract

The performance of speech recognition systems degrades when the basic sound units used are poorly defined or inconsistently used. Several attempts have been made to improve dictionaries automatically, either by redefining the pronunciations of words in terms of existing sound units, or by redefining the sound units themselves completely. The problem with these approaches is that the former is limited by the sound units used, while the latter discards all the human knowledge that has been incorporated into an expert-designed recognition dictionary. In this paper we propose a new merging-and-splitting algorithm that attempts to redefine the basic sound units used in the dictionary while maintaining the expert knowledge built into a manually designed dictionary. Sound units from an existing dictionary are merged based on their inherent confusability, as measured by a Monte Carlo-based metric, and subsequently split to maximize the likelihood of the training data. Experiments with the Resource Management database indicate that this approach improves recognition accuracy when context-independent models are used for recognition. When context-dependent models are used, the improvement observed is reduced.
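
The abstract does not specify the Monte Carlo metric, so the following is only a minimal sketch of one plausible way to estimate confusability between two sound units and to pick a merge candidate. Everything in it is an assumption made for illustration: each unit is modeled as a single diagonal-covariance Gaussian given as a (mean, variance) pair, confusability is taken to be the fraction of samples drawn from one unit that the other unit explains better, and the names `log_gauss`, `confusability`, and `most_confusable_pair` are hypothetical. The splitting step is not sketched.

```python
import math
import random


def log_gauss(x, mean, var):
    """Log-density of a diagonal-covariance Gaussian at point x."""
    return sum(
        -0.5 * (math.log(2 * math.pi * v) + (xi - m) ** 2 / v)
        for xi, m, v in zip(x, mean, var)
    )


def sample(mean, var, rng):
    """Draw one point from a diagonal-covariance Gaussian."""
    return [rng.gauss(m, math.sqrt(v)) for m, v in zip(mean, var)]


def confusability(a, b, n=2000, seed=0):
    """Monte Carlo estimate of how often a sample from one unit is
    better explained by the other unit, averaged over both directions.

    Assumption: this misclassification rate stands in for the paper's
    metric, which the abstract does not define.
    """
    rng = random.Random(seed)

    def one_way(src, other):
        wrong = 0
        for _ in range(n):
            x = sample(*src, rng)
            if log_gauss(x, *other) > log_gauss(x, *src):
                wrong += 1
        return wrong / n

    return 0.5 * (one_way(a, b) + one_way(b, a))


def most_confusable_pair(units):
    """Return the pair of unit names with the highest estimated confusability."""
    names = sorted(units)
    pairs = [(p, q) for i, p in enumerate(names) for q in names[i + 1:]]
    return max(pairs, key=lambda pq: confusability(units[pq[0]], units[pq[1]]))


# Hypothetical one-dimensional units: two close vowels and a distant fricative.
units = {
    "AA": ([0.0], [1.0]),
    "AH": ([0.3], [1.0]),
    "S": ([8.0], [1.0]),
}
merge_candidate = most_confusable_pair(units)
```

Under these assumptions, heavily overlapping units score close to 0.5 and well-separated units score close to 0, so a merge pass would repeatedly combine the top-scoring pair until the score falls below some chosen threshold.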
