首页> 外文期刊>Expert systems with applications >A dynamic term discovery strategy for automatic speech recognizers with evolving dictionaries
【24h】

A dynamic term discovery strategy for automatic speech recognizers with evolving dictionaries

机译:具有不断发展词典的自动语音识别器的动态术语发现策略

获取原文
获取原文并翻译 | 示例

摘要

We present a dynamic term discovery (TD) strategy that is capable of automatically adapting the dictionaries managed by ASR systems to the input speech, in terms of lexicon and language model (LM). The adaptation tries to solve the problem of out-of-vocabulary (OOV) words that are likely to appear in most realistic scenarios and uses external knowledge sources for extending the capabilities of the LMs present in the systems. The handling of the OOV words is made by existing TD strategies that are able to detect and solve OOVs, plus special word selection processes that decide which words are to be added or deleted, so as to update the vocabulary constantly. We also propose a mathematical model for controlling the vocabulary size of the ASR system as well as the word addition and deletion rates that are involved. Then, the update of the overall LM is based on an interpolation scheme with smaller LMs built with external language knowledge that depends on the current speech and the words to be added at each time. We designed a realistic experimental framework for evaluating the strategy, employing ASR systems with moderated vocabulary sizes and a couple of test speech corpora with very distinct features. The results show that the dynamic TD strategy is able to offer a general positive tendency in WER improvement over systems without it, being able indeed to reach a significant difference after few hours of speech processing.
机译:我们提出了一种动态术语发现(TD)策略,其能够自动调整由ASR系统管理的词典,以便在Lexicon和语言模型(LM)方面。适应试图解决可能出现在最具现实场景中的词汇(OOV)单词的问题,并使用外部知识源来扩展系统中存在的LMS的能力。 OOV字的处理是通过现有的TD策略进行的,能够检测和解决OOV,以及决定要添加或删除哪些单词的特殊单词选择进程,以便不断更新词汇。我们还提出了一种用于控制ASR系统的词汇量的数学模型以及所涉及的单词添加和删除率。然后,整个LM的更新基于具有较小LMS的插值方案,其内部语言知识构建,这取决于当前语音和每次要添加的单词。我们设计了一种逼真的实验框架,用于评估策略,采用具有中等词汇表的ASR系统以及一些具有非常明显的特征的测试语音集团。结果表明,动态TD策略能够在没有它的情况下对系统的改进提供一般的积极趋势,能够在几小时的语音处理后达到显着差异。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号