首页> 外文期刊>Electronics and Communications in Japan. Part 3, Fundamental Electronic Science >Compressing the Factoring Table and Performing Garbage Collection on Unusable Word Hypotheses in a Continuous Speech Recognition System
【24h】

Compressing the Factoring Table and Performing Garbage Collection on Unusable Word Hypotheses in a Continuous Speech Recognition System

机译:压缩分解表并在连续语音识别系统中对不可用的单词假设进行垃圾收集

获取原文
获取原文并翻译 | 示例

摘要

We have investigated methods for reducing the number of word hypotheses registered in the word graph and the amount of memory used by the factoring tables for the tree-structured dictionary with the objective of reducing the memory requirements of a continuous speech recognition system. By assigning word hypotheses in the word graph attributes relating to the number of continuation hypotheses in which they are included, we are able to efficiently determine unusable word hypotheses during pruning and can perform garbage collection. This procedure allows us to reduce the amount of memory needed for generating word hypotheses from 127 MB to 6.9 MB. In addition, by approximating the bigram values held in the factoring tables with POS bigrams, we were able to reduce the memory consumption of the factoring tables from 56 MB to 19 MB with almost no impairment of recognition performance. As a result of these reductions in memory requirements, the memory consumption of the decoder has been reduced from 246 MB to 113 MB.
机译:为了减少连续语音识别系统的存储需求,我们已经研究了减少在单词图中注册的单词假设的数量以及因式分解表所使用的存储量的方法。通过在单词图属性中分配与包含这些假设的连续假设数量相关的单词假设,我们可以在修剪过程中有效地确定不可用的单词假设,并可以执行垃圾收集。此过程使我们可以将生成单词假设所需的内存量从127 MB减少到6.9 MB。此外,通过用POS双元文件近似分解表中保存的双元文件值,我们能够将分解表的内存消耗从56 MB减少到19 MB,而几乎不影响识别性能。这些减少的内存需求的结果是,解码器的内存消耗已从246 MB减少到113 MB。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号