Compressing the Factoring Table and Performing Garbage Collection on Unusable Word Hypotheses in a Continuous Speech Recognition System

Hiroaki Kokubo; Teruaki Hayashi; Hirofumi Yamamoto; Genichiro Kikui

Abstract

To reduce the memory requirements of a continuous speech recognition system, we investigated methods for reducing both the number of word hypotheses registered in the word graph and the memory used by the factoring tables for the tree-structured dictionary. By attaching to each word hypothesis in the word graph attributes related to the number of continuation hypotheses that include it, we can efficiently identify unusable word hypotheses during pruning and garbage-collect them. This procedure reduces the memory needed for generating word hypotheses from 127 MB to 6.9 MB. In addition, by approximating the bigram values held in the factoring tables with part-of-speech (POS) bigrams, we reduced the memory consumption of the factoring tables from 56 MB to 19 MB with almost no loss of recognition performance. Together, these reductions lower the memory consumption of the decoder from 246 MB to 113 MB.
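
The garbage-collection idea in the abstract can be illustrated with a minimal sketch. It assumes the attribute is a plain reference count of live continuations and that each word hypothesis keeps a single predecessor link; the abstract does not give the paper's actual data structures, so the names WordHyp, WordGraph, add, and prune are hypothetical and chosen only for illustration.

```python
from dataclasses import dataclass
from typing import Dict, Optional


@dataclass
class WordHyp:
    """A word hypothesis in the word graph (hypothetical structure)."""
    word: str
    end_frame: int
    pred: Optional[int]  # id of the preceding word hypothesis, if any
    refs: int = 0        # assumed attribute: number of live continuations


class WordGraph:
    def __init__(self) -> None:
        self.hyps: Dict[int, WordHyp] = {}
        self._next_id = 0

    def add(self, word: str, end_frame: int, pred: Optional[int] = None) -> int:
        """Register a hypothesis and count it as a continuation of its predecessor."""
        hid = self._next_id
        self._next_id += 1
        self.hyps[hid] = WordHyp(word, end_frame, pred)
        if pred is not None:
            self.hyps[pred].refs += 1
        return hid

    def prune(self, hid: int) -> int:
        """Free a pruned hypothesis, then any predecessors left without continuations.

        Returns the number of hypotheses freed. A hypothesis that is still
        included in a live continuation is left in place.
        """
        freed = 0
        current: Optional[int] = hid
        while current is not None:
            hyp = self.hyps[current]
            if hyp.refs > 0:
                break  # still used by another continuation
            del self.hyps[current]
            freed += 1
            if hyp.pred is not None:
                self.hyps[hyp.pred].refs -= 1
            current = hyp.pred
        return freed


# Two competing paths share the start hypothesis.
graph = WordGraph()
start = graph.add("<s>", 0)
a = graph.add("recognize", 30, pred=start)
b = graph.add("wreck", 28, pred=start)
c = graph.add("speech", 60, pred=a)
d = graph.add("a", 35, pred=b)

# Pruning the end of the second path frees "a" and then "wreck";
# "<s>" survives because "recognize" still continues from it.
freed = graph.prune(d)
```

With counts maintained at insertion time, deciding whether a hypothesis is unusable during pruning is a constant-time check, and the cascade only touches hypotheses that are actually freed.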


Bibliographic details

Source: Electronics and Communications in Japan. Part 3, Fundamental Electronic Science, 2006, No. 2, pp. 54-64 (11 pages)
Authors: Hiroaki Kokubo; Teruaki Hayashi; Hirofumi Yamamoto; Genichiro Kikui
Affiliation: ATR Spoken Language Translation Research Labs., Kyoto, 619-0288 Japan
Original format: PDF
Language: English
Chinese Library Classification: General issues
Keywords: speech recognition; small memory footprint; tree-structured dictionary; word hypotheses; garbage collection

