首页> 外文会议>Grammatical Inference: Theory, Applications and Alternatives, IEE Colloquium on >Analysis of a simple bipos language model-attempt at a strategy toimprove language models for speech recognition
【24h】

Analysis of a simple bipos language model-attempt at a strategy toimprove language models for speech recognition

机译:分析简单的bipos语言模型-尝试以下策略改善语音识别的语言模型

获取原文

摘要

A speech recognizer has to choose, at each point in the utterance,the words among all the words in the vocabulary, that are the mostlikely. To that end, it uses an acoustic model and a language model andthe author focuses on the language model. The bipos model is presentedand analysed. A method is introduced called probability decomposition tomeasure which part of the model is performing particularly well orpoorly. Based on this analysis, the author modifies the modeling ofunknown words and this leads to a reduction in the entropy of at least14% (up to 21%). Other conclusions obtained from the analysis are alsogiven. An attempt at a strategy to improve language models in general isgiven. To that end, the author defines a class of models called statelanguage models. This class contains most currently employed models.However, these currently used models cover only a small area in thespace of all possible state language models. A more systematic study ofthis space is proposed in order to improve current language models. Astatistical method, called classification and regression trees ispresented as a tool for this purpose
机译:语音识别器必须在发声的每个点上选择 词汇表中所有单词中最多的单词 可能的。为此,它使用了声学模型和语言模型, 作者专注于语言模型。介绍了bipos模型 并进行分析。引入了一种称为概率分解的方法 衡量模型的哪个部分表现特别好,或者 糟糕。在此分析的基础上,作者修改了 未知的单词,这导致至少降低了熵 14%(最高21%)。从分析中得出的其他结论也是 给定的。总体而言,尝试一种改善语言模型的策略是 给定的。为此,作者定义了一类称为状态的模型 语言模型。此类包含当前使用最多的模型。 但是,这些当前使用的模型仅覆盖了 所有可能的状态语言模型的空间。对系统进行更系统的研究 提出这个空间是为了改善当前的语言模型。一种 统计方法,称为分类和回归树是 表示为此目的的工具

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号