【24h】

W-n-gram: a Hybrid Language Model

机译:W-n-gram:一种混合语言模型

获取原文
获取原文并翻译 | 示例

摘要

been a current research topic in many domains including speech recognition, optical character recognition, handwriting recognition, machine translation, and spelling correction. There are mainly two types of language models. One type is based on mathematics. The most widely used language model of this type is n-gram adopting statistics. There are three problems in n-gram: long distance restriction, recursive nature, and partial language understanding. Another type of language model is based on linguistics. There are many difficulties using this type of language model to process large scale real texts. In order to integrate the advantage of n-gram based statistics and the language model based linguistics, we present a new hybrid language model, which makes use of grammatical rules or semantic rules to improve an n-gram. Problems such as long distance restriction, recursive nature and partial language understanding can be solved by this hybrid model using suitable rules.
机译:语音识别,光学字符识别,手写识别,机器翻译和拼写校正是许多领域的当前研究主题。语言模型主要有两种。一种类型是基于数学的。这种类型使用最广泛的语言模型是采用统计的n-gram。 n-gram中存在三个问题:长距离限制,递归性质和部分语言理解。语言模型的另一种类型是基于语言学的。使用这种类型的语言模型来处理大规模真实文本存在许多困难。为了整合基于n-gram的统计和基于语言模型的语言学的优势,我们提出了一种新的混合语言模型,该模型利用语法规则或语义规则来改进n-gram。这种混合模型可以使用适当的规则解决诸如长距离限制,递归性质和部分语言理解之类的问题。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号