【24h】

Integrating Probability into LR Parsing

机译:将概率集成到LR解析中

获取原文
获取原文并翻译 | 示例

摘要

In this paper, we describe the methods of acquiring stochastic knowledge from corpus and integrating both of the lexical and syntactic probabilities into a LR parser in order to improve its disambiguatign ability. Based on the Hidden Markov Model (HMM) tagging system, we introduce the lexical statistic information acquistion. These information can be learned from the corpus no matter the corpus is tagged or not. On the syntactic(grammar) level, we proposed two different concepts; derivation porbability and reduction probability of Contect Free Grammar(CFG) rule.Because the original Inside-OUtside algorithm can only estimate the derivation probabilities of rules, we designed and implemented an automatic method to estimate the reduction porbabilities of rules from corpus. We also applied both the lexical porbability and rule probability of grammar to the CFG parser of a English to Chinese MT system. We set up a new kind of scoring system based on the statistic knowledge as the criteria of disambiguating the syntactic structures. Experiment shows that the accuracy of the parsing has been improved.
机译:在本文中,我们描述了从语料库获取随机知识并将词汇和句法概率都集成到LR解析器中的方法,以提高其歧义消除能力。基于隐马尔可夫模型(HMM)标记系统,我们引入了词法统计信息获取。无论是否标记了语料库,都可以从语料库中学习这些信息。在语法(语法)级别上,我们提出了两个不同的概念:由于无内部语法算法只能估计规则的推导概率,因此设计并实现了一种自动方法来从语料库中估计规则的减少概率。我们还将词汇的可拼性和规则的规则概率应用于英语到中文MT系统的CFG解析器。我们建立了一种基于统计知识的新型评分系统,作为消除句法结构歧义的标准。实验表明,解析的准确性得到了提高。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号