【24h】

Computation of the N Best Parse Trees for Weighted and Stochastic Context-Free Grammars

机译:加权和随机上下文无关文法的N个最佳解析树的计算

获取原文
获取原文并翻译 | 示例

摘要

Context-Free Grammars are the object of increasing interest in the pattern recognition research community in an attempt to overcome the limited modeling capabilities of the simpler regular grammars, and have application in a variety of fields such as language modeling, speech recognition, optical character recognition, computational biology, etc. This paper proposes an efficient algorithm to solve one of the problems associated to the use of weighted and stochastic Context-Free Grammars: the problem of computing the N best parse trees of a given string. After the best parse tree has been computed using the CYK algorithm, a large number of alternative parse trees are obtained, in order by weight (or probability), in a small fraction of the time required by the CYK algorithm to find the best parse tree. This is confirmed by experimental results using grammars from two different domains: a chromosome grammar, and a grammar modeling natural language sentences from the Wall Street Journal corpus.
机译:上下文无关文法是模式识别研究界越来越感兴趣的对象,旨在克服较简单的常规语法的有限建模功能,并已在语言建模,语音识别,光学字符识别等各个领域中得到应用本文提出了一种有效的算法来解决与加权和随机上下文无关文法的使用相关的问题之一:计算给定字符串的N个最佳解析树的问题。使用CYK算法计算出最佳解析树后,可以按权重(或概率)的顺序,在CYK算法找到最佳解析树所需的时间的一小部分中,获得了大量的备用解析树。 。使用来自两个不同领域的语法得到的实验结果证实了这一点:染色体语法,以及对《华尔街日报》语料库中的自然语言句子进行建模的语法。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号