An Efficient Algorithm to Induce Minimum Average Lookahead Grammars for Incremental LR Parsing

42nd Annual Meeting of the Association for Computational Linguistics

Abstract

We define a new learning task, minimum average lookahead grammar induction, with strong potential implications for incremental parsing in NLP and cognitive models. Our thesis is that a suitable learning bias for grammar induction is to minimize the degree of lookahead required, on the underlying tenet that language evolution drove grammars to be efficiently parsable in incremental fashion. The input to the task is an unannotated corpus, plus a nondeterministic constraining grammar that serves as an abstract model of environmental constraints confirming or rejecting potential parses. The constraining grammar typically allows ambiguity and is itself poorly suited for an incremental parsing model, since it gives rise to a high degree of nondeterminism in parsing. The learning task, then, is to induce a deterministic LR(k) grammar under which it is possible to incrementally construct one of the correct parses for each sentence in the corpus, such that the average degree of lookahead needed to do so is minimized. This is a significantly more difficult optimization problem than merely compiling LR(k) grammars, since k is not specified in advance. Clearly, naïve approaches to this optimization can easily be computationally infeasible. However, by making combined use of GLR ancestor tables and incremental LR table construction methods, we obtain an O(n³ + 2ᵐ) greedy approximation algorithm for this task that is quite efficient in practice.
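To make the optimization objective concrete, the sketch below shows, in Python, the shape of the greedy selection loop the abstract describes: for each sentence, pick one parse licensed by the constraining grammar so that the average lookahead needed for deterministic incremental parsing stays small. This is only an illustrative sketch under assumptions, not the authors' algorithm: `candidate_parses` and `lookahead_needed` are hypothetical placeholders standing in for the nondeterministic constraining grammar and for the GLR ancestor table / incremental LR(k) table machinery that actually supplies the lookahead cost.

```python
# Minimal sketch of the minimum-average-lookahead objective and a greedy
# per-sentence selection loop. All helper callables are hypothetical
# stand-ins for machinery described in the paper, not its implementation.
from typing import Callable, Dict, List, Sequence, Tuple

Parse = tuple  # placeholder: any representation of a parse/derivation


def induce_min_avg_lookahead(
    corpus: Sequence[str],
    candidate_parses: Callable[[str], List[Parse]],   # parses allowed by the constraining grammar (assumed)
    lookahead_needed: Callable[[Dict[str, Parse], Parse], int],  # lookahead cost oracle (assumed)
) -> Tuple[Dict[str, Parse], float]:
    """Greedily commit one licensed parse per sentence, preferring the parse
    whose derivation adds the least lookahead given the choices made so far,
    and report the resulting average lookahead over the corpus."""
    chosen: Dict[str, Parse] = {}
    total_k = 0
    for sentence in corpus:
        # Among the parses the constraining grammar confirms for this sentence,
        # keep the one that keeps the induced LR(k) grammar's lookahead lowest.
        best_parse, best_k = min(
            ((p, lookahead_needed(chosen, p)) for p in candidate_parses(sentence)),
            key=lambda pair: pair[1],
        )
        chosen[sentence] = best_parse
        total_k += best_k
    avg_k = total_k / len(corpus) if corpus else 0.0
    return chosen, avg_k


if __name__ == "__main__":
    # Toy demo with dummy callables: pretend smaller parse structures need less lookahead.
    corpus = ["the dog barks", "time flies"]
    cands = lambda s: [("parse_a", s), ("parse_b", s, "extra")]
    cost = lambda chosen, parse: len(parse)
    parses, avg = induce_min_avg_lookahead(corpus, cands, cost)
    print(parses, avg)
```

The greedy, sentence-by-sentence commitment mirrors the approximation strategy the abstract alludes to; the efficiency claimed in the paper comes from the GLR ancestor tables and incremental LR table construction, which this sketch hides behind the cost callback.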