首页> 外文OA文献 >Analyzing holistic parsers: Implications for robust parsing and systematicity
【2h】

Analyzing holistic parsers: Implications for robust parsing and systematicity

机译:分析整体解析器:对稳健解析和系统性的启示

摘要

Holistic parsers offer a viable alternative to traditional algorithmic parsers. They have good generalization performance and are robust inherently. In a holistic parser, parsing is achieved by mapping the connectionist representation of the input sentence to the connectionist representation of the target parse tree directly. Little prior knowledge of the underlying parsing mechanism thus needs to be assumed. However, it also makes holistic parsing difficult to understand. In this article, an analysis is presented for studying the operations of the confluent preorder parser (CPP). In the analysis, the CPP is viewed as a dynamical system, and holistic parsing is perceived as a sequence of state transitions through its state-space. The seemingly one-shot parsing mechanism can thus be elucidated as a step-by-step inference process, with the intermediate parsing decisions being reflected by the states visited during parsing. The study serves two purposes. First, it improves our understanding of how grammatical errors are corrected by the CPP. The occurrence of an error in a sentence will cause the CPP to deviate from the normal track that is followed when the original sentence is parsed. But as the remaining terminals are read, the two trajectories will gradually converge until finally the correct parse tree is produced. Second, it reveals that having systematic parse tree representations alone cannot guarantee good generalization performance in holistic parsing. More important, they need to be distributed in certain useful locations of the representational space. Sentences with similar trailing terminals should have their corresponding parse tree representations mapped to nearby locations in the representational space. The study provides concrete evidence that encoding the linearized parse trees as obtained via preorder traversal can satisfy such a requirement.
机译:整体解析器为传统算法解析器提供了可行的替代方案。它们具有良好的泛化性能,并且具有固有的鲁棒性。在整体解析器中,解析是通过将输入语句的连接主义表示直接映射到目标解析树的连接主义表示而实现的。因此,需要假定几乎没有基础解析机制的先验知识。但是,这也使整体分析难以理解。在本文中,将进行分析以研究融合式预解析器(CPP)的操作。在分析中,CPP被视为一个动态系统,整体解析被视为通过其状态空间进行的一系列状态转换。因此,看似单步的解析机制可以作为逐步推理过程加以阐明,中间解析决策由解析期间访问的状态反映出来。该研究有两个目的。首先,它提高了我们对CPP如何纠正语法错误的理解。句子中出现错误将导致CPP偏离解析原始句子时遵循的常规轨道。但是,当读取剩余的终端时,这两个轨迹将逐渐收敛,直到最终生成正确的解析树。其次,它揭示了仅具有系统的分析树表示形式并不能保证整体分析中具有良好的泛化性能。更重要的是,它们需要分布在表示空间的某些有用位置。尾部相似的句子的对应解析树表示应映射到表示空间中的附近位置。该研究提供了具体的证据,证明通过预遍历获得的对线性化分析树进行编码可以满足这一要求。

著录项

  • 作者

    Ho KSE; Chan LW;

  • 作者单位
  • 年度 2001
  • 总页数
  • 原文格式 PDF
  • 正文语种 eng
  • 中图分类

相似文献

  • 外文文献
  • 中文文献
  • 专利

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号