Conference proceedings: Grammatical inference: Theoretical results and applications

Bounding the Maximal Parsing Performance of Non-Terminally Separated Grammars

Abstract

Unambiguous Non-Terminally Separated (UNTS) grammars have good learnability properties but are too restrictive to be used for natural language parsing. We present a generalization of UNTS grammars, called Unambiguous Weakly NTS (UWNTS) grammars, that preserves the learnability properties. We then study the problem of using them to parse natural language and evaluating them against a gold treebank. If the target language is not UWNTS, there is an upper bound on the parsing performance. In this paper we develop methods to find upper bounds on the unlabeled F1 performance that any UWNTS grammar can achieve over a given treebank. We define a new metric, show that optimizing it is NP-hard but solvable with specialized software, and show how to translate the result into a bound on F1. In experiments with the WSJ10 corpus, we find an F1 bound of 76.1% for UWNTS grammars over the POS-tag alphabet.
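
For readers unfamiliar with the evaluation measure named in the abstract, the following is a minimal sketch of how unlabeled precision, recall and F1 are commonly computed from bracket sets. It is not taken from the paper: the function name `unlabeled_f1`, the span representation, and the choice of micro-averaging over the whole treebank are assumptions made for illustration, and the paper's exact conventions (for example, whether whole-sentence or unary brackets are counted) are not stated in the abstract.

```python
from typing import List, Set, Tuple

# A bracket is an unlabeled span (start, end) over token positions, end exclusive.
# This representation is an assumption chosen for illustration.
Span = Tuple[int, int]


def unlabeled_f1(
    gold: List[Set[Span]], predicted: List[Set[Span]]
) -> Tuple[float, float, float]:
    """Return micro-averaged unlabeled (precision, recall, F1) over a treebank.

    Each list element is the set of brackets for one sentence.
    """
    if len(gold) != len(predicted):
        raise ValueError("gold and predicted must cover the same sentences")

    # Count brackets that appear in both the gold and the predicted parse.
    matched = sum(len(g & p) for g, p in zip(gold, predicted))
    n_pred = sum(len(p) for p in predicted)
    n_gold = sum(len(g) for g in gold)

    precision = matched / n_pred if n_pred else 0.0
    recall = matched / n_gold if n_gold else 0.0
    denom = precision + recall
    f1 = 2 * precision * recall / denom if denom else 0.0
    return precision, recall, f1


# Toy two-sentence example (invented data, not from WSJ10).
gold_brackets = [{(0, 2), (2, 5)}, {(1, 3)}]
pred_brackets = [{(0, 2), (3, 5)}, {(1, 3), (0, 3)}]
p, r, f = unlabeled_f1(gold_brackets, pred_brackets)
print(f"P={p:.3f} R={r:.3f} F1={f:.3f}")
```

Under this reading, an upper bound on F1 for a grammar class is the best value of this score that any grammar in the class could reach on the given gold brackets.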
