首页> 外文会议>International conference on computational linguistics >Learning the Taxonomy of Function Words for Parsing
【24h】

Learning the Taxonomy of Function Words for Parsing

机译:学习功能词的分类法进行解析

获取原文

摘要

Completely data-driven grammar training is prone to over-fitting. Human-defined word class knowledge is useful to address this issue. However, the manual word class taxonomy may be unreliable and irrational for statistical natural language processing, aside from its insufficient linguistic phenomena coverage and domain adaptivity. In this paper, a formalized representation of function word subcategorization is developed for parsing in an automatic manner. The function word classification representing intrinsic features of syntactic usages is used to supervise the grammar induction, and the structure of the taxonomy is learned simultaneously. The grammar learning process is no longer a unilaterally supervised training by hierarchical knowledge, but an interactive process between the knowledge structure learning and the grammar training. The established taxonomy implies the stochastic significance of the diversified syntactic features. The experiments on both Penn Chinese Treebank and Tsinghua Treebank show that the proposed method improves parsing performance by 1.6% and 7.6% respectively over the baseline.
机译:完全由数据驱动的语法培训容易过分拟合。人工定义的词类知识对于解决此问题很有用。但是,除了语言现象的覆盖范围和领域适应性不足之外,手动词类分类法对于统计自然语言处理而言可能不可靠且不合理。在本文中,功能词子类别的形式化表示被开发出来以自动方式进行解析。代表句法用法内在特征的功能词分类被用来监督语法归纳,并且同时学习分类法的结构。语法学习过程不再是由层次知识进行的单方面监督训练,而是知识结构学习与语法训练之间的交互过程。建立的分类法暗示了多种句法特征的随机意义。在Penn Chinese Treebank和Tsinghua Treebank上的实验表明,所提出的方法分别比基线提高了1.6%和7.6%的解析性能。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号