首页> 外文会议>Workshop on multiword expressions >Multiword Expression Identification with Recurring Tree Fragments and Association Measures
【24h】

Multiword Expression Identification with Recurring Tree Fragments and Association Measures

机译:用重复的树片段和关联措施多相表达识别

获取原文

摘要

We present a novel approach for the identification of multiword expressions (MWEs). The methodology extracts a large set of recurring syntactic fragments from a given treebank using a Tree-Kernel method. Differently from previous studies, the expressions underlying these fragments are arbitrarily long and can include intervening gaps. In the initial study we use these fragments to identify MWEs as a parsing task (in a supervised manner) as proposed by Green et al. (2011). Here we obtain a small improvement over previous results. In the second part, we compare various association measures in reranking the expressions underlying these fragments in an unsupervised fashion. We show how a newly defined measure (Log Inside Ratio) based on statistical parsing techniques is able to outperform classical association measures in the French data.
机译:我们提出了一种用于识别多字级表达式(MWE)的新方法。该方法使用树内核法从给定的树木制作一大组重复的句法片段。与以前的研究不同,这些碎片下面的表达是任意长的,并且可以包括干预差距。在初始研究中,我们使用这些片段将MWE识别为Green等人提出的解析任务(以监督方式)。 (2011)。在这里,我们对以前的结果获得了很小的改进。在第二部分中,我们将各种关联措施进行比较重新划分这些碎片的表达方式以无人监督的方式。我们展示了基于统计解析技术的新定义度量(对数率)是如何能够在法国数据中优于经典关联措施。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号