Multiword Expression Identification with Recurring Tree Fragments and Association Measures

Abstract

We present a novel approach for the identification of multiword expressions (MWEs). The methodology extracts a large set of recurring syntactic fragments from a given treebank using a tree-kernel method. Unlike in previous studies, the expressions underlying these fragments can be arbitrarily long and can include intervening gaps. In the first study, we use these fragments to identify MWEs as a parsing task (in a supervised manner), as proposed by Green et al. (2011), and obtain a small improvement over previous results. In the second part, we compare various association measures for reranking the expressions underlying these fragments in an unsupervised fashion. We show that a newly defined measure (Log Inside Ratio), based on statistical parsing techniques, outperforms classical association measures on the French data.
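
To make the idea of reranking candidates with an association measure concrete, here is a minimal Python sketch of pointwise mutual information (PMI), one of the classical association measures. It is not the Log Inside Ratio, whose definition is not given in this record, and it works on adjacent word pairs rather than on tree fragments. The function name `pmi_ranking` and the toy corpus are assumptions made for illustration only.

```python
import math
from collections import Counter


def pmi_ranking(sentences):
    """Rank adjacent word pairs by pointwise mutual information (PMI).

    `sentences` is a list of token lists. Returns (pair, score) tuples
    sorted from the highest to the lowest PMI.
    """
    unigrams = Counter()
    bigrams = Counter()
    for tokens in sentences:
        unigrams.update(tokens)
        bigrams.update(zip(tokens, tokens[1:]))

    n_uni = sum(unigrams.values())
    n_bi = sum(bigrams.values())

    scores = {}
    for (w1, w2), count in bigrams.items():
        p_pair = count / n_bi          # relative frequency of the pair
        p_w1 = unigrams[w1] / n_uni    # relative frequency of each word
        p_w2 = unigrams[w2] / n_uni
        scores[(w1, w2)] = math.log2(p_pair / (p_w1 * p_w2))

    return sorted(scores.items(), key=lambda kv: kv[1], reverse=True)


# Hypothetical toy corpus, not taken from the paper.
corpus = [
    "he kicked the bucket".split(),
    "she kicked the ball".split(),
    "the bucket was full".split(),
    "he took the ball".split(),
]
ranking = pmi_ranking(corpus)
for pair, score in ranking:
    print(pair, round(score, 3))
```

On a sample this small, PMI ranks the pair made of two words that each occur once at the top, which illustrates its well-known bias toward rare events.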

Bibliographic details

Source
Workshop on Multiword Expressions | 2015 | 9 pages
Authors
Federico Sangati; Andreas van Cranenburgh
Original format
PDF
Chinese Library Classification
Programming; software engineering

Similar documents

1. A classification-based approach to the identification of Multiword Expressions (MWEs) in Magahi Applying SVM [J]. Shivek Kumar, Pitambar Behera, Girish Nath Jha. Procedia Computer Science. 2017, No. 1
2. A classification-based approach to the identification of Multiword Expressions (MWEs) in Magahi Applying SVM [J]. Shivek Kumar, Pitambar Behera, Girish Nath Jha. Procedia Computer Science. 2017, No. 22
3. Identification of Multiword Expressions by Combining Multiple Linguistic Information Sources [J]. Yulia Tsvetkov, Shuly Wintner. Computational Linguistics. 2014, No. 2
4. Multiword Expression Identification with Recurring Tree Fragments and Association Measures [C]. Federico Sangati, Andreas van Cranenburgh. 11th Workshop on Multiword Expressions. 2015
5. The Effects of Using Textual Enhancement on Processing and Learning Multiword Expressions [D]. Alshaikhi, Adel Zain. 2018
6. Learning about phraseology from corpora: A linguistically motivated approach for Multiword Expression identification [O]. Uxoa Inurrieta, Itziar Aduriz, Arantza Díaz de Ilarraza. 2020
7. Distinguishing subtypes of multiword expressions using linguistically-motivated statistical measures [O]. Afsaneh Fazly, Suzanne Stevenson. 2007
