首页> 外文会议>Workshop on multiword expressions >Combining Different Features of Idiomaticity for the Automatic Classification of Noun+Verb Expressions in Basque
【24h】

Combining Different Features of Idiomaticity for the Automatic Classification of Noun+Verb Expressions in Basque

机译:结合成语的不同特征对巴斯克语中的名词+动词表达进行自动分类

获取原文

摘要

We present an experimental study of how different features help measuring the idiomaticity of noun+verb (NV) expressions in Basque. After testing several techniques for quantifying the four basic properties of multiword expressions or MWEs (institutionalization, semantic non-compositionality, morphosyntac-tic fixedness and lexical fixedness), we test different combinations of them for classification into idioms and collocations, using Machine Learning (ML) and feature selection. The results show the major role of distributional similarity, which measures composi-tionality, in the extraction and classification of MWEs, especially, as expected, in the case of idioms. Even though cooccurrence and some aspects of morphosyntactic flexibility contribute to this task in a more limited measure, ML experiments make benefit of these sources of knowledge, allowing to improve the results obtained using exclusively distributional similarity features.
机译:我们提供了一项有关不同功能如何帮助测量巴斯克语中名词+动词(NV)表达式的惯用性的实验研究。在测试了几种量化多词表达或MWE的四个基本属性的技术(制度化,语义非组成性,词法固定性和词法固定性)之后,我们使用机器学习(ML)测试了它们的不同组合,以将其分类为成语和搭配。 )和功能选择。结果表明,分布相似性在MWE的提取和分类中(尤其是在成语的情况下,如预期的那样)在MWE的提取和分类中起着重要作用。即使共现和形态句法灵活性的某些方面在某种程度上限制了此任务,但ML实验还是利用了这些知识资源,从而可以改善仅使用分布相似性功能获得的结果。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号