首页> 外文会议>LREC-2012 >Measuring the compositionality of NV expressions in Basque by means of distributional similarity techniques

【24h】

Measuring the compositionality of NV expressions in Basque by means of distributional similarity techniques

机译：通过分布相似技术测量巴斯克地震中NV表达式的组成性

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

We present several experiments aiming at measuring the semantic compositionality of NV expressions in Basque. Our approach is based on the hypothesis that compositionality can be related to distributional similarity. The contexts of each NV expression are compared with the contexts of its corresponding components, by means of different techniques, as similarity measures usually used with the Vector Space Model (VSM), Latent Semantic Analysis (LSA) and some measures implemented in the Lemur Toolkit, as Indri index, tf-idf, Okapi index and Kullback-Leibler divergence. Using our previous work, with cooccurrence techniques as a baseline, the results point to improvements using the Indri index or Kullback-Leibler divergence, and a slight further improvement when used in combination with cooccurrence measures such as t-score, via rank-aggregation. This work is part of a project for MWE extraction and characterization using different techniques aiming at measuring the properties related to idiomaticity, as institutionalization, non-compositionality and lexico-syntactic fixedness.

机译：我们在旨在测量巴斯克中NV表达的语义构成性的几个实验。我们的方法是基于该假设，即合成性可能与分布相似性有关。通过不同的技术将每个NV表达的上下文与其相应组件的上下文进行比较，因为通常与矢量空间模型（VSM），潜在语义分析（LSA）和Lemur Toolkit中实现的一些措施一起使用的相似度措施，作为Indri索引，TF-IDF，OKAPI索引和Kullback-Leibler发散。使用我们以前的工作，用Cooccurrence技术作为基线，结果指出了使用Indri指数或Kullback-Leibler发散的改进，并且当与C秩聚集相结合使用时使用QuicCurrence措施（例如T-Score）的逐渐进一步改善。这项工作是使用不同技术的MWE提取和表征的项目的一部分，旨在测量与惯用性有关的性质，作为制度化，非合成性和词典语法固定性。

著录项

来源
《LREC-2012》|2012年||共6页
会议地点
作者
Antton Gurrutxaga; I?aki Alegria;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类 41.11083;
关键词
MWEs; idioms; collocations; compositionality; distributional similarity;

机译：MWES;成语;搭配;合作;分配相似性;

相似文献

外文文献
中文文献
专利

1. Semantic Similarity Measures for the Generation of Science Tests in Basque [J] . Aldabe I., Maritxalar M. Learning Technologies, IEEE Transactions on . 2014,第4期

机译：巴斯克科学测试生成的语义相似性度量
2. A hybrid image similarity measure based on a new combination of different similarity techniques [J] . Nisreen Ryadh Hamza, Rasha Ail Dihin, Mohammed Hasan Abdulameer International Journal of Electrical and Computer Engineering . 2020,第2期

机译：一种基于不同相似性技术的新组合的混合图像相似度量
3. Comparison of Gene Expression Programming with neuro-fuzzy and neural network computing techniques in estimating daily incoming solar radiation in the Basque Country (Northern Spain) [J] . Gorka Landeras, Jose Javier Lopez, Ozgur Kisi, Energy Conversion & Management . 2012,第期

机译：将基因表达程序与神经模糊和神经网络计算技术进行比较，以估计巴斯克地区（西班牙北部）每天的太阳辐射
4. Measuring the compositionality of NV expressions in Basque by means of distributional similarity techniques [C] . Antton Gurrutxaga, Inaki Alegria International conference on language resources and evaluation . 2012

机译：通过分布相似技术测量巴斯克地区NV表达的组成
5. Measuring the similarity between sample data and continuous distributions. [D] . Glisic, Ranko. 2010

机译：测量样本数据和连续分布之间的相似性。
6. Measuring Distribution Similarities Between Samples: A Distribution-Free Overlapping Index [O] . Massimiliano Pastore, Antonio Calcagnì 2005

机译：测量样本之间的分布相似性：无分布重叠指数
7. Using Distributional Similarity of Multi-Way Translations to Predict Multiword Expression Compositionality [O] . Bahar Salehi, Paul Cook, Timothy Baldwin 2014

机译：利用多向翻译的分布相似性来预测多字表达的组合性

Measuring the compositionality of NV expressions in Basque by means of distributional similarity techniques

摘要

著录项

相似文献

相关主题

期刊订阅