首页> 外文期刊>Cognitive Systems Research >A study of metrics of distance and correlation between ranked lists for compositionality detection
【24h】

A study of metrics of distance and correlation between ranked lists for compositionality detection

机译:用于成分检测的排名列表之间的距离和相关性度量标准的研究

获取原文
获取原文并翻译 | 示例
           

摘要

Compositionality in language refers to how much the meaning of some phrase can be decomposed into the meaning of its constituents and the way these constituents are combined. Based on the premise that substitution by synonyms is meaning-preserving, compositionality can be approximated as the semantic similarity between a phrase and a version of that phrase where words have been replaced by their synonyms. Different ways of representing such phrases exist (e.g., vectors (Kiela and Clark, 2013) or language models (Lioma, Simonsen, Larsen, and Hansen, 2015)), and the choice of representation affects the measurement of semantic similarity. We propose a new compositionality detection method that represents phrases as ranked lists of term weights. Our method approximates the semantic similarity between two ranked list representations using a range of well-known distance and correlation metrics. In contrast to most state-of-the-art approaches in compositionality detection, our method is completely unsupervised. Experiments with a publicly available dataset of 1048 human-annotated phrases shows that, compared to strong supervised baselines, our approach provides superior measurement of compositionality using any of the distance and correlation metrics considered. (C) 2017 Elsevier B.V. All rights reserved.
机译:语言中的构成性是指某个短语的含义可以分解为其成分的含义以及这些成分的组合方式。基于同义词替换是保留含义的前提,可以将组成性近似为短语和该短语的版本之间的语义相似性,在该版本中,单词已被其同义词替换。存在表示这种短语的不同方式(例如向量(Kiela和Clark,2013)或语言模型(Lioma,Simonsen,Larsen和Hansen,2015)),并且表示的选择会影响语义相似度的度量。我们提出了一种新的成分检测方法,该方法将短语表示为术语权重的排序列表。我们的方法使用一系列众所周知的距离和相关度量来近似估计两个排名列表表示形式之间的语义相似性。与大多数最新的成分检测方法相反,我们的方法是完全不受监督的。在公开可用的1048个人类注释短语的数据集上进行的实验表明,与严格的监督基准相比,我们的方法可以使用所考虑的任何距离和相关性指标来提供出色的成分测量。 (C)2017 Elsevier B.V.保留所有权利。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号