Language Resources and Evaluation

Cross level semantic similarity: an evaluation framework for universal measures of similarity


Abstract

Semantic similarity has typically been measured across items of approximately similar sizes. As a result, similarity measures have largely ignored the fact that different types of linguistic item can potentially have similar or even identical meanings, and are therefore designed to compare only one type of linguistic item. Furthermore, nearly all current similarity benchmarks within NLP contain pairs of approximately the same size, such as word or sentence pairs, preventing the evaluation of methods that are capable of comparing items of different sizes. To address this, we introduce a new semantic evaluation called cross-level semantic similarity (CLSS), which measures the degree to which the meaning of a larger linguistic item, such as a paragraph, is captured by a smaller item, such as a sentence. Our pilot CLSS task was presented as part of SemEval-2014, where it attracted 19 teams who submitted 38 systems. The CLSS data contains a rich mixture of pairs, spanning from paragraphs down to word senses, in order to fully evaluate similarity measures that are capable of comparing items of any type. Moreover, the data were drawn from diverse corpora beyond just newswire, including domain-specific texts and social media. We describe the annotation process and its challenges, including a comparison with crowdsourcing, and identify the factors that make the dataset a rigorous assessment of a method's quality. We also examine in detail the systems participating in the SemEval task to identify the common factors associated with high performance and the aspects that proved difficult for all systems. Our findings demonstrate that CLSS poses a significant challenge for similarity methods and provides clear directions for future work on universal similarity methods that can compare any pair of items.
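To make the task setup concrete, the sketch below illustrates the input-output contract a CLSS system must satisfy: given a larger linguistic item (e.g. a paragraph) and a smaller one (e.g. a sentence), it returns a graded similarity score. The bag-of-words cosine used here is only a minimal illustrative baseline, not a method from the paper or from any participating system; the function name clss_score is hypothetical.

import math
from collections import Counter

def clss_score(larger: str, smaller: str) -> float:
    """Hypothetical CLSS baseline: graded similarity between a larger
    linguistic item (e.g. a paragraph) and a smaller one (e.g. a sentence),
    approximated here with a simple bag-of-words cosine in [0, 1]."""
    a = Counter(larger.lower().split())
    b = Counter(smaller.lower().split())
    # Dot product over the shared vocabulary of the two items
    dot = sum(a[w] * b[w] for w in set(a) & set(b))
    norm = (math.sqrt(sum(v * v for v in a.values()))
            * math.sqrt(sum(v * v for v in b.values())))
    return dot / norm if norm else 0.0

paragraph = ("Semantic similarity has typically been measured across items "
             "of approximately similar sizes, such as pairs of words or "
             "pairs of sentences.")
sentence = "Similarity is usually measured between items of the same size."
print(f"{clss_score(paragraph, sentence):.3f}")  # graded score, not binary

A real system from the SemEval task would replace this crude lexical overlap with richer lexical, syntactic, and knowledge-based representations; the sketch only fixes the shape of the problem, namely that the two inputs may be of very different sizes and the output is a graded score rather than a binary judgment.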