A Comparison of Co-occurrence and Similarity Measures as Simulations of Context


Abstract

Observations of word co-occurrences and similarity computations are often used as a straightforward way to represent the global contexts of words and thereby simulate semantic word similarity for applications such as word or document clustering and collocation extraction. Despite the simplicity of the underlying model, it is necessary to select a proper significance measure, a similarity measure, and a similarity computation algorithm. However, it is often unclear how the measures are related, and dimensionality reduction is often applied in addition to make computing word similarity efficient. This work presents an approximate algorithm with linear time complexity for computing word similarity without any dimensionality reduction. It then introduces a large-scale evaluation based on two languages and two knowledge sources and discusses the underlying reasons for the relative performance of each measure.
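
The general pipeline the abstract refers to (count co-occurrences, weight them with a significance measure, compare the resulting context vectors with a similarity measure) can be sketched as follows. This is only an illustrative sketch, not the paper's algorithm: the abstract does not name specific measures, so positive pointwise mutual information and cosine similarity are assumed here, sentence-level co-occurrence is assumed as the context window, and all function names and the toy corpus are hypothetical.

```python
import math
from collections import Counter
from itertools import combinations


def cooccurrence_counts(sentences):
    """Count, per word and per word pair, the number of sentences containing them."""
    word_freq = Counter()
    pair_freq = Counter()
    for sentence in sentences:
        words = set(sentence.lower().split())
        word_freq.update(words)
        # Sorting makes every pair key canonical: (a, b) with a < b.
        for a, b in combinations(sorted(words), 2):
            pair_freq[(a, b)] += 1
    return word_freq, pair_freq


def pmi_vectors(word_freq, pair_freq, n_sentences):
    """Build sparse context vectors weighted by positive PMI (assumed measure)."""
    vectors = {w: {} for w in word_freq}
    for (a, b), n_ab in pair_freq.items():
        pmi = math.log2(n_ab * n_sentences / (word_freq[a] * word_freq[b]))
        if pmi > 0:  # keep only co-occurrences that are more frequent than chance
            vectors[a][b] = pmi
            vectors[b][a] = pmi
    return vectors


def cosine(u, v):
    """Cosine similarity between two sparse vectors stored as dicts."""
    dot = sum(u[k] * v[k] for k in set(u) & set(v))
    norm = math.sqrt(sum(x * x for x in u.values())) * math.sqrt(
        sum(x * x for x in v.values())
    )
    return dot / norm if norm else 0.0


# Hypothetical toy corpus, for illustration only.
sentences = [
    "the cat drinks milk",
    "the dog drinks water",
    "the cat chases the mouse",
    "the dog chases the cat",
]
word_freq, pair_freq = cooccurrence_counts(sentences)
vectors = pmi_vectors(word_freq, pair_freq, len(sentences))
print(cosine(vectors["milk"], vectors["mouse"]))
```

Comparing every pair of words this way is quadratic in the vocabulary size, which is the cost that dimensionality reduction or an approximate algorithm such as the one the abstract announces is meant to avoid.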


Bibliographic details

Source: International Conference on Computational Linguistics and Intelligent Text Processing, 2008, 12 pages
Author: Stefan Bordag
Original format: PDF
Chinese Library Classification: TP3-53

Related literature

1. Herb Leaves Recognition using Gray Level Co-occurrence Matrix and Five Distance-based Similarity Measures [J]. R. Rizal Isnanto, Munawar Agus Riyadi, Muhammad Fahmi Awaj. International Journal of Electrical and Computer Engineering. 2018, No. 3

2. A New Geographic Context Measure to Similarity Assessment Based on the Shape Context Descriptor [J]. E. M. A. Xavier, M. A. Ureña-Cámara, F. J. Ariza-López. International Archives of the Photogrammetry, Remote Sensing and Spatial Information Sciences. 2020, No. 4

3. Context-awareness in similarity measures and pattern discoveries of trajectories: a context-based dynamic time warping method [J]. Sharif Mohammad, Alesheikh Ali Asghar. GIScience & Remote Sensing. 2017, No. 3

4. A Comparison of Co-occurrence and Similarity Measures as Simulations of Context [C]. Stefan Bordag. Computational Linguistics and Intelligent Text Processing. 2008

5. Query-Focused Extractive Summarization Based on Deep Learning: Comparison of Similarity Measures for Pseudo Ground Truth Generation [D]. Yuliska. 2019

6. Comparison of simulations with PHITS and HIBRAC with experimental data in the context of particle therapy monitoring [O]. Heide Rohling, Lembit Sihver, Marlen Priegnitz, 2014

7. A Comparison of Co-occurrence and Similarity Measures as Simulations of Context [O]. Stefan Bordag. 2008
