A Comparison of Co-occurrence and Similarity Measures as Simulations of Context


Abstract

Observations of word co-occurrences and similarity computations are often used as a straightforward way to represent the global contexts of words and thereby simulate semantic word similarity for applications such as word or document clustering and collocation extraction. Despite the simplicity of the underlying model, it is necessary to select an appropriate significance measure, similarity measure, and similarity computation algorithm. However, it is often unclear how these measures are related, and dimensionality reduction is often applied to make computing word similarity efficient. This work presents an approximate algorithm with linear time complexity for computing word similarity without any dimensionality reduction. It then introduces a large-scale evaluation based on two languages and two knowledge sources and discusses the underlying reasons for the relative performance of each measure.
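
To make the three choices named in the abstract concrete, the following minimal sketch builds sentence-level co-occurrence counts, weights them with a significance measure, and compares the resulting context vectors with a similarity measure. Pointwise mutual information and cosine similarity are assumptions chosen for illustration only; the abstract does not say which measures the paper evaluates. The function names and the toy corpus are likewise hypothetical, and this exhaustive computation is not the linear-time approximate algorithm the paper presents.

```python
import math
from collections import Counter
from itertools import combinations


def cooccurrence_counts(sentences):
    """Count word frequencies and sentence-level co-occurrences."""
    word_freq = Counter()
    pair_freq = Counter()
    for sentence in sentences:
        words = sorted(set(sentence))  # each word counted once per sentence
        word_freq.update(words)
        pair_freq.update(combinations(words, 2))
    return word_freq, pair_freq


def pmi_vectors(sentences):
    """Build one sparse context vector per word, weighted by positive PMI.

    PMI is an assumed significance measure, used here for illustration.
    """
    word_freq, pair_freq = cooccurrence_counts(sentences)
    n = len(sentences)
    vectors = {word: {} for word in word_freq}
    for (a, b), n_ab in pair_freq.items():
        pmi = math.log2(n_ab * n / (word_freq[a] * word_freq[b]))
        if pmi > 0:  # keep only co-occurrences above chance level
            vectors[a][b] = pmi
            vectors[b][a] = pmi
    return vectors


def cosine(u, v):
    """Cosine similarity of two sparse vectors stored as dicts."""
    dot = sum(weight * v[key] for key, weight in u.items() if key in v)
    norm_u = math.sqrt(sum(w * w for w in u.values()))
    norm_v = math.sqrt(sum(w * w for w in v.values()))
    if norm_u == 0 or norm_v == 0:
        return 0.0
    return dot / (norm_u * norm_v)


# Hypothetical toy corpus: one tokenized sentence per list.
sentences = [
    ["cat", "drinks", "milk"],
    ["dog", "drinks", "water"],
    ["cat", "chases", "mouse"],
    ["dog", "chases", "cat"],
    ["stock", "market", "falls"],
    ["market", "price", "falls"],
]
vectors = pmi_vectors(sentences)
print(cosine(vectors["cat"], vectors["dog"]))
print(cosine(vectors["cat"], vectors["market"]))
```

In this toy corpus "cat" and "dog" share a significant context word, so their similarity is positive, while "cat" and "market" share none. Comparing every pair of words this way scales quadratically with vocabulary size, which is the cost that dimensionality reduction or an approximate algorithm is meant to avoid.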


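Bibliographic record

Source: Computational Linguistics and Intelligent Text Processing, 2008, pp. 52-63 (12 pages)
Conference location: Haifa (IL)
Author: Stefan Bordag
Format of original: PDF
Language of text: English
CLC classification: Programming languages, algorithmic languages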

