Semantic similarity measurement using historical google search patterns

Jorge Martinez-Gil; José F. Aldana-Montes

首页> 外文期刊>Information Systems Frontiers >Semantic similarity measurement using historical google search patterns

【24h】

Semantic similarity measurement using historical google search patterns

机译：使用历史Google搜索模式进行语义相似度测量

获取原文

获取原文并翻译 | 示例

掌桥外文数据库（机构版） >>

开具论文收录证明 >>

文献代查 >>

页面导航

摘要
著录项
相似文献
相关主题

摘要

Computing the semantic similarity between terms (or short text expressions) that have the same meaning but which are not lexicographically similar is an important challenge in the information integration field. The problem is that techniques for textual semantic similarity measurement often fail to deal with words not covered by synonym dictionaries. In this paper, we try to solve this problem by determining the semantic similarity for terms using the knowledge inherent in the search history logs from the Google search engine. To do this, we have designed and evaluated four algorithmic methods for measuring the semantic similarity between terms using their associated history search patterns. These algorithmic methods are: a) frequent co-occurrence of terms in search patterns, b) computation of the relationship between search patterns, c) outlier coincidence on search patterns, and d) forecasting comparisons. We have shown experimentally that some of these methods correlate well with respect to human judgment when evaluating general purpose benchmark datasets, and significantly outperform existing methods when evaluating datasets containing terms that do not usually appear in dictionaries.

机译：在信息集成领域，计算具有相同含义但在字典上不相似的术语（或短文本表达）之间的语义相似性是一项重要的挑战。问题在于，用于文本语义相似性度量的技术通常无法处理同义词词典未涵盖的单词。在本文中，我们尝试通过使用Google搜索引擎的搜索历史日志中固有的知识确定术语的语义相似性来解决此问题。为此，我们设计并评估了四种算法方法，用于使用相关的历史搜索模式来测量术语之间的语义相似性。这些算法方法是：a）搜索模式中词语的频繁共现； b）搜索模式之间关系的计算； c）搜索模式中的异常一致；以及d）预测比较。我们已经通过实验表明，其中一些方法在评估通用基准数据集时与人类判断相关性很好，并且在评估包含通常不出现在词典中的术语的数据集时，其性能明显优于现有方法。

著录项

来源
《Information Systems Frontiers》 |2013年第3期|399-410|共12页
作者
Jorge Martinez-Gil; José F. Aldana-Montes;
展开▼
作者单位

Department of Computer Science University of Malaga">(1);

Department of Computer Science University of Malaga">(1);

展开▼
收录信息
原文格式 PDF
正文语种 eng
中图分类
关键词
Information integration; Web Intelligence; Semantic similarity;

机译：信息整合;网络情报;语义相似度;

相似文献

外文文献
中文文献
专利

1. Semantic similarity measurement using historical google search patterns [J] . Jorge Martinez-Gil, Jose F. Aldana-Montes Information systems frontiers . 2013,第3期

机译：使用历史Google搜索模式进行语义相似度测量
2. Google vs. the Library (Part Ⅱ): Student Search Patterns and Behaviors when Using Google and a Federated Search Tool [J] . Helen Georgas Portal:libraries and the academy . 2014,第4期

机译：Google与图书馆（第二部分）：使用Google和联合搜索工具时的学生搜索模式和行为
3. Web Search Engine Based Semantic Similarity Measure Between Words Using Pattern Retrieval Algorithm [J] . Pushpa C N, Thriveni J, Venugopal K R, Computer Science & Information Technology . 2013,第1期

机译：基于网络搜索引擎的语义相似性测量，使用模式检索算法
4. An Empirical Evaluation on Semantic Search Performance of Keyword-Based and Semantic Search Engines: Google, Yahoo, Msn and Hakia [C] . Tumer D., Shah M.A., Bitirim Y. Internet Monitoring and Protection, 2009. ICIMP '09 . 2009

机译：基于关键字和语义搜索引擎的语义搜索性能的实证评估：Google，Yahoo，Msn和Hakia
5. Semantic similarity measurement [D] . Yang, Beibei 2009

机译：语义相似度测量
6. Similarity of fMRI Activity Patterns in Left Perirhinal Cortex Reflects Semantic Similarity between Words [O] . Rose Bruffaerts, Patrick Dupont, Ronald Peeters, 2013

机译：左周围皮层功能磁共振成像活动模式的相似性反映了单词之间的语义相似性
7. Semantic similarity measurement using historical google search patterns [O] . Martinez-Gil, Jorge, Aldana Montes, José, 2013

机译：使用历史Google搜索模式进行语义相似度测量

Semantic similarity measurement using historical google search patterns

摘要

著录项

相似文献

相关主题

期刊订阅