Information content measures of semantic similarity between documents based on Hadoop system

Abstract

Retrieving documents in response to a user's query is the most common text retrieval task. Our work focuses on detecting the semantic similarity between queries and the documents in large document collections. In this paper, we investigate MapReduce as a framework for managing distributed processing of datasets and for computing semantic similarity measures between documents. We then survey the state of the art in approaches for computing the semantic similarity of documents. We propose an approach based on a parallel algorithm for semantic similarity measures, using MapReduce and WordNet, to detect the documents relevant to a query. Finally, we conduct basic experiments to assess the performance of the proposed approach and note the benefit that Hadoop and MapReduce bring to computing semantic similarity measures between documents.
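
The abstract does not spell out the algorithm, so the following is only a minimal sketch of how an information-content similarity could be arranged as map and reduce steps. It assumes the Resnik measure (information content of the most informative common ancestor), a tiny hand-made taxonomy standing in for WordNet, and made-up concept frequencies; the names PARENT, FREQ, map_document, reduce_scores and rank are hypothetical, and plain Python stands in for Hadoop.

```python
import math
from collections import defaultdict

# Hypothetical toy taxonomy (child -> parent), a stand-in for WordNet.
PARENT = {
    "dog": "mammal", "cat": "mammal", "mammal": "animal",
    "bird": "animal", "animal": "entity",
    "car": "vehicle", "vehicle": "entity",
}
# Hypothetical corpus frequencies for the leaf concepts.
FREQ = {"dog": 10, "cat": 10, "bird": 5, "car": 15}


def ancestors(concept):
    """Return the concept followed by all of its ancestors, nearest first."""
    chain = [concept]
    while chain[-1] in PARENT:
        chain.append(PARENT[chain[-1]])
    return chain


def information_content():
    """IC(c) = log(total / count(c)), where count(c) includes descendants."""
    cumulative = defaultdict(int)
    for concept, count in FREQ.items():
        for node in ancestors(concept):
            cumulative[node] += count
    total = cumulative["entity"]
    return {c: math.log(total / n) for c, n in cumulative.items()}


IC = information_content()


def resnik(a, b):
    """Resnik similarity: IC of the most informative common ancestor."""
    common = set(ancestors(a)) & set(ancestors(b))
    return max(IC[c] for c in common)


def map_document(doc_id, text, query_terms):
    """Map step: emit (doc_id, best similarity) for each known query term."""
    doc_terms = [w for w in text.lower().split() if w in IC]
    for q in query_terms:
        if q in IC and doc_terms:
            yield doc_id, max(resnik(q, d) for d in doc_terms)


def reduce_scores(doc_id, scores):
    """Reduce step: average the per-term scores of one document."""
    return doc_id, sum(scores) / len(scores)


def rank(documents, query):
    """Run map, group by key (the shuffle), reduce, then sort by score."""
    query_terms = query.lower().split()
    grouped = defaultdict(list)
    for doc_id, text in documents.items():
        for key, score in map_document(doc_id, text, query_terms):
            grouped[key].append(score)
    results = [reduce_scores(k, v) for k, v in grouped.items()]
    return sorted(results, key=lambda kv: kv[1], reverse=True)
```

Calling rank({"d1": "the cat sat", "d2": "a red car"}, "dog") returns the documents ordered by their averaged score. On a real cluster the map and reduce functions would run as separate tasks over document splits, and the taxonomy and frequencies would come from WordNet and a reference corpus rather than the toy tables assumed here.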


Bibliographic details

Source: International Conference on Wireless Networks and Mobile Communications | 2016 | 290p | 6 pages in total
Authors: Marouane Birjali; Abderrahim Beni-Hssane; Mohammed Erritali; Youness Madani
Original format: PDF
Chinese Library Classification: TN92-53
Keywords: Semantics; Ontologies; Servers; Indexing; Big data; Programming; Frequency measurement

