Conference: International Conference on Wireless Networks and Mobile Communications

Information content measures of semantic similarity between documents based on Hadoop system



Abstract

Retrieving documents in response to a user's query is the most common text retrieval task. In this work, we focus on detecting the semantic similarity between queries and the documents in a large document collection. We investigate MapReduce as a framework for managing the distributed processing of datasets and of semantic similarity measures between documents. We then survey the state of the art in approaches for computing the semantic similarity of documents. We propose an approach based on a parallel algorithm for semantic similarity measures, using MapReduce and WordNet, to detect the documents relevant to a query. Finally, we conduct basic experiments to assess the performance of the proposed approach and note the benefit that Hadoop and MapReduce bring to semantic similarity measures between documents.
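
The abstract does not give the algorithm itself, so the following is only a minimal sketch of how an information-content similarity measure might be combined with a map and reduce step. It makes several assumptions that are not from the paper: a tiny hand-written taxonomy (PARENT) stands in for WordNet, the concept counts (COUNTS) are invented, the Lin measure is used as one example of an information-content measure, and the map, shuffle and reduce phases run in a single Python process rather than on Hadoop.

```python
import math
from collections import defaultdict

# Hypothetical toy taxonomy (child -> parent); stands in for WordNet hypernym links.
PARENT = {
    "entity": None,
    "animal": "entity",
    "vehicle": "entity",
    "dog": "animal",
    "cat": "animal",
    "car": "vehicle",
}

# Hypothetical corpus counts per concept (the concept's own occurrences only).
COUNTS = {"entity": 0, "animal": 2, "vehicle": 1, "dog": 5, "cat": 4, "car": 8}


def ancestors(concept):
    """Return the path from a concept up to the root, concept included."""
    path = []
    while concept is not None:
        path.append(concept)
        concept = PARENT[concept]
    return path


def information_content():
    """IC(c) = log(total / n(c)), where n(c) counts c and all its descendants."""
    cumulative = defaultdict(int)
    for concept, count in COUNTS.items():
        for node in ancestors(concept):
            cumulative[node] += count
    total = cumulative["entity"]
    return {c: math.log(total / n) for c, n in cumulative.items()}


IC = information_content()


def lin_similarity(a, b):
    """Lin measure: 2 * IC(lcs) / (IC(a) + IC(b)); 0.0 for unknown words."""
    if a not in PARENT or b not in PARENT:
        return 0.0
    if a == b:
        return 1.0
    b_path = set(ancestors(b))
    # First ancestor of a that is also an ancestor of b: the lowest common subsumer.
    lcs = next(node for node in ancestors(a) if node in b_path)
    denom = IC[a] + IC[b]
    return 2 * IC[lcs] / denom if denom > 0 else 0.0


def map_phase(doc_id, text, query_terms):
    """Map: for each query term, emit (doc_id, best similarity to any document word)."""
    words = text.lower().split()
    for term in query_terms:
        yield doc_id, max((lin_similarity(term, w) for w in words), default=0.0)


def reduce_phase(doc_id, scores):
    """Reduce: average the per-term scores for one document."""
    scores = list(scores)
    return doc_id, sum(scores) / len(scores)


def rank_documents(documents, query):
    """Run map, group by key (the shuffle step), then reduce and sort by score."""
    query_terms = query.lower().split()
    grouped = defaultdict(list)
    for doc_id, text in documents.items():
        for key, score in map_phase(doc_id, text, query_terms):
            grouped[key].append(score)
    results = [reduce_phase(key, scores) for key, scores in grouped.items()]
    return sorted(results, key=lambda item: item[1], reverse=True)


if __name__ == "__main__":
    docs = {"d1": "dog cat", "d2": "car"}
    for doc_id, score in rank_documents(docs, "cat"):
        print(doc_id, round(score, 3))
```

In this sketch each document is scored independently in the map step, which is the property that would let the work be spread across nodes; on a real cluster the grouping done here by a dictionary would be handled by the framework's shuffle.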
