首页> 外文会议>IEEE international conference on data engineering >Fast top-k path-based relevance query on massive graphs
【24h】

Fast top-k path-based relevance query on massive graphs

机译:大规模图上基于top-k路径的快速相关性查询

获取原文

摘要

The task of obtaining the items highly-relevant to a given set of query items is a basis for various applications, such as recommendation and prediction. A family of path-based relevance metrics, which quantify item relevance based on the paths in a given item graph, have been shown to be effective in capturing the relevance in many applications. Despite their effectiveness, path-based relevance normally requires time-consuming iterative computation. We propose an approach to obtain the top-k most relevant items for a given query item set quickly. Our approach can obtain the top-k items without having to compute converged scores. The approach is designed for a distributed environment, which makes it scale for massive graphs having hundreds of millions of nodes. Our experimental results show that the proposed approach can produce the result 20 to 50 times faster than a previously proposed approach and can scale well with both the size of input and the number of machines used in the computation.
机译:获得与给定查询项目集高度相关的项目的任务是各种应用程序(例如推荐和预测)的基础。已经显示了一系列基于路径的相关性度量,这些度量基于给定项目图中的路径来量化项目相关性,在许多应用程序中可以有效地捕获相关性。尽管具有有效性,基于路径的相关性通常需要耗时的迭代计算。我们提出了一种快速获取给定查询项集的前k个最相关项的方法。我们的方法无需计算融合分数即可获得前k个项目。该方法是为分布式环境设计的,这使其可以扩展到具有数亿个节点的大量图形。我们的实验结果表明,所提出的方法所产生的结果比先前提出的方法快20至50倍,并且可以在输入大小和计算中使用的机器数量上很好地扩展。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号