首页> 外文OA文献 >TPA: Fast, Scalable, and Accurate Method for Approximate Random Walk with Restart on Billion Scale Graphs
【2h】

TPA: Fast, Scalable, and Accurate Method for Approximate Random Walk with Restart on Billion Scale Graphs

机译:TPA:近似随机散步的快速,可扩展和准确的方法,用亿尺寸图形重启

代理获取
本网站仅为用户提供外文OA文献查询和代理获取服务,本网站没有原文。下单后我们将采用程序或人工为您竭诚获取高质量的原文,但由于OA文献来源多样且变更频繁,仍可能出现获取不到、文献不完整或与标题不符等情况,如果获取不到我们将提供退款服务。请知悉。

摘要

Given a large graph, how can we determine similarity between nodes in a fastand accurate way? Random walk with restart (RWR) is a popular measure for thispurpose and has been exploited in numerous data mining applications includingranking, anomaly detection, link prediction, and community detection. However,previous methods for computing exact RWR require prohibitive storage sizes andcomputational costs, and alternative methods which avoid such costs bycomputing approximate RWR have limited accuracy. In this paper, we propose TPA,a fast, scalable, and highly accurate method for computing approximate RWR onlarge graphs. TPA exploits two important properties in RWR: 1) nodes close to aseed node are likely to be revisited in following steps due to block-wisestructure of many real-world graphs, and 2) RWR scores of nodes which residefar from the seed node are proportional to their PageRank scores. Based onthese two properties, TPA divides approximate RWR problem into two subproblemscalled neighbor approximation and stranger approximation. In the neighborapproximation, TPA estimates RWR scores of nodes close to the seed based onscores of few early steps from the seed. In the stranger approximation, TPAestimates RWR scores for nodes far from the seed using their PageRank. Thestranger and neighbor approximations are conducted in the preprocessing phaseand the online phase, respectively. Through extensive experiments, we show thatTPA requires up to 3.5x less time with up to 40x less memory space than otherstate-of-the-art methods for the preprocessing phase. In the online phase, TPAcomputes approximate RWR up to 30x faster than existing methods whilemaintaining high accuracy.
机译:鉴于大图形,我们如何以快速准确的方式确定节点之间的相似性?随机散步与重启(RWR)是一个流行的本文衡量标准,并且已经在许多数据挖掘应用程序中被利用,包括破坏,异​​常检测,链路预测和社区检测。然而,先前用于计算精确RWR的方法需要禁止的存储大小和计算机成本,以及避免近似RWR的替代成本的替代方法具有有限的精度。在本文中,我们提出了TPA,快速,可扩展,高度准确的方法来计算近似RWR onlarge图。 TPA利用RWR:1)在靠近ASEED节点的节点中的两个重要属性可能会在以下步骤中被重新审视,因为许多真实世界图的块衰老,以及2)从种子节点驻留的节点的RWR分数是比例他们的PageRank分数。基于两个属性,TPA将近似RWR问题划分为两个子问题的邻居近似和陌生人近似。在邻近的邻近时,TPA估计靠近种子的种子的节点的RWR分数来自种子的几个早期步骤的种子。在陌生人近似下,TPAESTIMATES用于使用PageRank远离种子的节点的RWR分数。分别在Phaseand的在线阶段进行TheStranger和邻居近似。通过广泛的实验,我们显示TheTPA需要较少3.5倍的时间,内存空间多达40倍,而不是用于预处理阶段的最新方法。在在线阶段,TPACompuS比现有方法近似30倍的近似RWR,均高精度。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
代理获取

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号