【24h】

On random walk based graph sampling

机译:基于随机漫步的图形抽样

获取原文

摘要

Random walk based graph sampling has been recognized as a fundamental technique to collect uniform node samples from a large graph. In this paper, we first present a comprehensive analysis of the drawbacks of three widely-used random walk based graph sampling algorithms, called re-weighted random walk (RW) algorithm, Metropolis-Hastings random walk (MH) algorithm and maximum-degree random walk (MD) algorithm. Then, to address the limitations of these algorithms, we propose two general random walk based algorithms, named rejection-controlled Metropolis-Hastings (RCMH) algorithm and generalized maximum-degree random walk (GMD) algorithm. We show that RCMH balances the tradeoff between the limitations of RW and MH, and GMD balances the tradeoff between the drawbacks of RW and MD. To further improve the performance of our algorithms, we integrate the so-called delayed acceptance technique and the non-backtracking random walk technique into RCMH and GMD respectively. We conduct extensive experiments over four real-world datasets, and the results demonstrate the effectiveness of the proposed algorithms.
机译:基于随机漫游的图形采样被识别为从大图中收集统一节点样本的基本技术。在本文中,我们首先对三种广泛使用的随机播放的曲线图采样算法进行了全面的分析,称为重加权随机步行(RW)算法,Metropolis-Hastings随机步行(MH)算法和最大程度步行(MD)算法。然后,为了解决这些算法的局限性,我们提出了两个基于一般的随机播放算法,命名为拒绝控制的Metropolis-Hastings(RCMH)算法和广义最大程度随机步行(GMD)算法。我们展示了RCMH平衡了RW和MH的局限性之间的权衡,GMD平衡了RW和MD缺点之间的权衡。为了进一步提高我们的算法的性能,我们将所谓的延迟验收技术和非返回随机步行技术分别集成到RCMH和GMD中。我们对四个现实世界数据集进行了广泛的实验,结果证明了所提出的算法的有效性。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号