首页> 外文会议>IEEE International Congress on Big Data >Predicted max degree sampling: Sampling in directed networks to maximize node coverage through crawling
【24h】

Predicted max degree sampling: Sampling in directed networks to maximize node coverage through crawling

机译:预测的最大程度采样:通过爬网来采样定向网络以最大化节点覆盖范围

获取原文

摘要

Sampling through crawling is an important research topic in social network analysis. However there is very little existing work on sampling through crawling in directed networks. In this paper we present a new method of sampling a directed network, with the objective of maximizing the node coverage. Our proposed method, Predicted Max Degree (PMD) Sampling, works by predicting which k open nodes are most likely to have the highest number of unobserved neighbors in a particular iteration. These nodes are queried, and the whole process repeats until all the available budget has been used up. We compared PMD against three baseline algorithms with three networks, and saw large improvements vs. baseline sampling algorithms: With a budget of 2000, PMD found 15%, 87.4% and 170.2% more nodes than the closest baseline algorithm in the wiki-Votes, soc-Slashdot and webGoogle networks respectively.
机译:通过爬行抽样是社会网络分析中的一个重要研究主题。 然而,通过在定向网络中爬行,对采样的现有工作很少。 在本文中,我们介绍了一种采样定向网络的新方法,其目的是最大化节点覆盖范围。 我们所提出的方法,预测最大程度(PMD)采样,通过预测哪个K开放节点最有可能在特定迭代中具有最高数量的未观察到的邻居。 查询这些节点,并且整个过程重复,直到所有可用预算已用完。 我们将PMD与三个网络的三个基线算法进行了比较,并且看到了大量改进与基线采样算法:预算2000年,PMD发现15%,87.4%和170.2%的节点,而不是维基投票中最近的基线算法, SOC-SLASHDOT和WebGoogle网络。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号