...
首页> 外文期刊>Computational Social Networks >Revisiting random walk based sampling in networks: evasion of burn-in period and frequent regenerations
【24h】

Revisiting random walk based sampling in networks: evasion of burn-in period and frequent regenerations

机译:回顾网络中基于随机游走的采样:规避老化时间和频繁的再生

获取原文

摘要

Abstract Background In the framework of network sampling, random walk (RW) based estimation techniques provide many pragmatic solutions while uncovering the unknown network as little as possible. Despite several theoretical advances in this area, RW based sampling techniques usually make a strong assumption that the samples are in stationary regime, and hence are impelled to leave out the samples collected during the burn-in period. Methods This work proposes two sampling schemes without burn-in time constraint to estimate the average of an arbitrary function defined on the network nodes, for example, the average age of users in a social network. The central idea of the algorithms lies in exploiting regeneration of RWs at revisits to an aggregated super-node or to a set of nodes, and in strategies to enhance the frequency of such regenerations either by contracting the graph or by making the hitting set larger. Our first algorithm, which is based on reinforcement learning (RL), uses stochastic approximation to derive an estimator. This method can be seen as intermediate between purely stochastic Markov chain Monte Carlo iterations and deterministic relative value iterations. The second algorithm, which we call the Ratio with Tours (RT)-estimator, is a modified form of respondent-driven sampling (RDS) that accommodates the idea of regeneration. Results We study the methods via simulations on real networks. We observe that the trajectories of RL-estimator are much more stable than those of standard random walk based estimation procedures, and its error performance is comparable to that of respondent-driven sampling (RDS) which has a smaller asymptotic variance than many other estimators. Simulation studies also show that the mean squared error of RT-estimator decays much faster than that of RDS with time. Conclusion The newly developed RW based estimators (RL- and RT-estimators) allow to avoid burn-in period, provide better control of stability along the sample path, and overall reduce the estimation time. Our estimators can be applied in social and complex networks.
机译:摘要背景在网络采样的框架中,基于随机游走(RW)的估计技术可提供许多实用的解决方案,同时尽可能少地发现未知网络。尽管在该领域有一些理论上的进步,但是基于RW的采样技术通常会强有力地假定样品处于静止状态,因此被迫省去了在老化期间收集的样品。方法这项工作提出了两个没有老化时间限制的采样方案,以估计网络节点上定义的任意功能的平均值,例如,社交网络中用户的平均年龄。该算法的中心思想在于,在重访时将RW的再生利用到聚合的超级节点或一组节点上,以及通过收缩图或使击中集合更大来提高此类再生频率的策略。我们的第一个基于强化学习(RL)的算法,使用随机逼近来得出估计量。该方法可以看作是纯随机马尔可夫链蒙特卡洛迭代和确定性相对值迭代之间的中间方法。第二种算法,我们称为“比率与游览”(RT)估计器,是响应者驱动采样(RDS)的一种改进形式,它适应了再生的思想。结果我们通过在真实网络上的仿真研究了这些方法。我们观察到,RL估计器的轨迹比基于标准随机游走的估计程序的轨迹要稳定得多,并且其误差性能与响应驱动抽样(RDS)相比,后者的渐近方差比许多其他估计器要小。仿真研究还表明,RT估计器的均方误差随时间的衰减比RDS快得多。结论新开发的基于RW的估算器(RL和RT估算器)可以避免老化期,可以更好地控制沿采样路径的稳定性,并总体上减少了估算时间。我们的估算器可以应用于社交网络和复杂网络。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号