首页> 外文会议>IEEE international conference on data engineering >Efficient and accurate query evaluation on uncertain graphs via recursive stratified sampling
【24h】

Efficient and accurate query evaluation on uncertain graphs via recursive stratified sampling

机译:通过递归分层采样对不确定图的高效和准确的查询评估

获取原文

摘要

In this paper, we introduce two types of query evaluation problems on uncertain graphs: expectation query evaluation and threshold query evaluation. Since these two problems are #P-complete, most previous solutions for these problems are based on naive Monte-Carlo (NMC) sampling. However, NMC typically leads to a large variance, which significantly reduces its effectiveness. To overcome this problem, we propose two classes of estimators, called class-I and class-II estimators, based on the idea of stratified sampling. More specifically, we first propose two classes of basic stratified sampling estimators, named BSS-I and BSS-II, which partition the entire population into 2r and r+1 strata by picking r edges respectively. Second, to reduce the variance, we find that both BSS-I and BSS-II can be recursively performed in each stratum. Therefore, we propose two classes of recursive stratified sampling estimators called RSS-I and RSS-II respectively. Third, for a particular kind of problem, we propose two cut-set based stratified sampling estimators, named BCSS and RCSS, to further improve the accuracy of the class-I and class-II estimators. For all the proposed estimators, we prove that they are unbiased and their variances are significantly smaller than that of NMC. Moreover, the time complexity of all the proposed estimators are the same as the time complexity of NMC under a mild assumption. In addition, we also apply the proposed estimators to influence function evaluation and expected-reliable distance query problem, which are two instances of the query evaluation problems on uncertain graphs. Finally, we conduct extensive experiments to evaluate our estimators, and the results demonstrate the efficiency, accuracy, and scalability of the proposed estimators.
机译:在本文中,我们在不确定图中介绍了两种类型的查询评估问题:期望查询评估和阈值查询评估。由于这两个问题是#P-Treminity,最先前的这些问题的解决方案基于天真的Monte-Carlo(NMC)采样。然而,NMC通常导致大方差,这显着降低了其有效性。为了克服这个问题,我们提出了两类估计,称为I类和II类估计人,基于分层抽样的想法。更具体地说,我们首先提出了两类基本分层采样估计器,命名为BSS-I和BSS-II,将整个人群分别将整个人口分别分别分别将整个人群分别挑选为2 R 和R + 1个层次。其次,为了减少方差,我们发现BSS-I和BSS-II都可以在每个层中递归地进行。因此,我们提出了两类称为RSS-I和RSS-II的递归分层采样估计。第三,对于特定的问题,我们提出了两个基于切割的分层采样估算器,命名为BCS和RCSS,以进一步提高II类和II类估计的准确性。对于所有提议的估算者,我们证明它们是无偏见的,它们的差异显着小于NMC。此外,所有提议估计器的时间复杂度与温和假设下NMC的时间复杂度相同。此外,我们还应用所提出的估计,以影响函数评估和预期可靠的远程查询问题,这是不确定图中查询评估问题的两个实例。最后,我们进行了广泛的实验来评估我们的估计,结果表明了所提出的估算者的效率,准确性和可扩展性。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号