Uncertain top-k query processing in distributed environments

Abstract

Top-k queries over uncertain data sets have been an active research topic in recent years, and many studies on uncertain top-k queries have been published. Unfortunately, most existing algorithms consider only centralized processing environments and are not suitable for large-scale data. This paper makes the first attempt to process probabilistic threshold top-k queries (PT-k for short, an important type of uncertain top-k query) in a distributed environment. We propose three efficient algorithms. The serial distributed approach adopts a new method, requiring only a small number of calculations, to process PT-k queries serially in distributed environments. The global sorting first algorithm for PT-k query processing (GSP) is designed to improve computation speed. In GSP, a distributed sorting operation is performed first, and the candidates for the PT-k query are then computed in parallel. The query results are computed with a novel incremental method that reduces the number of calculations. The local filtering first algorithm for PT-k query processing is designed to reduce network overhead. Specifically, several filtering strategies are proposed to filter out redundant data locally, and the incremental method from GSP is then used to process the PT-k query. Finally, the effectiveness of the proposed algorithms is verified through a series of experiments.
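
To make the query semantics concrete, the following is a minimal centralized sketch of a PT-k query, not any of the three distributed algorithms summarized above. It assumes independent uncertain tuples with distinct scores, each carrying an existence probability; the function name `pt_k` and the `(id, score, probability)` tuple layout are hypothetical and chosen only for illustration. A tuple is returned when the probability that it exists and fewer than k higher-scored tuples exist reaches the threshold.

```python
from typing import List, Tuple


def pt_k(tuples: List[Tuple[str, float, float]], k: int,
         threshold: float) -> List[Tuple[str, float]]:
    """Return (id, top-k probability) for tuples whose top-k probability
    is at least `threshold`.

    Assumptions (illustrative only): tuples are independent, scores are
    distinct, k >= 1, and each input tuple is (id, score, probability).
    """
    # Rank tuples by score, highest first.
    ranked = sorted(tuples, key=lambda t: t[1], reverse=True)

    # dist[j] = probability that exactly j of the tuples ranked above the
    # current one exist, tracked only for j = 0 .. k-1.
    dist = [1.0] + [0.0] * (k - 1)
    results = []

    for tuple_id, _score, prob in ranked:
        fewer_than_k_above = sum(dist)
        # This sum never increases as more tuples are folded in, so once it
        # drops below the threshold no later tuple can qualify.
        if fewer_than_k_above < threshold:
            break

        topk_prob = prob * fewer_than_k_above
        if topk_prob >= threshold:
            results.append((tuple_id, topk_prob))

        # Fold the current tuple into the distribution (truncated at k-1).
        # Iterating downward keeps dist[j - 1] at its old value when read.
        for j in range(k - 1, 0, -1):
            dist[j] = dist[j] * (1.0 - prob) + dist[j - 1] * prob
        dist[0] *= 1.0 - prob

    return results


if __name__ == "__main__":
    data = [("a", 10.0, 0.5), ("b", 9.0, 0.8), ("c", 8.0, 1.0)]
    for tuple_id, p in pt_k(data, k=2, threshold=0.55):
        print(tuple_id, round(p, 3))
```

The distribution is updated incrementally as each tuple is scanned rather than recomputed from scratch, which keeps the cost at O(nk) after sorting. This is a standard dynamic-programming formulation for independent tuples and is offered only as a baseline for understanding the query; the paper's incremental method and filtering strategies are not reproduced here.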

Bibliographic record

  • Source
    Distributed and Parallel Databases | 2016, Issue 4 | pp. 567-589 | 23 pages
  • Authors

    Wang Xite; Shen Derong; Yu Ge;

  • Author affiliations

    Northeastern Univ, Coll Informat Sci & Engn, Shenyang, Liaoning, Peoples R China;

    Northeastern Univ, Coll Informat Sci & Engn, Shenyang, Liaoning, Peoples R China;

    Northeastern Univ, Coll Informat Sci & Engn, Shenyang, Liaoning, Peoples R China;

  • Indexing information
  • Original format: PDF
  • Language of text: English
  • Chinese Library Classification
  • Keywords

    Uncertain; Top-k query; Distributed;
