...
首页> 外文期刊>Knowledge and information systems >Parallelizing skyline queries over uncertain data streams with sliding window partitioning and grid index
【24h】

Parallelizing skyline queries over uncertain data streams with sliding window partitioning and grid index

机译:带有滑动窗口划分和网格索引的不确定数据流的并行天际线查询

获取原文
获取原文并翻译 | 示例
           

摘要

Skyline query processing over uncertain data streams has attracted considerable attention in database community recently, due to its importance in helping users make intelligent decisions over complex data in many real applications. Although lots of recent efforts have been conducted to the skyline computation over data streams in a centralized environment typically with one processor, they cannot be well adapted to the skyline queries over complex uncertain streaming data, due to the computational complexity of the query and the limited processing capability. Furthermore, none of the existing studies on parallel skyline computation can effectively address the skyline query problem over uncertain data streams, as they are all developed to address the problem of parallel skyline queries over static certain data sets. In this paper, we formally define the parallel query problem over uncertain data streams with the sliding window streaming model. Particularly, for the first time, we propose an effective framework, named distributed parallel framework to address the problem based on the sliding window partitioning. Furthermore, we propose an efficient approach (parallel streaming skyline) to further optimize the parallel skyline computation with an optimized streaming item mapping strategy and the grid index. Extensive experiments with real deployment over synthetic and real data are conducted to demonstrate the effectiveness and efficiency of the proposed techniques.
机译:由于不确定性数据流的天际线查询处理在许多实际应用中帮助用户对复杂数据做出智能决策的重要性,因此最近在数据库界引起了广泛关注。尽管最近在集中式环境(通常使用一个处理器)中对数据流的天际线计算进行了许多最新努力,但是由于查询的计算复杂性和有限性,它们无法很好地适应复杂的不确定流数据的天际线查询处理能力。此外,关于并行天际线计算的现有研究都无法有效解决不确定数据流上的天际线查询问题,因为它们都是为了解决静态某些数据集上的并行天际线查询问题而开发的。在本文中,我们使用滑动窗口流模型正式定义了不确定数据流上的并行查询问题。特别是,我们首次提出了一种有效的框架,称为分布式并行框架,以解决基于滑动窗口划分的问题。此外,我们提出了一种有效的方法(并行流天际线),以优化的流项目映射策略和网格索引进一步优化并行天际线计算。进行了在合成和真实数据上进行实际部署的广泛实验,以证明所提出技术的有效性和效率。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号