首页> 外文期刊>ACM transactions on database systems >Efficient Processing of Skyline-Join Queries over Multiple Data Sources
【24h】

Efficient Processing of Skyline-Join Queries over Multiple Data Sources

机译:有效处理多个数据源上的天际线联接查询

获取原文
获取原文并翻译 | 示例
       

摘要

Efficient processing of skyline queries has been an area of growing interest. Many of the earlier skyline techniques assumed that the skyline query is applied to a single data table. Naturally, these algorithms were not suitable for many applications in which the skyline query may involve attributes belonging to multiple data sources. In other words, if the data used in the skyline query are stored in multiple tables, then join operations would be required before the skyline can be searched. The task of computing skylines on multiple data sources has been coined as the skyline-join problem and various skyline-join algorithms have been proposed. However, the current proposals suffer several drawbacks: they often need to scan the input tables exhaustively in order to obtain the set of skyline-join results; moreover, the pruning techniques employed to eliminate the tuples are largely based on expensive pairwise tuple-to-tuple comparisons. In this article, we aim to address these shortcomings by proposing two novel skyline-join algorithms, namely skyline-sensitive join (S(2)J) and symmetric skyline-sensitive join (S(3)J), to process skyline queries over two data sources. Our approaches compute the results using a novel layer/region pruning technique (LR-pruning) that prunes the join space in blocks as opposed to individual data points, thereby avoiding excessive pairwise point-to-point dominance checks. Furthermore, the S(3)J algorithm utilizes an early stopping condition in order to successfully compute the skyline results by accessing only a subset of the input tables. In addition to S(2)J and S(3)J, we also propose the S(2)J-M and S(3)J-M algorithms. These algorithms extend S(2)J's and S(3)J's two-way skyline-join ability to efficiently process skyline-join queries over more than two data sources. S(2)J-M and S(3)J-M leverage the extended concept of LR-pruning, called M-way LR-pruning, to compute multi-way skyline-joins in which more than two data sources are integrated during skyline processing. We report extensive experimental results that confirm the advantages of the proposed algorithms over state-of-the-art skyline-join techniques.
机译:高效处理天际线查询已成为人们日益关注的领域。许多早期的天际线技术都假定将天际线查询应用于单个数据表。自然,这些算法不适用于许多应用程序,在这些应用程序中,天际线查询可能涉及属于多个数据源的属性。换句话说,如果在天际线查询中使用的数据存储在多个表中,则在可以搜索天际线之前将需要进行联接操作。在天际线连接问题中提出了在多个数据源上计算天际线的任务,并提出了各种天际线连接算法。但是,当前的建议有几个缺点:他们常常需要详尽地扫描输入表以获得一组“天际线连接”结果;此外,用于消除元组的修剪技术很大程度上基于昂贵的成对元组与元组之间的比较。在本文中,我们旨在通过提出两种新颖的天际线联接算法(即天际线敏感联接(S(2)J)和对称天际线敏感联接(S(3)J))来解决这些缺点,以处理两个数据源。我们的方法使用一种新颖的层/区域修剪技术(LR修剪)来计算结果,该技术修剪块中与单个数据点相对的​​连接空间,从而避免了过多的逐对点对点优势检查。此外,S(3)J算法利用早期停止条件,以便仅访问输入表的一个子集来成功计算天际线结果。除了S(2)J和S(3)J,我们还提出了S(2)J-M和S(3)J-M算法。这些算法扩展了S(2)J和S(3)J的双向天际线联接功能,可以有效地处理两个以上数据源上的天际线联接查询。 S(2)J-M和S(3)J-M利用LR修剪的扩展概念(称为M路径LR修剪)来计算多路天际线连接,其中在天际线处理期间集成了两个以上的数据源。我们报告了广泛的实验结果,这些结果证实了拟议算法相对于最新的天际线连接技术的优势。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号