Efficient Processing of Skyline-Join Queries over Multiple Data Sources

Nagendra Mithila; Candan K. Selcuk

首页> 外文期刊>ACM transactions on database systems >Efficient Processing of Skyline-Join Queries over Multiple Data Sources

【24h】

Efficient Processing of Skyline-Join Queries over Multiple Data Sources

机译：有效处理多个数据源上的天际线联接查询

获取原文

获取原文并翻译 | 示例

开具论文收录证明 >>

页面导航

摘要
著录项
引文网络
相似文献
相关主题

摘要

Efficient processing of skyline queries has been an area of growing interest. Many of the earlier skyline techniques assumed that the skyline query is applied to a single data table. Naturally, these algorithms were not suitable for many applications in which the skyline query may involve attributes belonging to multiple data sources. In other words, if the data used in the skyline query are stored in multiple tables, then join operations would be required before the skyline can be searched. The task of computing skylines on multiple data sources has been coined as the skyline-join problem and various skyline-join algorithms have been proposed. However, the current proposals suffer several drawbacks: they often need to scan the input tables exhaustively in order to obtain the set of skyline-join results; moreover, the pruning techniques employed to eliminate the tuples are largely based on expensive pairwise tuple-to-tuple comparisons. In this article, we aim to address these shortcomings by proposing two novel skyline-join algorithms, namely skyline-sensitive join (S(2)J) and symmetric skyline-sensitive join (S(3)J), to process skyline queries over two data sources. Our approaches compute the results using a novel layer/region pruning technique (LR-pruning) that prunes the join space in blocks as opposed to individual data points, thereby avoiding excessive pairwise point-to-point dominance checks. Furthermore, the S(3)J algorithm utilizes an early stopping condition in order to successfully compute the skyline results by accessing only a subset of the input tables. In addition to S(2)J and S(3)J, we also propose the S(2)J-M and S(3)J-M algorithms. These algorithms extend S(2)J's and S(3)J's two-way skyline-join ability to efficiently process skyline-join queries over more than two data sources. S(2)J-M and S(3)J-M leverage the extended concept of LR-pruning, called M-way LR-pruning, to compute multi-way skyline-joins in which more than two data sources are integrated during skyline processing. We report extensive experimental results that confirm the advantages of the proposed algorithms over state-of-the-art skyline-join techniques.

机译：高效处理天际线查询已成为人们日益关注的领域。许多早期的天际线技术都假定将天际线查询应用于单个数据表。自然，这些算法不适用于许多应用程序，在这些应用程序中，天际线查询可能涉及属于多个数据源的属性。换句话说，如果在天际线查询中使用的数据存储在多个表中，则在可以搜索天际线之前将需要进行联接操作。在天际线连接问题中提出了在多个数据源上计算天际线的任务，并提出了各种天际线连接算法。但是，当前的建议有几个缺点：他们常常需要详尽地扫描输入表以获得一组“天际线连接”结果；此外，用于消除元组的修剪技术很大程度上基于昂贵的成对元组与元组之间的比较。在本文中，我们旨在通过提出两种新颖的天际线联接算法（即天际线敏感联接（S（2）J）和对称天际线敏感联接（S（3）J））来解决这些缺点，以处理两个数据源。我们的方法使用一种新颖的层/区域修剪技术（LR修剪）来计算结果，该技术修剪块中与单个数据点相对的连接空间，从而避免了过多的逐对点对点优势检查。此外，S（3）J算法利用早期停止条件，以便仅访问输入表的一个子集来成功计算天际线结果。除了S（2）J和S（3）J，我们还提出了S（2）J-M和S（3）J-M算法。这些算法扩展了S（2）J和S（3）J的双向天际线联接功能，可以有效地处理两个以上数据源上的天际线联接查询。 S（2）J-M和S（3）J-M利用LR修剪的扩展概念（称为M路径LR修剪）来计算多路天际线连接，其中在天际线处理期间集成了两个以上的数据源。我们报告了广泛的实验结果，这些结果证实了拟议算法相对于最新的天际线连接技术的优势。

著录项

来源
《ACM transactions on database systems》 |2015年第2期|10.1-10.46|共46页
作者
Nagendra Mithila; Candan K. Selcuk;
展开▼
作者单位

Arizona State Univ, Sch Comp Informat & Decis Syst Engn, Tempe, AZ 85287 USA;

Arizona State Univ, Sch Comp Informat & Decis Syst Engn, Tempe, AZ 85287 USA;

展开▼
收录信息美国《科学引文索引》(SCI);美国《工程索引》(EI);
原文格式 PDF
正文语种 eng
中图分类
关键词
Algorithms; Design; Performance; Skyline-join processing;

机译：算法;设计;性能;天际线联合处理;

相似文献

外文文献
中文文献
专利

1. Skyline-join query processing in distributed databases [J] . Mei BAI, Junchang XIN, Guoren WANG, Frontiers of computer science in China . 2016,第2期

机译：分布式数据库中的天际线联接查询处理
2. Efficient and Scalable Query Authentication for Cloud-Based Storage Systems with Multiple Data Sources [J] . Santosh Chandrasekhar, Mukesh Singhal Services Computing, IEEE Transactions on . 2017,第4期

机译：具有多个数据源的基于云的存储系统的高效且可扩展的查询身份验证
3. Efficient processing of multiple continuous skyline queries over a data stream [J] . Yu Won Lee, Ki Yong Lee, Myoung Ho Kim Information Sciences: An International Journal . 2013,第Null期

机译：高效处理数据流中的多个连续天际线查询
4. On efficiently processing nearest neighbor queries in a loosely coupled set of data sources [C] . Thomas Schwarz, Markus Iofcea, Matthias Grossmann, ACM international workshop on Geographic information systems;Annual ACM international workshop on Geographic information systems . 2004

机译：在松散耦合的数据源集中有效处理最近的邻居查询
5. Efficient Processing of Skyline Queries on Static Data Sources, Data Streams and Incomplete Datasets. [D] . Nagendra, Mithila. 2014

机译：有效处理静态数据源，数据流和不完整数据集上的天际线查询。
6. Private and Efficient Query Processing on Outsourced Genomic Databases [O] . Reza Ghasemi, Momin Al Aziz, Noman Mohammed, -1

机译：外包基因组数据库的私有高效查询处理
7. Efficient Processing of Continuous Join Queries between a Data Stream and Multiple Relations for Real-Time Analysis of E-Commerce Data [O] . Haeri Kim, Ki Yong Lee 2013

机译：高效处理数据流与多个关系之间的连续连接查询，以进行电子商务数据的实时分析

Efficient Processing of Skyline-Join Queries over Multiple Data Sources

摘要

著录项

引文网络

相似文献

相关主题

期刊订阅