Optimization of parallel execution for multi-join queries

Ming-Syan Chen; Yu P.S.

首页> 外文期刊>IEEE Transactions on Knowledge and Data Engineering >Optimization of parallel execution for multi-join queries

【24h】

Optimization of parallel execution for multi-join queries

机译：优化多联接查询的并行执行

获取原文

获取原文并翻译 | 示例

开具论文收录证明 >>

页面导航

摘要
著录项
引文网络
相似文献
相关主题

摘要

We study the subject of exploiting interoperator parallelism to optimize the execution of multi-join queries. Specifically, we focus on two major issues: (1) scheduling the execution sequence of multiple joins within a query, and (2) determining the number of processors to be allocated for the execution of each join operation obtained in (1). For the first issue, we propose and evaluate by simulation several methods to determine the general join sequences, or bushy trees. Despite their simplicity, the heuristics proposed can lead to the general join sequences that significantly outperform the optimal sequential join sequence. The quality of the join sequences obtained by the proposed heuristics is shown to be fairly close to that of the optimal one. For the second issue, it is shown that the processor allocation for exploiting interoperator parallelism is subject to more constraints-such as execution dependency and system fragmentation-than those in the study of intraoperator parallelism for a single join. The concept of synchronous execution time is proposed to alleviate these constraints. Several heuristics to deal with the processor allocation, categorized by bottom-up and top-down approaches, are derived and are evaluated by simulation. The relationship between issues (1) and (2) is explored. Among all the schemes evaluated, the two-step approach proposed, which first applies the join sequence heuristic to build a bushy tree as if under a single processor system, and then, in light of the concept of synchronous execution time, allocates processors to execute each join in the bushy tree in a top-down manner, emerges as the best solution to minimize the query execution time.

机译：我们研究了利用互操作符并行性来优化多联接查询执行的主题。具体来说，我们关注两个主要问题：（1）安排查询中多个联接的执行顺序；（2）确定要为执行（1）中获得的每个联接操作而分配的处理器数量。对于第一个问题，我们提出并通过仿真评估了几种方法来确定一般的连接顺序或灌木丛。尽管它们很简单，但提出的启发式方法仍可导致一般的连接序列大大优于最佳的顺序连接序列。通过提出的启发式方法获得的连接序列的质量显示出与最佳序列的质量相当接近。对于第二个问题，这表明与用于单个联接的内部操作员并行性研究相比，用于开发内部操作员并行性的处理器分配受到更多的约束，例如执行依赖性和系统碎片。提出了同步执行时间的概念以减轻这些约束。通过自下而上和自上而下的方法对处理处理器分配的几种启发式方法进行了推导，并通过仿真对其进行了评估。探索了问题（1）和（2）之间的关系。在所有评估的方案中，提出了一种两步方法，该方法首先应用连接序列启发式方法来构建灌木丛树，就好像在单个处理器系统下一样，然后根据同步执行时间的概念，分配要执行的处理器灌木丛树中的每个联接都以自顶向下的方式出现，这是使查询执行时间最短的最佳解决方案。

著录项

来源
《IEEE Transactions on Knowledge and Data Engineering》 |1996年第3期|P.416-428|共13页
作者
Ming-Syan Chen; Yu P.S.;
展开▼
作者单位

展开▼
收录信息
原文格式 PDF
正文语种 eng
中图分类计算技术、计算机技术;
关键词

相似文献

外文文献
中文文献
专利

1. DR-QPO: DISCRETE RANK BASED QUERY PATTERN OPTIMIZATION TOWARDS PARALLEL QUERY PLANNING AND EXECUTION FOR DISTRIBUTE TRIPLE STORES [J] . K.SHAILAJA, Dr. P.V. KUMAR, Dr. S.DURGA BHAVANI Journal of Theoretical and Applied Information Technology . 2017,第22期

机译：DR-QPO：基于离散的级别的查询模式优化对分发三重商店的并行查询规划和执行
2. Multi-join query optimization in bucket-based encrypted databases using an enhanced ant colony optimization algorithm [J] . Jafarinejad Mahmoud, Amini Morteza Distributed and Parallel Databases . 2018,第2期

机译：使用增强型蚁群优化算法的基于桶的加密数据库中的多联接查询优化
3. Optimizing Communication for Multi-Join Query Processing in Cloud Data Warehouses [J] . Swathi Kurunji, Tingjian Ge, Xinwen Fu, International journal of grid and high performance computing . 2013,第4期

机译：优化云数据仓库中多联接查询处理的通信
4. Parallel multi-join query optimization algorithm for distributed sensor network in the internet of things [C] . Yan Zheng Conference on smart sensor phenomena, technology, networks, and systems integration . 2015

机译：物联网中分布式传感器网络的并行多联接查询优化算法
5. Accelerating Analytical Query Processing with Data Placement Conscious Optimization and RDMA-Aware Query Execution [D] . Liu, Feilong. 2018

机译：通过数据放置意识优化和支持RDMA的查询执行来加速分析查询处理
6. Parallel MapReduce: Maximizing Cloud Resource Utilization and Performance Improvement Using Parallel Execution Strategies [O] . Ahmed Abdulhakim Al-Absi, Najeeb Abbas Al-Sammarraie, Wael Mohamed Shaher Yafooz, -1

机译：并行MapReduce：使用并行执行策略来最大程度地利用云资源并提高性能
7. Optimization Of Parallel Execution For Multi-Join Queries [O] . Ming-syan Chen, Philip S. Yu, Kun-lung Wu 1995

机译：多连接查询并行执行的优化

Optimization of parallel execution for multi-join queries

摘要

著录项

引文网络

相似文献

相关主题

期刊订阅