首页> 外文期刊>IEEE Transactions on Knowledge and Data Engineering >Optimization of parallel execution for multi-join queries
【24h】

Optimization of parallel execution for multi-join queries

机译:优化多联接查询的并行执行

获取原文
获取原文并翻译 | 示例

摘要

We study the subject of exploiting interoperator parallelism to optimize the execution of multi-join queries. Specifically, we focus on two major issues: (1) scheduling the execution sequence of multiple joins within a query, and (2) determining the number of processors to be allocated for the execution of each join operation obtained in (1). For the first issue, we propose and evaluate by simulation several methods to determine the general join sequences, or bushy trees. Despite their simplicity, the heuristics proposed can lead to the general join sequences that significantly outperform the optimal sequential join sequence. The quality of the join sequences obtained by the proposed heuristics is shown to be fairly close to that of the optimal one. For the second issue, it is shown that the processor allocation for exploiting interoperator parallelism is subject to more constraints-such as execution dependency and system fragmentation-than those in the study of intraoperator parallelism for a single join. The concept of synchronous execution time is proposed to alleviate these constraints. Several heuristics to deal with the processor allocation, categorized by bottom-up and top-down approaches, are derived and are evaluated by simulation. The relationship between issues (1) and (2) is explored. Among all the schemes evaluated, the two-step approach proposed, which first applies the join sequence heuristic to build a bushy tree as if under a single processor system, and then, in light of the concept of synchronous execution time, allocates processors to execute each join in the bushy tree in a top-down manner, emerges as the best solution to minimize the query execution time.
机译:我们研究了利用互操作符并行性来优化多联接查询执行的主题。具体来说,我们关注两个主要问题:(1)安排查询中多个联接的执行顺序;(2)确定要为执行(1)中获得的每个联接操作而分配的处理器数量。对于第一个问题,我们提出并通过仿真评估了几种方法来确定一般的连接顺序或灌木丛。尽管它们很简单,但提出的启发式方法仍可导致一般的连接序列大大优于最佳的顺序连接序列。通过提出的启发式方法获得的连接序列的质量显示出与最佳序列的质量相当接近。对于第二个问题,这表明与用于单个联接的内部操作员并行性研究相比,用于开发内部操作员并行性的处理器分配受到更多的约束,例如执行依赖性和系统碎片。提出了同步执行时间的概念以减轻这些约束。通过自下而上和自上而下的方法对处理处理器分配的几种启发式方法进行了推导,并通过仿真对其进行了评估。探索了问题(1)和(2)之间的关系。在所有评估的方案中,提出了一种两步方法,该方法首先应用连接序列启发式方法来构建灌木丛树,就好像在单个处理器系统下一样,然后根据同步执行时间的概念,分配要执行的处理器灌木丛树中的每个联接都以自顶向下的方式出现,这是使查询执行时间最短的最佳解决方案。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号