Sorting in parallel database systems

Abstract

Sorting is frequently required in database processing, most visibly through the ORDER BY and DISTINCT clauses in SQL, and it is a long-studied problem in computer science at large. Sorting in general covers internal and external sorting. Previously published work has focused extensively on external sorting on uniprocessors (serial external sorting) and on internal sorting on multiprocessors (parallel internal sorting). External sorting on multiprocessors (parallel external sorting) has received surprisingly little attention; furthermore, the way current parallel database systems sort is far from optimal in many scenarios. The authors present a taxonomy for parallel sorting in parallel database systems that covers five sorting methods: parallel merge-all sort, parallel binary-merge sort, parallel redistribution binary-merge sort, parallel redistribution merge-all sort, and parallel partitioned sort. The first two methods are previously proposed approaches to parallel external sorting that have been adopted as the status quo of parallel database sorting, whereas the latter three methods, which are based on redistribution and repartitioning, are new in that they have not been discussed in the literature on parallel external sorting.
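
To make the contrast between the merge-based and the partition-based families more concrete, the following is a minimal Python sketch of two of the named methods. It rests on the common reading of the method names and is not taken from the paper: the abstract does not describe the algorithms in detail. "Processors" are simulated as plain in-memory lists, no real parallelism or disk I/O is performed, and the function names and the `boundaries` parameter are hypothetical.

```python
import bisect
import heapq
from typing import List


def merge_all_sort(partitions: List[List[int]]) -> List[int]:
    """Assumed reading of 'parallel merge-all sort':
    each processor sorts its own partition locally, then a single
    host performs one k-way merge over all the sorted runs."""
    runs = [sorted(p) for p in partitions]   # local sort phase (per processor)
    return list(heapq.merge(*runs))          # final merge phase (one host)


def partitioned_sort(partitions: List[List[int]],
                     boundaries: List[int]) -> List[List[int]]:
    """Assumed reading of 'parallel partitioned sort':
    records are first redistributed by key range, then each processor
    sorts the range it received. No final merge is needed, because
    concatenating the buckets in order already yields a sorted result.
    `boundaries` (hypothetical) must be sorted ascending."""
    buckets: List[List[int]] = [[] for _ in range(len(boundaries) + 1)]
    for partition in partitions:             # redistribution phase
        for record in partition:
            buckets[bisect.bisect_right(boundaries, record)].append(record)
    return [sorted(b) for b in buckets]      # local sort phase (per processor)


if __name__ == "__main__":
    data = [[9, 3, 7], [1, 8, 2], [6, 5, 4]]
    print(merge_all_sort(data))
    print(partitioned_sort(data, boundaries=[3, 6]))
```

In this sketch the merge-all variant funnels every record through one merging host, whereas the partitioned variant spends its effort up front on redistribution and leaves each processor with an independent range. How well the second approach balances load depends on the choice of range boundaries, which is assumed here to be given.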
