Conference: International Conference on Computational Science

A Parallel Quicksort Algorithm on Manycore Processors in Sunway TaihuLight


Abstract

In this paper we present a highly efficient parallel quicksort algorithm on SW26010, a heterogeneous manycore processor that makes Sunway TaihuLight the top-ranked supercomputer in the world. Motivated by the software-cache and on-chip communication design of SW26010, we propose a two-phase quicksort algorithm in which the first phase counts elements and the second phase moves them. To make the best of this manycore architecture, we design a decentralized workflow, optimize memory access, and balance the workload. Experiments show that our algorithm scales efficiently to 64 cores of SW26010, achieving more than 32X speedup for int32 elements on all kinds of data distributions. This result exceeds the strong-scaling result of the Intel TBB (Threading Building Blocks) version of quicksort on the x86-64 architecture.
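
The two-phase idea (count first, then move) can be illustrated with a minimal Python sketch. This is not the authors' SW26010 implementation: the function names `two_phase_partition` and `two_phase_quicksort`, the `num_workers` parameter, the three-way split around the pivot, and the middle-element pivot choice are all assumptions made for illustration. The workers are simulated one after another in a plain loop, and the software cache, on-chip communication, and load balancing described in the abstract are not modeled. What the sketch shows is why counting first is useful: once each worker knows how many elements it will write to each region, prefix sums give every worker disjoint output positions, so the move phase needs no coordination between workers.

```python
def two_phase_partition(data, pivot, num_workers=4):
    """Partition data around pivot; return (out, lo, hi) with
    out[:lo] < pivot, out[lo:hi] == pivot, out[hi:] > pivot."""
    n = len(data)
    # Give each (simulated) worker one contiguous chunk of the input.
    bounds = [n * w // num_workers for w in range(num_workers + 1)]
    chunks = [data[bounds[w]:bounds[w + 1]] for w in range(num_workers)]

    # Phase 1 (count): each worker only counts; nothing is moved yet.
    counts = []
    for chunk in chunks:
        less = sum(1 for x in chunk if x < pivot)
        equal = sum(1 for x in chunk if x == pivot)
        counts.append((less, equal, len(chunk) - less - equal))

    total_less = sum(c[0] for c in counts)
    total_equal = sum(c[1] for c in counts)

    # Exclusive prefix sums turn the counts into disjoint write offsets.
    starts = []
    pos = [0, total_less, total_less + total_equal]
    for c in counts:
        starts.append(list(pos))
        for k in range(3):
            pos[k] += c[k]

    # Phase 2 (move): each worker writes into its own precomputed slots.
    out = [None] * n
    for chunk, start in zip(chunks, starts):
        cursor = list(start)
        for x in chunk:
            k = 0 if x < pivot else (1 if x == pivot else 2)
            out[cursor[k]] = x
            cursor[k] += 1
    return out, total_less, total_less + total_equal


def two_phase_quicksort(data, num_workers=4):
    """Sort by recursively applying the two-phase partition."""
    if len(data) <= 1:
        return list(data)
    pivot = data[len(data) // 2]  # assumed pivot rule, chosen for simplicity
    out, lo, hi = two_phase_partition(data, pivot, num_workers)
    return (two_phase_quicksort(out[:lo], num_workers)
            + out[lo:hi]
            + two_phase_quicksort(out[hi:], num_workers))
```

Calling `two_phase_quicksort(values, num_workers=8)` on a list of integers returns a new sorted list. In a real parallel version the two `for chunk in ...` loops would run concurrently, one iteration per core, with a synchronization point between the count and move phases.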
