【24h】

Scalable high-quality 1D partitioning

机译:可扩展的高质量1D分区

获取原文

摘要

The decomposition of one-dimensional workload arrays into consecutive partitions is a core problem of many load balancing methods, especially those based on space-filling curves. While previous work has shown that heuristics can be parallelized, only sequential algorithms exist for the optimal solution. However, centralized partitioning will become infeasible in the exascale era due to the vast amount of tasks to be mapped to millions of processors. In this work, we first introduce optimizations to a published exact algorithm. Further, we investigate a hierarchical approach which combines a parallel heuristic and an exact algorithm to form a scalable and high-quality 1D partitioning algorithm. We compare load balance, execution time, and task migration of the algorithms for up to 262 144 processes using real-life workload data. The results show a 300 times speed-up compared to an existing fast exact algorithm, while achieving nearly the optimal load balance.
机译:一维工作负载阵列的分解成连续分区是许多负载平衡方法的核心问题,尤其是基于空间填充曲线的核心问题。虽然以前的工作表明,虽然启发式可以并行化,但只有最佳解决方案的序列算法。然而,由于映射到数百万处理器的大量任务,集中分区将在ExaScale Era中变得不可行。在这项工作中,我们首先向发布的精确算法介绍优化。此外,我们研究了一种分层方法,该方法结合了并行启发式和精确算法来形成可伸缩和高质量的1D分区算法。我们使用现实生活工作负载数据比较算法的负载平衡,执行时间和任务迁移到达262个144进程。结果显示了与现有的快速精确算法相比加速300倍,同时实现了几乎最佳的负载平衡。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号