首页> 外文会议>International Conference on High Performance Computing Simulation >234 scheduling of 3-2 and 2-1 eliminations for parallel image compositing using non-power-of-two number of processes
【24h】

234 scheduling of 3-2 and 2-1 eliminations for parallel image compositing using non-power-of-two number of processes

机译:234 3-2和2-1使用两种流程的并行图像合成的淘汰的淘汰

获取原文

摘要

Binary-Swap is a parallel image compositing algorithm based on recursive vector halving and distance doubling, and works efficiently when the number of processes is exactly a power-of-two (2). Several power-of-two converting approaches for Binary-Swap have been proposed. Among them, the Telescope method, based on the Binary Blocks algorithm, has been shown as the most promising approach. The Telescope method decomposes an entire set of processes into blocks of power-of-two size and merges the smaller blocks into larger blocks in stepwise fashion. This block merging process corresponds to the communication and computational overhead of the conversion, and since it can only merge one block per stage, it becomes inefficient as the number of binary blocks increases. In this paper, we focus on a single-stage conversion method using the 3-2 and 2-1 elimination approaches. The original scheduling method, proposed by Rabenseifner et al., is limited to an odd number of processes since it always schedules a single 3-2 elimination per conversion. Taking into consideration that the 3-2 elimination can be optimized on modern HPC systems, which can overlap the communication and computation, we propose 234 Scheduling for scheduling multiple 3-2 eliminations per conversion. The multiple 3-2 elimination scheduling enlarges the application range by enabling its use on an even number of processes. We evaluated 234 Scheduling applied to Binary-Swap on the K computer, which is a modern parallel HPC system, and confirmed its effectiveness.
机译:二进制交换是一种基于递归矢量半衰分和距离加倍的并行图像合成算法,并且当过程的数量正常为两(2)时,有效地工作。已经提出了几种用于二进制交换的两种电源转换方法。其中,基于二进制块算法的望远镜方法已被显示为最有希望的方法。望远镜方法将整套过程分解成两块电源尺寸,并将较小的块以逐步的方式合并到更大的块中。该块合并过程对应于转换的通信和计算开销,并且由于它只能合并每个阶段的一个块,因此随着二进制块的数量增加而变得效率低。在本文中,我们专注于使用3-2和2-1消除方法的单级转换方法。 Rabenseifner等人提出的原始调度方法仅限于奇数过程,因为它总是调度每转换单个3-2消除。考虑到3-2消除可以在现代HPC系统上优化,这可以与通信和计算重叠,我们提出了234个调度,以调度每次转换的多个3-2消除。多个3-2消除调度通过在偶数流程上启用其使用来扩大应用范围。我们评估了334调度应用于K计算机上的二进制交换,这是一种现代并行HPC系统,并确认其有效性。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号