2010 IEEE International Symposium on Parallel & Distributed Processing (IPDPS)

Hierarchical phasers for scalable synchronization and reductions in dynamic parallelism


Abstract

The phaser construct unifies collective and point-to-point synchronization with dynamic parallelism. It gives each task the option of synchronizing on a phaser in signal-only or wait-only mode for producer/consumer synchronization, or in signal-wait mode for barrier synchronization. A phaser accumulator is a reduction construct that works with phasers in a phased setting. Phasers and accumulators support dynamic parallelism, i.e., they allow tasks to be dynamically added to and removed from the synchronizations and reductions that they support. Past implementations of phasers and phaser accumulators have used a single master task to advance a phaser to the next phase and to perform the computations for lazy reductions, while also supporting dynamic parallelism. Though the single-master approach is effective for modest levels of parallelism, it quickly becomes a scalability bottleneck as the number of threads increases. To address this limitation, we propose an approach based on hierarchical phasers for scalable synchronization and hierarchical accumulators for scalable reduction. Our approach also includes tunable initialization parameters that specify the degree and number of tiers of the phaser hierarchy, thereby allowing different values to be chosen for different platforms. Our performance results show significant scalability benefits from our approach. To the best of our knowledge, this is the first approach to support hierarchical synchronization and reductions in the presence of dynamic parallelism.
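The idea of tiered synchronization with a per-phase reduction can be illustrated with a minimal sketch. It is not the paper's implementation: it uses the JDK's java.util.concurrent.Phaser, a related but different construct that supports tiering through parent phasers, and it stands in for a phaser accumulator with one LongAdder per phase. The class name TieredPhaserSketch, the method run, and the parameters tasks, degree, and phases are hypothetical names chosen for illustration, and the tree is fixed at two tiers for brevity.

```java
import java.util.concurrent.Phaser;
import java.util.concurrent.atomic.AtomicInteger;
import java.util.concurrent.atomic.LongAdder;

// Illustrative sketch only: JDK Phaser tiering, not the paper's phaser runtime.
class TieredPhaserSketch {

    // Runs `tasks` threads for `phases` phases on a two-tier phaser tree in which
    // each leaf phaser serves up to `degree` tasks (assumed parameter names).
    // Returns how many (task, phase) pairs observed the complete per-phase sum.
    static int run(int tasks, int degree, int phases) {
        Phaser root = new Phaser();                      // top tier
        int leafCount = (tasks + degree - 1) / degree;   // ceiling division
        Phaser[] leaves = new Phaser[leafCount];
        for (int i = 0; i < leafCount; i++) {
            leaves[i] = new Phaser(root);                // child phaser, parent = root
        }

        // One accumulator per phase stands in for a phaser accumulator.
        LongAdder[] acc = new LongAdder[phases];
        for (int p = 0; p < phases; p++) {
            acc[p] = new LongAdder();
        }
        long expected = (long) tasks * (tasks + 1) / 2;  // sum of 1..tasks
        AtomicInteger hits = new AtomicInteger();

        Thread[] threads = new Thread[tasks];
        for (int t = 0; t < tasks; t++) {
            Phaser leaf = leaves[t / degree];
            leaf.register();                             // dynamic registration, done before start
            final int id = t;
            threads[t] = new Thread(() -> {
                for (int p = 0; p < phases; p++) {
                    acc[p].add(id + 1);                  // contribute to this phase's reduction
                    leaf.arriveAndAwaitAdvance();        // barrier across the whole tree
                    if (acc[p].sum() == expected) {      // every contribution is visible here
                        hits.incrementAndGet();
                    }
                }
                leaf.arriveAndDeregister();              // dynamic removal from the phaser
            });
        }
        for (Thread th : threads) {
            th.start();
        }
        try {
            for (Thread th : threads) {
                th.join();
            }
        } catch (InterruptedException e) {
            Thread.currentThread().interrupt();
            throw new IllegalStateException("interrupted while joining", e);
        }
        return hits.get();
    }

    public static void main(String[] args) {
        System.out.println(run(8, 2, 3));
    }
}
```

In this sketch, threads contend only on their own leaf phaser, and each leaf arrives at the root once per phase, which is the contention-reducing effect that a hierarchy is meant to provide. The degree argument plays a role loosely analogous to the tunable degree parameter mentioned in the abstract; suitable values would have to be determined per platform.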
