Journal: Parallel Computing

Tuning collective communication for Partitioned Global Address Space programming models

Abstract

Partitioned Global Address Space (PGAS) languages offer programmers the convenience of a shared-memory programming style combined with the locality control necessary to run on large-scale distributed-memory systems. Even within a PGAS language, programmers often need to perform global communication operations such as broadcasts or reductions, which are best performed as collective operations in which a group of threads works together to perform the operation. In this paper we consider the problem of implementing collective communication within PGAS languages and explore some of the design trade-offs in both the interface and the implementation. In particular, PGAS collectives raise semantic issues that differ from those in send-receive style message-passing programs, and they admit different implementation approaches that take advantage of the one-sided communication style of these languages. We present an implementation framework for PGAS collectives as part of the GASNet communication layer, which supports shared memory, distributed memory, and hybrids of the two. The framework supports a broad set of algorithms for each collective, over which the implementation may be automatically tuned. Finally, we demonstrate the benefit of optimized GASNet collectives using application benchmarks written in UPC, and show that the GASNet collectives can deliver scalable performance on a variety of state-of-the-art parallel machines, including a Cray XT4, an IBM BlueGene/P, and a Sun Constellation system with an InfiniBand interconnect.
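
The idea of tuning over several algorithms per collective can be illustrated with a small simulation. The sketch below is not the GASNet API and not the tuner described in the paper; every function name, the two candidate broadcast schedules (flat and binomial tree), and the toy cost model with its `latency` and `per_put_overhead` parameters are assumptions made for illustration. It models a broadcast as rounds of one-sided puts into per-thread buffers and picks whichever schedule the cost model rates cheaper.

```python
from collections import Counter


def flat_schedule(nthreads):
    """One round: the root (thread 0) issues a put to every other thread."""
    puts = [(0, dst) for dst in range(1, nthreads)]
    return [puts] if puts else []


def binomial_schedule(nthreads):
    """Binomial tree: in each round, every thread that already holds the
    data puts it to the thread `step` ranks away, then `step` doubles."""
    rounds = []
    step = 1
    while step < nthreads:
        rounds.append([(src, src + step)
                       for src in range(step) if src + step < nthreads])
        step *= 2
    return rounds


def run_broadcast(rounds, nthreads, value):
    """Apply a schedule as one-sided puts: the source writes directly
    into the destination's buffer, with no matching receive."""
    buffers = [None] * nthreads
    buffers[0] = value
    for puts in rounds:
        for src, dst in puts:
            assert buffers[src] is not None, "source has no data yet"
            buffers[dst] = buffers[src]
    return buffers


def modeled_cost(rounds, latency, per_put_overhead):
    """Hypothetical cost model: each round costs one network latency plus
    the injection overhead of the busiest sender in that round."""
    total = 0.0
    for puts in rounds:
        busiest = max(Counter(src for src, _ in puts).values())
        total += latency + per_put_overhead * busiest
    return total


def tune_broadcast(nthreads, latency, per_put_overhead):
    """Return the name of the cheapest candidate and all modeled costs."""
    candidates = {"flat": flat_schedule, "binomial": binomial_schedule}
    costs = {name: modeled_cost(make(nthreads), latency, per_put_overhead)
             for name, make in candidates.items()}
    return min(costs, key=costs.get), costs


if __name__ == "__main__":
    # High latency, cheap puts: one flat round is modeled as cheaper.
    print(tune_broadcast(8, latency=10.0, per_put_overhead=1.0))
    # Cheap latency, expensive puts: the tree spreads the injection work.
    print(tune_broadcast(8, latency=1.0, per_put_overhead=10.0))
```

Under this model the preferred schedule flips with the machine parameters, which is the motivation for tuning automatically rather than fixing one algorithm. A real tuner would presumably time the candidates on the target system instead of relying on a closed-form model.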
