首页> 外文会议>Supercomputing '93. Proceedings >Mobile and replicated alignment of arrays in data-parallel programs
【24h】

Mobile and replicated alignment of arrays in data-parallel programs

机译:数据并行程序中阵列的移动和复制对齐

获取原文

摘要

When a data-parallel language like Fortran 90 is compiled for a distributed-memory machine, aggregate data objects (such as arrays) are distributed across the processor memories. The mapping determines the amount of residual communication needed to bring operands of parallel operations into alignment with each other. A common approach is to break the mapping into two stages: first, an alignment that maps all the objects to an abstract template, and then a distribution that maps the template to the processors. The authors solve two facets of the problem of finding alignments that reduce residual communication, i.e., determining both the alignments that vary in loops, and the objects that should have replicated alignments. They show that loop-dependent mobile alignment is sometimes necessary for optimum performance, and they provide algorithms with which a compiler can determine good mobile alignments for objects within do loops. They also identify situations in which replicated alignment is either required by the program itself (via spread operations) or can be used to improve performance. An algorithm based on network flow that determines which objects to replicate so as to minimize the total amount of broadcast communication in replication is proposed.
机译:当为分布式内存计算机编译像Fortran 90这样的数据并行语言时,聚合数据对象(例如数组)将分布在处理器内存中。映射确定使并行操作的操作数彼此对齐所需的剩余通信量。一种常见的方法是将映射分为两个阶段:首先是将所有对象映射到抽象模板的对齐,然后是将模板映射到处理器的分发。作者解决了寻找减少剩余通信的路线的问题的两个方面,即确定循环中变化的路线和应该复制路线的对象。他们表明依赖循环的移动对齐有时对于最佳性能是必需的,并且它们提供了一种算法,编译器可以使用这些算法为do循环内的对象确定良好的移动对齐。它们还确定了程序本身需要复制对齐(通过扩展操作)或可以用来提高性能的情况。提出了一种基于网络流的算法,该算法确定要复制的对象,以最大程度地减少复制中广播通信的总量。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号