Conference: IEEE International Symposium on Parallel Distributed Processing; IPDPS 2009

Concurrent SSA for general barrier-synchronized parallel programs


Abstract

Static single assignment (SSA) form has been widely studied and used for sequential programs, where it enables many compiler optimizations to be performed efficiently. Work on concurrent static single assignment (CSSA) form for concurrent programs has focused on languages with limited, implicit barriers (e.g., cobegin/coend and parallel do). Recent programming languages for high-performance computing have general features for barrier/phase synchronization. This is essentially a dual of mutual exclusion and arises mainly in constructing synchronous systems from asynchronous systems. X10 is one such language, providing general-purpose barriers through features such as clocks and finish. Since barriers provide explicit synchronization, they offer an opportunity for reducing the pi interferences needed for CSSA. This paper provides a means of computing an improved CSSA form of a program by taking advantage of the general barriers present in it. Our algorithm is based on constructing a control-flow graph of the program and flow equations. The efficiency of analysis and optimization for parallel programs depends on the number and complexity of pi assignments in their CSSA representations. We demonstrate that our approach to computing CSSA form for languages supporting general barrier synchronization can improve the precision of the intermediate representation for computing global value numbering and loop invariant detection.
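
To make the idea of barriers reducing pi interferences concrete, here is a toy sketch in Python. It is not the paper's algorithm, which works on a control-flow graph with flow equations. It assumes a hypothetical model in which every statement is tagged with a task name and a phase number (the count of barriers that task has passed), and in which all tasks synchronize on the same barriers. The names `Stmt` and `concurrent_defs` are invented for this illustration. It only computes which definitions in other tasks may run concurrently with a use; values that reach the use across a barrier from an earlier phase would need separate handling and are not modeled.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Stmt:
    task: str   # parallel task that executes the statement (hypothetical model)
    phase: int  # number of barriers the task has passed before this statement
    kind: str   # "def" or "use"
    var: str    # shared variable accessed
    label: str  # readable name for the statement


def concurrent_defs(use, stmts, barrier_aware):
    """Labels of definitions in other tasks that may run concurrently with `use`.

    Without barrier information, every definition of the variable in another
    task is a candidate. With it, only same-phase definitions remain.
    """
    result = []
    for s in stmts:
        if s.kind != "def" or s.var != use.var or s.task == use.task:
            continue
        if barrier_aware and s.phase != use.phase:
            continue  # ordered before or after the use by a barrier
        result.append(s.label)
    return result


program = [
    Stmt("T1", 0, "def", "x", "x1"),
    Stmt("T2", 0, "def", "x", "x2"),
    Stmt("T1", 1, "use", "x", "u1"),
    Stmt("T2", 1, "def", "x", "x3"),
    Stmt("T2", 2, "def", "x", "x4"),
]
use = program[2]

print(concurrent_defs(use, program, barrier_aware=False))  # ['x2', 'x3', 'x4']
print(concurrent_defs(use, program, barrier_aware=True))   # ['x3']
```

In this toy program the use `u1` has three candidate concurrent definitions when barriers are ignored and one when phases are taken into account, which is the kind of reduction in pi arguments the abstract refers to.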
