首页> 外文会议>Supercomputing, 1998. SC98. IEEE/ACM Conference on >High Performance Fortran Compilation Techniques for Parallelizing Scientific Codes
【24h】

High Performance Fortran Compilation Techniques for Parallelizing Scientific Codes

机译:用于科学代码并行化的高性能Fortran编译技术

获取原文

摘要

With current compilers for High Performance Fortran (HPF), substantial restructuring and hand- optimization may be required to obtain acceptable performance from an HPF port of an existing Fortran application. A key goal of the Rice dHPF compiler project is to develop optimization techniques that can provide consistently high performance for a broad spectrum of scientific applications with minimal restructuring of existing Fortran 77 or Fortran 90 applications. This paper presents four new optimization techniques we developed to support efficient parallelization of codes with minimal restructuring. These optimizations include computation partition selection for loop nests that use privatizable arrays, along with partial replication of boundary computations to reduce communication overhead; communication- sensitive loop distribution to eliminate inner-loop communications; interprocedural selection of computation partitions; and data availability analysis to eliminate redundant communications. We studied the effectiveness of the dHPF compiler, which incorporates these optimizations, in parallelizing serial versions of the NAS SP and BT application benchmarks. We present experimental results comparing the performance of hand-written MPI code for the benchmarks against code generated from HPF using the dHPF compiler and the Portland Group's pghpf compiler. Using the compilation techniques described in this paper we achieve performance within 15% of hand-written MPI code on 25 processors for BT and within 33% for SP. Furthermore, these results are obtained with HPF versions of the benchmarks that were created with minimal restructuring of the serial code (modifying only approximately 5% of the code).
机译:对于用于高性能Fortran(HPF)的当前编译器,可能需要进行实质性的重组和手动优化,才能从现有Fortran应用程序的HPF端口获得可接受的性能。 Rice dHPF编译器项目的主要目标是开发优化技术,以最小化现有Fortran 77或Fortran 90应用程序的结构为各种科学应用程序提供一致的高性能。本文介绍了我们开发的四种新的优化技术,它们以最少的重组来支持代码的高效并行化。这些优化包括使用可私有化数组的循环嵌套的计算分区选择,以及边界计算的部分复制以减少通信开销;通讯敏感的回路分配,以消除内回路通讯;过程间的计算分区选择;以及数据可用性分析,以消除冗余通信。我们在并行化NAS SP和BT应用程序基准测试的串行版本时,研究了dHPF编译器的有效性,该编译器结合了这些优化功能。我们提供的实验结果将基准的手写MPI代码与使用dHPF编译器和Portland Group的pghpf编译器从HPF生成的代码的性能进行了比较。使用本文中介绍的编译技术,对于BT,在25个处理器上,性能达到手写MPI代码的15%以内,对于SP,性能达到33%。此外,这些结果是通过基准测试的HPF版本获得的,该版本是在对串行代码进行最少重组的情况下创建的(仅修改了大约5%的代码)。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号