IEEE Transactions on Parallel and Distributed Systems

An implementation framework for HPF distributed arrays on message-passing parallel computer systems



Abstract

Data-parallel languages such as High Performance Fortran (HPF) support the notion of distributed arrays. However, implementing such distributed array structures and accessing them on message-passing computers is not straightforward. This holds especially for distributed arrays that are aligned to each other and have a block-cyclic distribution. In this paper, an implementation framework is presented for HPF distributed arrays on message-passing computers. Methods are presented for efficient (in space and time) local index enumeration, local storage, and communication. Techniques for local set enumeration provide the basis for constructing local iteration sets and communication sets. It is shown that both local set enumeration and local storage schemes can be derived from the same equation. Local set enumeration and local storage schemes are shown to be orthogonal, i.e., they can be freely combined. Moreover, for linear access sequences generated by our enumeration methods, the local address calculations can be moved out of the enumeration loop, yielding efficient local memory address generation. The local set enumeration methods are implemented by using a relatively simple general transformation rule for absorbing ownership tests. This transformation rule can be repeatedly applied to absorb multiple ownership tests. Performance figures are presented for local iteration overhead, a simple communication pattern, and storage efficiency.
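To make the idea of "absorbing an ownership test" concrete, the following is a minimal sketch in Python for the simplest case only: a one-dimensional array with a block-cyclic distribution of block size `b` over `P` processors, no alignment, and unit stride. It is not the paper's transformation rule. The function names (`owner`, `local_index`, `owned_naive`, `owned_direct`) are hypothetical, and the dense local storage layout is an assumption chosen for illustration.

```python
def owner(i, b, P):
    """Processor that owns global index i under cyclic(b) over P processors."""
    return (i // b) % P


def local_index(i, b, P):
    """Position of global index i in its owner's dense local array (assumed layout)."""
    return (i // (b * P)) * b + i % b


def owned_naive(n, b, P, p):
    """Scan all n global indices and keep those passing the ownership test."""
    return [i for i in range(n) if owner(i, b, P) == p]


def owned_direct(n, b, P, p):
    """Enumerate only the indices owned by p; the ownership test is
    absorbed into the loop bounds instead of being evaluated per element."""
    result = []
    block_start = p * b              # first block owned by processor p
    while block_start < n:
        block_end = min(block_start + b, n)
        result.extend(range(block_start, block_end))
        block_start += b * P         # jump to p's block in the next cycle
    return result


if __name__ == "__main__":
    n, b, P, p = 20, 2, 3, 1
    mine = owned_direct(n, b, P, p)
    print(mine)
    print([local_index(i, b, P) for i in mine])
```

The naive version does work proportional to the global array size on every processor, while the direct version visits only the locally owned elements. In this simple case the local indices of consecutive owned elements are consecutive, which illustrates why address calculation can be kept out of the inner loop.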
