首页> 外文会议>High Performance Distributed Computing, 1999. Proceedings. The Eighth International Symposium on >Portable petaFLOP/s programming: applying distributed computing methodology to the grid within a single machine room
【24h】

Portable petaFLOP/s programming: applying distributed computing methodology to the grid within a single machine room

机译:便携式petaFLOP / s编程:将分布式计算方法应用于单个机房内的网格

获取原文

摘要

According to today's best projections, petaFLOP/s computing platforms will combine deep memory hierarchies in both latency and bandwidth with a need for many-thousand-fold parallelism. Unless effective parallel programs are prepared in advance, much of the promise of the first year or two of operation for these systems may be lost. We introduce a candidate for a portable petaFLOP/s programming model that can enable these important early application programs to be developed while, at the same time, permitting these same applications to run efficiently on the most capable computing systems now available. An MPI-based model is portable, but its programming paradigm ignores the potential benefits of hardware support for shared memory within each network node. A threads-based model cannot directly cope with the distributed nature of the memory over the network. Therefore, a new, portable programming model is needed. The shared memory programming model dramatically simplifies the expression of dynamic load balancing strategies for irregular algorithms. The main strategy is a transparent self-scheduled task list performed in parallel so long as specified data-dependent conditions are met. The model used is a cluster of multiprocessor distributed shared memory machines with network-attached disks. Our experimental run-time system allows the programmer to view this computing platform as a single machine with a four-stage memory hierarchy, consisting of coherent processor cache, non-coherent local shared memory, global shared memory, plus a global disk file system.
机译:根据今天的最佳预测,petaFLOP / s计算平台将结合延迟和带宽方面的深层内存层次结构,并需要成千上万的并行性。除非事先准备好有效的并行程序,否则这些系统在运行的第一年或第二年的许诺可能会丢失。我们介绍了便携式petaFLOP / s编程模型的候选人,该模型可以使这些重要的早期应用程序得以开发,同时允许这些相同的应用程序在目前可用的最强大的计算系统上高效运行。基于MPI的模型是可移植的,但是其编程范例忽略了对每个网络节点内的共享内存进行硬件支持的潜在好处。基于线程的模型无法直接应对网络上内存的分布式性质。因此,需要一种新的便携式编程模型。共享内存编程模型极大地简化了针对不规则算法的动态负载平衡策略的表达。只要满足指定的数据相关条件,主要策略是并行执行的透明的自调度任务列表。使用的模型是带有网络连接磁盘的多处理器分布式共享内存计算机的集群。我们的实验运行时系统使程序员可以将此计算平台视为具有四级内存层次结构的单台计算机,该内存层次结构包括相干处理器缓存,不相干本地共享内存,全局共享内存以及全局磁盘文件系统。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号