首页> 外文会议>International Conference on Parallel Processing >Access descriptor based locality analysis for Distributed-Shared Memory multiprocessors
【24h】

Access descriptor based locality analysis for Distributed-Shared Memory multiprocessors

机译:基于访问描述符的分布式共享内存多处理器的局部分析

获取原文

摘要

Most of today's multiprocessors have a Distributed-Shared Memory (DSM) organization, which enables scalability while retaining the convenience of the shared-memory programming paradigm. Data locality is crucial for performance in DSM machines, due to the difference in access times between local and remote memories. In this paper, we present a compile-time representation that captures the memory locality exhibited by a program in the form of a graph known as Locality-Communication Graph (LCG). In the LCG, each node represents a DO loop nest which can have at most one level of parallelism. Not all loops need to be represented within a node and, therefore, the LCG may contain cycles. Our representation works whether the loops represented by the nodes are perfectly nested or not, and the subscript expressions and loop limits can be affine or non-affine expressions of the loop indices. The LCG provides essential information that a parallelizing compiler can use to automatically choose a good iteration/data distribution and to schedule the communication operations required during program execution.
机译:大多数今天的多处理器都有一个分布式共享的内存(DSM)组织,它可以实现可伸缩性,同时保留共享内存编程范例的便利性。由于本地和远程存储器之间的访问时间差异,数据局部度对于DSM机器中的性能至关重要。在本文中,我们介绍了一种编译时表示,其捕获程序以称为局部通信图(LCG)的图形的形式捕获程序呈现的内存局部。在LCG中,每个节点表示一个可以具有大多数级别的并行度的循环嵌套。并非所有循环都需要在节点内表示,因此,LCG可能包含周期。我们的表示有效地有效是否由节点表示的循环是完全嵌套的,并且下标表达和循环限制可以是循环指标的仿射或非仿射表达式。 LCG提供了并行化编译器可以用于自动选择良好迭代/数据分发的基本信息,并在程序执行期间调度所需的通信操作。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号