首页> 外文学位 >A dual address space architecture: Implementation and evaluation.
【24h】

A dual address space architecture: Implementation and evaluation.

机译:双重地址空间体系结构:实现和评估。

获取原文
获取原文并翻译 | 示例

摘要

While a hardware supported shared address space offers programmability advantages and better performance for fine-grained applications, the very mechanisms that create those advantages appear to prevent the machines that use them from scaling to large numbers of processors. For this reason, hardware-based distributed shared memory (DSM) platforms have largely been dismissed in the context of scientific computing. This dissertation proposes changes to a hardware-based DSM architecture that allow users to use two address spaces to gain the scalability of distributed architectures while retaining the benefits of the shared address space architecture.;The thesis of this dissertation is that the Dual Address Space Architecture provides both an efficient mechanism for enabling increased user and compiler control over data consistency and data locality, and a high-level interface for programmers to conceptualize and take advantage of the mechanism. The Dual Address Space Architecture thus provides users and compilers an efficient means, particularly appropriate to the structure of scientific applications, of keeping data consistent and local, enabling improved performance over current global address space implementations.;The dissertation evaluates its thesis in two parts. The first part demonstrates the feasibility of the ideas by describing the details of the high-level architecture, including the programming model, and its implementation via extensions to a standard directory-based cache coherence protocol. The second part evaluates the performance of Dual Address Space implementations of two applications that currently underperform on distributed address space platforms. First, cycle-accurate simulation results indicate that a Dual Address Space version of HYCOM, an ocean model which features an irregular data decomposition, significantly reduces the number of last-level cache misses that cause communication, resulting in substantial improvement in performance. Second, experiments with two implementations (distributed memory and shared memory) of the Fast Multipole Method, a tree-based algorithm for solving N-body problems, expose weaknesses in both that would be cleanly and efficiently addressed by a Dual Address Space implementation of the algorithm.
机译:虽然硬件支持的共享地址空间为细粒度应用程序提供了可编程性优势和更好的性能,但正是这些机制创造了这些优势,似乎阻止了使用它们的计算机扩展到大量处理器。因此,在科学计算的背景下,基于硬件的分布式共享内存(DSM)平台已被大量淘汰。本文提出了对基于硬件的DSM体系结构的改变,该体系结构允许用户使用两个地址空间来获得分布式体系结构的可伸缩性,同时保留共享地址空间体系结构的优势。提供了一种用于提高用户和编译器对数据一致性和数据局部性的控制的有效机制,以及为程序员提供概念化和利用该机制的高级接口。因此,双地址空间体系结构为用户和编译器提供了一种有效的方法,特别适合于科学应用程序的结构,以保持数据的一致性和局部性,从而在当前的全局地址空间实现上实现了更高的性能。第一部分通过描述高级体系结构的详细信息(包括编程模型)及其通过对基于标准目录的缓存一致性协议的扩展来实现的方法,论证了这些思想的可行性。第二部分评估了目前在分布式地址空间平台上表现不佳的两个应用程序的双地址空间实现的性能。首先,具有周期精确性的仿真结果表明,HYCOM的双地址空间版本(一种具有不规则数据分解功能的海洋模型)显着减少了导致通信的最后一级缓存未命中的次数,从而显着提高了性能。其次,使用快速多极方法的两种实现方式(分布式内存和共享内存)进行实验,这是一种用于解决N体问题的基于树的算法,它暴露了两者中的弱点,而该弱点可以通过对偶地址空间的实现干净而有效地解决。算法。

著录项

  • 作者

    McCurdy, Collin B.;

  • 作者单位

    The University of Wisconsin - Madison.;

  • 授予单位 The University of Wisconsin - Madison.;
  • 学科 Computer Science.
  • 学位 Ph.D.
  • 年度 2008
  • 页码 113 p.
  • 总页数 113
  • 原文格式 PDF
  • 正文语种 eng
  • 中图分类 自动化技术、计算机技术;
  • 关键词

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号