首页> 外文学位 >Efficient run-time support for global view programming of linked data structures on distributed memory parallel systems.
【24h】

Efficient run-time support for global view programming of linked data structures on distributed memory parallel systems.

机译:对分布式内存并行系统上链接数据结构的全局视图编程的有效运行时支持。

获取原文
获取原文并翻译 | 示例

摘要

Developing high-performance parallel applications that use linked data structures on distributed-memory clusters is challenging. Many scientific applications use algorithms based on linked data structures like trees and graphs. These structures are especially useful in representing relationships between data which may not be known until runtime or may otherwise evolve during the course of a computation. Methods such as n-body simulation, Fast Multipole Methods (FMM), and multiresolution analysis all use trees to represent a fixed space populated by a dynamic distribution of elements. Other problem domains, such as data mining, use both trees and graphs to summarize large input datasets into a set of relationships that capture the information in a form that lends itself to efficient mining.;This dissertation first describes a runtime system that provides a programming interface to a global address space representation of generalized distributed linked data structures, while providing scalable performance on distributed memory computing systems. This system, the Global Chunk Layer (GCL), provides data access primitives at the element level, but takes advantage of coarse-grained data movement to enhance locality and improve communication efficiency. The key benefits of using the GCL system include efficient shared-memory style programming of distributed dynamic, linked data structures, the abstraction and optimization of structural elements common to linked data, and the ability to customize many aspects of the runtime to tune application performance.;Additionally, this dissertation presents the design and implementation of a tree-specific system for efficient parallel global address space computing. The Global Trees (GT) library provides a global view of distributed linked tree structures and a set of routines that operate on these structures. GT is built on top of the generalized data structure support provided by the GCL runtime and can inter-operate with other parallel programming models such as MPI, or along with existing global view approaches such as Global Arrays. This approach is based on two key insights: First, tree-based algorithms are easily expressed in a fine-grained manner, but data movement must be done at a much coarser level of granularity for good performance. Second, since GT is focused on a single data abstraction, attributes unique to tree structures can be exploited to provide optimized routines for common operations.;This dissertation also describes techniques for improving program performance and program understanding using these frameworks. Data locality has a significant impact on the communication properties of parallel algorithms. Techniques are presented that use profile-driven data reference traces to perform two types of data layout in Global Trees programs. Lastly, approaches for understanding and analyzing program performance data are presented along with tools for visualizing GT structures. These tools enable GT developers to better understand and optimize program performance.
机译:开发在分布式内存集群上使用链接数据结构的高性能并行应用程序具有挑战性。许多科学应用程序都使用基于链接数据结构(如树和图)的算法。这些结构在表示数据之间的关系时特别有用,这些数据之间的关系可能要到运行时才能知道,否则可能会在计算过程中演变。 n体模拟,快速多极方法(FMM)和多分辨率分析等方法都使用树表示由元素动态分布填充的固定空间。其他问题领域,例如数据挖掘,同时使用树和图将大型输入数据集汇总为一组关系,这些关系以一种有助于高效挖掘的形式捕获信息。;本论文首先介绍了一种提供编程的运行时系统。通用分布式链接数据结构的全局地址空间表示的接口,同时在分布式内存计算系统上提供可伸缩的性能。该系统称为全局块层(GCL),它在元素级别提供数据访问原语,但利用粗粒度数据移动来增强局部性并提高通信效率。使用GCL系统的主要好处包括对分布式动态链接数据结构进行有效的共享内存样式编程,链接数据共有的结构元素的抽象和优化,以及自定义运行时的各个方面以调整应用程序性能的能力。 ;此外,本文提出了一种用于高效并行全局地址空间计算的树状系统的设计和实现。全局树(GT)库提供了分布式链接树结构的全局视图以及在这些结构上运行的一组例程。 GT是在GCL运行时提供的通用数据结构支持的基础上构建的,可以与其他并行编程模型(如MPI)或现有的全局视图方法(如全局数组)进行互操作。这种方法基于两个关键的见解:首先,基于树的算法很容易以细粒度的方式表示,但是为了获得良好的性能,必须以更粗糙的粒度进行数据移动。其次,由于GT专注于单一数据抽象,因此可以利用树结构特有的属性来为常见操作提供优化的例程。本文还介绍了使用这些框架来提高程序性能和程序理解的技术。数据局部性对并行算法的通信属性有重大影响。提出了使用概要文件驱动的数据引用跟踪在Global Trees程序中执行两种类型的数据布局的技术。最后,介绍了用于理解和分析程序性能数据的方法以及用于可视化GT结构的工具。这些工具使GT开发人员可以更好地理解和优化程序性能。

著录项

  • 作者

    Larkins, Darrell Brian.;

  • 作者单位

    The Ohio State University.;

  • 授予单位 The Ohio State University.;
  • 学科 Computer Science.
  • 学位 Ph.D.
  • 年度 2010
  • 页码 166 p.
  • 总页数 166
  • 原文格式 PDF
  • 正文语种 eng
  • 中图分类
  • 关键词

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号