首页> 外文学位 >Memory systems for parallel programming.
【24h】

Memory systems for parallel programming.

机译:用于并行编程的存储系统。

获取原文
获取原文并翻译 | 示例

摘要

Distributed Shared-Memory (DSM) computers, which partition physical memory among a collection of workstation-like computing nodes, are emerging as the way to implement parallel computers, as they promise scalability and high performance. Shared-memory DSM machines use a coherence protocol to manage the replication of data and to ensure that a parallel program sees a consistent view of memory.; Applications have very different patterns of communication and no single, general-purpose protocol suits all programs. This has prompted interest in systems in which a protocol is implemented in flexible software instead of being fixed in hardware. DSM machines with software-implemented coherence protocols provide opportunities for a variety of more complex and application-specific protocols and allow for protocols that do not just ensure consistent memory, but also provide new functionality and semantics.; Parallel programming has long faced a tension between the goals of high performance and ease of use. Languages and tools can make parallel computers easier to use, but concerns about their efficiency have limited their usage. This thesis demonstrates that some high-level languages and tools can be implemented more efficiently by taking advantage of the cache coherence protocols that underly software DSM machines, thereby improving both performance and ease of use.; This thesis describes a family of custom protocols that efficiently implement a large-grain data-parallel language C{dollar}sp{lcub}**{rcub}{dollar}. On programs for which static analysis is imprecise, these Loosely Coherent Memory (LCM) protocols improve performance from a few percent up to a factor of 3, and reduce memory overheads from a factor of 2 to a factor of 5 over a compiler-copying scheme. LCM also improves performance in C-code programs by up to a factor of 3.; This thesis also presents custom cache-coherence protocols that perform on-the-fly detection of actual data races for programs with barrier synchronization. Overheads in execution time for the race-detection protocols were shown to range from zero to less than a factor of three--a significant improvement over comparable approaches--and race-detection protocols found actual program errors in two applications.
机译:分布式共享内存(DSM)计算机在一系列工作站式计算节点之间划分物理内存,由于它们具有可扩展性和高性能,它们正在成为实现并行计算机的一种方式。共享内存DSM计算机使用一致性协议来管理数据的复制并确保并行程序看到一致的内存视图。应用程序具有非常不同的通信模式,没有一种通用的协议适合所有程序。这引起了人们对在柔性软件中实现协议而不是在硬件中实现协议的系统的兴趣。具有软件实现的一致性协议的DSM机器为各种更复杂的和特定于应用程序的协议提供了机会,并且允许不仅确保一致的内存而且还提供新的功能和语义的协议。长期以来,并行编程一直面临着高性能和易用性之间的矛盾。语言和工具可以使并行计算机更易于使用,但是对并行计算机效率的关注限制了它们的使用。本文证明,通过利用作为软件DSM机器基础的缓存一致性协议,可以更高效地实现某些高级语言和工具,从而提高性能和易用性。本文介绍了一系列自定义协议,这些协议可有效实现大粒度数据并行语言C {dollar} sp {lcub} ** {rcub} {dollar}。在静态分析不精确的程序上,这些松散相干内存(LCM)协议将性能从百分之几提高到三分之一,并且与编译器复制方案相比,将内存开销从二分之一降低到五分之一。 LCM还可以将C代码程序的性能提高多达3倍。本文还介绍了自定义的缓存一致性协议,该协议对带有屏障同步的程序执行实时的数据竞争检测。种族检测协议的执行时间开销从零到不到三倍不等-与同类方法相比有显着改进-种族检测协议在两个应用程序中发现了实际的程序错误。

著录项

  • 作者

    Richards, Bradley Eric.;

  • 作者单位

    The University of Wisconsin - Madison.;

  • 授予单位 The University of Wisconsin - Madison.;
  • 学科 Computer Science.
  • 学位 Ph.D.
  • 年度 1996
  • 页码 150 p.
  • 总页数 150
  • 原文格式 PDF
  • 正文语种 eng
  • 中图分类 自动化技术、计算机技术;
  • 关键词

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号