Conference: 11th IEEE/ACM International Symposium on Cluster, Cloud and Grid Computing

Contention Modeling for Multithreaded Distributed Shared Memory Machines: The Cray XMT

Abstract

Distributed Shared Memory (DSM) machines are a wide class of multi-processor computing systems in which a large virtually-shared address space is mapped onto a network of physically distributed memories. High memory latency and network contention are two of the main factors that limit performance scaling of such architectures. Modern high-performance computing DSM systems have evolved towards the exploitation of massive hardware multi-threading and fine-grained memory hashing to tolerate irregular latencies, avoid network hot-spots, and improve scalability. Parallel simulation is a promising approach, which has been extensively used to model the performance of such large-scale machines. One of the most critical factors in coping with the simulation speed-accuracy trade-off is network modeling. The Cray XMT is a massively multi-threaded supercomputing architecture that belongs to the DSM class. In this paper, we discuss the development of a network contention model for a full-system XMT simulator. We start by measuring the effects of network contention on a 128-processor XMT machine. We then investigate the trade-off between simulation accuracy and speed by comparing three network models that operate at different levels of accuracy. The comparison and model validation are performed by executing a string-matching algorithm on the full-system simulator and on the actual machine, using three datasets that generate noticeably different contention patterns. Results show that the simulator's execution-time accuracy remains within 10% of the real machine. We also show that the slowdown due to contention modeling is limited to 20% when simulating a small number of processors, and becomes negligible for simulations with higher processor counts.