首页> 外文期刊>Concurrency and Computation >Exploring the performance of massively multithreaded architectures
【24h】

Exploring the performance of massively multithreaded architectures

机译:探索大规模多线程体系结构的性能

获取原文
获取原文并翻译 | 示例

摘要

We present a new scheme for evaluating the performance of multithreaded computers and demonstrate its application to the Cray MTA-2 and XMT supercomputers. Our scheme is based on the concept of clock cycles per element, C, plotted against both problem size and the number of processors. This scheme clearly shows if an implementation has achieved its asymptotic efficiency and is more general than (but includes) the commonly used speedup metric. It permits the discovery of any imperfections in both the software as well as the hardware, and is expected to permit a unified comparison of many different parallel architectures. Measurements on a number of well-known parallel algorithms, ranging from matrix multiply to quicksort, are presented for the MTA-2 and XMT and highlight some interesting differences between these machines. The performance of sequence alignment using dynamic programming is evaluated on the MTA-2, XMT, IBM x3755 and SGI Altix 350 and provides a useful comparison of the capabilities of the Cray machines with more conventional shared memory architectures.
机译:我们提出了一种用于评估多线程计算机性能的新方案,并演示了其在Cray MTA-2和XMT超级计算机中的应用。我们的方案基于每个元素C的时钟周期的概念,并针对问题大小和处理器数量进行了绘制。该方案清楚地表明一种实现是否已实现其渐近效率,并且比(但包括)常用的加速指标更通用。它允许发现软件和硬件中的任何缺陷,并且有望允许对许多不同的并行体系结构进行统一比较。对MTA-2和XMT进行了许多从矩阵乘法到快速排序的著名并行算法的测量,并突出了这些机器之间的一些有趣的区别。在MTA-2,XMT,IBM x3755和SGI Altix 350上评估了使用动态编程进行序列比对的性能,并提供了Cray机器与更常规的共享内存体系结构的功能的有用比较。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号