Journal: Concurrency and Computation

A comparison of the Cray XMT and XMT-2

Abstract

We compare the performance of the Cray XMT and XMT-2 massively multithreaded supercomputers. We use benchmarks to evaluate memory accesses for various types of loops, and we compare the two machines on matrix multiply and on three previously implemented dynamic programming algorithms. The relative performance of these machines depends on the size of the configuration (number of processors) as well as the size of the problem being evaluated. In particular, for the same problem size, small configurations of the original XMT can sometimes perform slightly better than larger configurations of the XMT-2. Under heavy memory load, loop performance can saturate well before the maximum number of available processors is reached, which suggests that using the maximum number of processors is not always useful for a given run. We also show that manual restructuring of nested loops, including decreasing the parallelism, can yield major performance improvements. These results indicate that careful exploration of problem sizes, processor counts, and loop parallelization choices can yield substantial improvements in performance, which can be very significant for production codes that run for extended periods.
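As a rough illustration of what restructuring a nested loop can mean for matrix multiply, the sketch below computes the same product with two loop orders. It is a generic loop interchange written in plain serial Python, not the restructuring used in the paper, and the function names `matmul_ijk` and `matmul_ikj` are hypothetical. The abstract does not say which loops were restructured or how.

```python
def matmul_ijk(a, b):
    """Textbook order: the innermost loop walks down a column of b."""
    n, m, p = len(a), len(b), len(b[0])
    c = [[0] * p for _ in range(n)]
    for i in range(n):
        for j in range(p):
            for k in range(m):
                c[i][j] += a[i][k] * b[k][j]
    return c


def matmul_ikj(a, b):
    """Interchanged order: the innermost loop walks along rows of b and c."""
    n, m, p = len(a), len(b), len(b[0])
    c = [[0] * p for _ in range(n)]
    for i in range(n):
        row_c = c[i]
        for k in range(m):
            a_ik = a[i][k]
            row_b = b[k]
            for j in range(p):
                row_c[j] += a_ik * row_b[j]
    return c


if __name__ == "__main__":
    a = [[1, 2], [3, 4]]
    b = [[5, 6], [7, 8]]
    # Both orders compute the same product; only the memory access pattern differs.
    print(matmul_ijk(a, b) == matmul_ikj(a, b))
```

In both versions the iterations over `i` are independent, because each one writes only its own row of `c`, so `i` is the natural loop to run in parallel. The `k` loop accumulates into shared entries of `c`, so running it in parallel would need a reduction or synchronization. Parallelizing only the outer loop is one simple way of decreasing the parallelism of a loop nest.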
