A comparison of the Cray XMT and XMT-2

Shahid H. Bokhari; Saniyah S. Bokhari

首页> 外文期刊>Concurrency and Computation >A comparison of the Cray XMT and XMT-2

【24h】

A comparison of the Cray XMT and XMT-2

机译：Cray XMT和XMT-2的比较

获取原文

获取原文并翻译 | 示例

开具论文收录证明 >>

页面导航

摘要
著录项
引文网络
相似文献
相关主题

摘要

We explore the comparative performance of the Cray XMT and XMT-2 massively multithreaded supercomputers. We use benchmarks to evaluate memory accesses for various types of loops. We also compare the performance of these machines on matrix multiply and on three previously implemented dynamic programming algorithms. It is shown that the relative performance of these machines is dependent on the size (number of processors) of the configuration, as well as the size of the problem being evaluated. In particular, small configurations of the original XMT can sometimes show slightly better performance than larger configurations of the XMT-2, for the same problem size. We note that, under heavy memory load, performance of loops can saturate well before the maximum number of processors available. This suggests that it may not always be useful to use the maximum number of processors for a specific run. We also show that manual restructuring of nested loops, including decreasing the parallelism, can result in major improvements in performance. The results in this paper indicate that careful exploration of the space of problem sizes, number of processors used, and choices of loop parallelization can yield substantial improvements in performance. These improvements can be very significant for production codes that run for extended periods of time.

机译：我们探索了Cray XMT和XMT-2大型多线程超级计算机的比较性能。我们使用基准来评估各种类型的循环的内存访问。我们还比较了这些机器在矩阵乘法和三种以前实现的动态编程算法上的性能。结果表明，这些机器的相对性能取决于配置的大小（处理器数量）以及所评估问题的大小。特别是，对于相同的问题大小，原始XMT的小配置有时可能会比XMT-2的大配置显示出更好的性能。我们注意到，在沉重的内存负载下，循环性能可能会在最大数量的可用处理器之前达到饱和。这表明对于特定的运行使用最大数量的处理器可能并不总是有用的。我们还表明，嵌套循环的手动重组（包括降低并行度）可以导致性能上的重大改进。本文的结果表明，仔细研究问题大小，使用的处理器数量以及选择循环并行化的空间可以显着提高性能。这些改进对于长时间运行的生产代码非常重要。

著录项

来源
《Concurrency and Computation》 |2013年第15期|2123-2139|共17页
作者
Shahid H. Bokhari; Saniyah S. Bokhari;
展开▼
作者单位

Department of Biomedical Informatics, 3190 Graves Hall, 333 W. 10th Avenue Columbus, OH 43210, USA;

Department of Computer Science and Engineering, The Ohio State University, Columbus, OH, USA;

展开▼
收录信息
原文格式 PDF
正文语种 eng
中图分类
关键词
Cray XMT; Cray XMT-2; matrix multiply; dynamic programming; multithreading; parallel algorithms; parallel computing; reassortment; sequence alignment; shared memory; subset-sum problem;

机译：克雷XMT;克雷XMT-2;矩阵乘法动态编程多线程并行算法;并行计算;重新组合;序列比对;共享内存;子和问题;

相似文献

外文文献
中文文献
专利

1. Massively multithreaded maxflow for image segmentation on the Cray XMT-2 [J] . Shahid H. Bokhari, Ümit V. Çatalyürek, Metin N. Gurcan Concurrency and Computation . 2014,第18期

机译：大规模多线程maxflow在Cray XMT-2上进行图像分割
2. Fast and Accurate Simulation of the Cray XMT Multithreaded Supercomputer [J] . Villa Oreste, Tumeo Antonino, Secchi Simone, Parallel and Distributed Systems, IEEE Transactions on . 2012,第12期

机译：Cray XMT多线程超级计算机的快速准确的仿真
3. Eigensolver performance comparison on Cray XC systems [J] . Brandon Cook, Thorsten Kurth, Jack Deslippe, Concurrency, practice and experience . 2019,第16期

机译：Cray XC系统上的Eigensolver性能比较
4. Experimental comparison of emulated lock-free vs. fine-grain locked data structures on the Cray XMT [C] . Farber R., Mizell D. 2010 IEEE International Symposium on Parallel Distributed Processing, Workshops and Phd Forum . 2010

机译：Cray XMT上模拟的无锁与细粒度锁定数据结构的实验比较
5. Performance analysis of pure MPI versus MPI+OpenMP for Jacobi Iteration and a three-dimensional FFT on the Cray XT5. [D] . Weiss, Olga. 2012

机译：纯CPI与MPI + OpenMP进行Jacobi迭代和在Cray XT5上进行三维FFT的性能分析。
6. Massively Multithreaded Maxflow for Image Segmentation on the Cray XMT-2 [O] . Shahid H. Bokhari, Ümit V. Çatalyürek, Metin N. Gurcan -1

机译：大规模多线程Maxflow在Cray XMT-2上进行图像分割
7. Massively Multithreaded Maxflow for Image Segmentation on the Cray XMT-2 [O] . Shahid H. Bokhari, Ümit V. Çatalyürek, Metin N. Gurcan, 2013

机译：用于Cray XMT-2的图像分割的大规模多线程Maxflow

A comparison of the Cray XMT and XMT-2

摘要

著录项

引文网络

相似文献

相关主题

期刊订阅