【24h】

Pseudovectorization, SMP, and Message Passing on the Hitachi SR8000-F1

机译:日立SR8000-F1上的伪向量化,SMP和消息传递

获取原文
获取原文并翻译 | 示例

摘要

The Leibniz-Rechenzentrum in Munich has started operating a 112-node Hitachi SR8000-F1 with a peak performance of 1.3 Teraflops in the second quarter of 2000, the fastest computer in Europe. In order to make use of the full memory bandwidth and hence to obtain a significant fraction of the peak performance for memory intensive applications, the compilers offer preload and prefetch optimization strategies to pipeline load/store operations, as well as automatic parallelization across the 8 processors contained in every node. The nodes are connected by a conflict-free crossbar, enabling efficient communication via standard message-passing interfaces. An overview of the innovative architectural concepts is given. We demonstrate to which extent the capabilities of the compiler to automatically pseudovectorize/parallelize typical application code are sufficient to produce well-performing code.
机译:慕尼黑的Leibniz-Rechenzentrum公司已开始运行112节点的日立SR8000-F1,该机在2000年第二季度的峰值性能为1.3 Teraflops,是欧洲最快的计算机。为了利用全部内存带宽并因此获得内存密集型应用程序的大部分峰值性能,编译器提供了预加载和预取优化策略来流水线加载/存储操作,以及跨8个处理器的自动并行化包含在每个节点中。节点通过无冲突交叉开关连接,从而可以通过标准消息传递接口进行有效通信。概述了创新的建筑概念。我们展示了编译器自动伪矢量化/并行化典型应用程序代码的能力在何种程度上足以产生性能良好的代码。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号