首页> 外文会议>ACM/IEEE conference on Supercomputing >Performance of the CRAY T3E multiprocessor
【24h】

Performance of the CRAY T3E multiprocessor

机译:CRAY T3E多处理器的性能

获取原文

摘要

The CRAY T3E is a scalable shared-memory multiprocessor based on the DEC Alpha 21164 microprocessor. The system includes a number of novel architectural features designed to tolerate latency, enhance scalability, and deliver high performance on scientific and engineering codes. Included among these are stream buffers, which detect and prefetch down small-stride reference streams, E-registers, which provide latency hiding and non-unit-stride access capabilities, barrier and fetch_and_op synchronization support, and a scalable, high-bandwidth interconnection network.This paper reports our experiences with the CRAY T3E and presents a variety of performance measurements. Section 2 provides a brief overview of the system architecture. Section 3 describes the latency-hiding features (caches, stream buffers and E-registers) in more detail, assesses their performance impact, and discusses coding techniques for using them. Section 4 presents single-processor performance results. Finally, Section 5 discusses system scalability.
机译:CRAY T3E是基于DEC Alpha 21164微处理器的可扩展共享内存多处理器。该系统包括许多新颖的体系结构功能,旨在容忍延迟,增强可伸缩性并在科学和工程规范上提供高性能。其中包括流缓冲区,用于检测并预取小步长参考流, E寄存器,它们提供延迟隐藏和非单步长访问功能,屏障和fetch_and_op同步支持,以及可扩展的高带宽互连网络。本文报告了我们在CRAY T3E方面的经验,并提出了各种性能指标。第2节简要概述了系统体系结构。第3节更详细地描述了延迟隐藏功能(缓存,流缓冲区和E寄存器),评估了它们对性能的影响,并讨论了使用它们的编码技术。第4节介绍了单处理器性能结果。最后,第5节讨论系统可伸缩性。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号