【24h】

Prefetching on the Cray-T3E

机译:在Cray-T3e上预取

获取原文

摘要

In many parallel applications, network latency causes a dramatic loss in processor utilization. This paper examines software controlled access pipelining (SCAP) as a technique for hiding network latency. An analytic model of SCAP describes basic operation techniques and predicts performance. Results are validated with benchmarks on the Cray-T3E. They show vectorized version of SCAP (V-SCAP) to be at least as fast as the highly optimized shared memory system functions. SCAP on the Cray-T3E improves performance compared to a blocking execution between 35% and 900%, while V-SCAP performs better with a factor of 2.1 to 62. SCAP achieves a performance speed-up against HPF between 48% to a factor of 9.2 dependent on the data access pattern. It also performs well on irregular access patterns which are not supported by the standard library.
机译:在许多并行应用中,网络延迟导致处理器利用率的急剧损失。本文将软件控制访问流水线(SCAP)视为隐藏网络延迟的技术。 SCAP的分析模型描述了基本操作技术和预测性能。结果用Cray-T3e的基准验证。它们显示了矢量化版本的SCAP(V-SCAP)至少与高度优化的共享内存系统功能一样快。 CRAY-T3E上的SCAP可以提高性能与35%和900%之间的阻塞执行相比,而V-SCAP更好地使用2.1至62倍。突出的速度降低了HPF的48%之间的性能速度9.2依赖于数据访问模式。它还对标准库不支持的不规则访问模式执行良好。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号