首页> 外文会议> >Interleaved Parallel Schemes: improving memory throughput on supercomputers
【24h】

Interleaved Parallel Schemes: improving memory throughput on supercomputers

机译:交错并行方案:提高超级计算机上的内存吞吐量

获取原文

摘要

On many commercial supercomputers, several vector register processors share a global highly interleaved memory in a MIMD mode. When all the processors are working on a single vector loop, a significant part of the potential memory throughput may be wasted due to the asynchronism of the processors. In order to limit this loss of memory throughput, a SIMD synchronization mode for vector accesses to memory may be used. But an important part of the memory bandwidth may be wasted when accessing vectors with an even stride. In this paper, we present IPS, a interleaved parallel scheme, which ensures an equitable distribution of elements on a highly interleaved memory for a wide range a vector strides. We show how to organize access to memory, such that unscrambling of vectors from memory to the vector register processors requires a minimum number of passes through the interconnection network.
机译:在许多商用超级计算机上,几个向量寄存器处理器以MIMD模式共享全局高度交错的内存。当所有处理器都在单个矢量循环上工作时,由于处理器的异步性,可能浪费了很大一部分潜在的内存吞吐量。为了限制这种存储器吞吐量的损失,可以使用用于对存储器的向量访问的SIMD同步模式。但是,当跨步访问向量时,可能会浪费内存带宽的重要部分。在本文中,我们提出了IPS,一种交错并行方案,该方案可确保在宽跨度的矢量跨度范围内,在高度交错的内存上公平分配元素。我们展示了如何组织对存储器的访问,从而使向量从存储器到向量寄存器处理器的解密需要通过互连网络的最少次数。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号