Conference: International conference/exhibition on high performance computing in the Asia-Pacific region; HPC-Asia'2000

Hiding Latency Through Bulk Transfer and Prefetching in Distributed Shared Memory Multiprocessors

Abstract

Distributed shared memory (DSM) machines provide a shared memory paradigm and achieve high performance by caching shared data. However, they suffer from cache misses and remote access latency with coarse-grain patterns. In this paper, we suggest the combination of bulk transfer and prefetching as a new latency hiding technique in DSM machines. The purpose of bulk transfer is to replicate remote data into local memory and thus reduce remote accesses. Adaptive Granularity (AG) was used for bulk transfer, and prefetching is added to fetch the replicated data into the cache at the right time. We could apply a simple prefetch scheduling, as in a uniprocessor, since bulk transfer converts remote accesses into local ones. Simulation results show reduced latency and the potential of AG as a preferable architecture for prefetching in DSM machines.
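
The two-step idea in the abstract (replicate remote data locally in bulk, then prefetch the local copy into the cache ahead of use) can be illustrated with a minimal C sketch. This is not the paper's Adaptive Granularity mechanism: `bulk_transfer`, `sum_with_prefetch`, and `PREFETCH_DISTANCE` are hypothetical names, `memcpy` only stands in for a bulk transfer that a real DSM machine would perform in its protocol or hardware, and `__builtin_prefetch` is the GCC/Clang software-prefetch builtin rather than anything the paper specifies.

```c
#include <stddef.h>
#include <string.h>

/* Hypothetical prefetch distance in elements; a real value would be tuned
 * to the memory latency and the work done per loop iteration. */
#define PREFETCH_DISTANCE 16

/* Stand-in for a bulk transfer: replicate a whole remote block into local
 * memory in one operation. Assumption: memcpy models the copy only; an
 * actual DSM machine would do this in its coherence protocol or hardware. */
static void bulk_transfer(double *local, const double *remote, size_t n)
{
    memcpy(local, remote, n * sizeof *local);
}

/* Sum the local replica while prefetching a fixed distance ahead.
 * Because the data is now local, a simple uniprocessor-style schedule
 * (constant distance, one prefetch per iteration) is enough. */
static double sum_with_prefetch(const double *local, size_t n)
{
    double sum = 0.0;
    for (size_t i = 0; i < n; i++) {
#if defined(__GNUC__) || defined(__clang__)
        /* Read prefetch (0) with high temporal locality (3). */
        if (i + PREFETCH_DISTANCE < n)
            __builtin_prefetch(&local[i + PREFETCH_DISTANCE], 0, 3);
#endif
        sum += local[i];
    }
    return sum;
}
```

A caller would first invoke `bulk_transfer` on the block it expects to reuse and then run the compute loop on the local copy. The prefetch only hints the cache, so the result is the same with or without it; any latency benefit depends on the machine and on the chosen distance.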