10th IEEE/ACM International Conference on Cluster, Cloud and Grid Computing

Region-Based Prefetch Techniques for Software Distributed Shared Memory Systems



Abstract

Although shared memory programming models offer better programmability than message passing programming models, their implementation on page-based software distributed shared memory (sDSM) systems usually suffers from high memory consistency costs. The major part of these costs is the inter-node data transfer needed to keep the virtual shared memory consistent. A good prefetch strategy can reduce this cost. We develop two prefetch techniques, TReP and HReP, which are based on the execution history of each parallel region. These techniques are evaluated using offline simulations with the NAS Parallel Benchmarks and the LINPACK benchmark. On average, TReP achieves an efficiency (ratio of pages prefetched that were subsequently accessed) of 96% and a coverage (ratio of access faults avoided by prefetches) of 65%. HReP achieves a lower efficiency of 91% but a higher coverage of 79%. Treating the cost of an incorrectly prefetched page as equivalent to that of a miss, these techniques have an effective page miss rate of 63% and 71% respectively. Additionally, these two techniques are compared with two well-known sDSM prefetch techniques, Adaptive++ and TODFCM. TReP effectively reduces the page miss rate by 53% and 34% more, and HReP by 62% and 43% more, than Adaptive++ and TODFCM respectively. As with Adaptive++, these techniques also permit bulk prefetching of pages predicted using temporal locality, which amortizes network communication costs and allows bandwidth gains from multi-rail network interfaces.
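The abstract's metrics can be made concrete with a small sketch. It is not taken from the paper: the class name `PrefetchStats`, its fields, and the sample counts are hypothetical, and it assumes that every usefully prefetched page avoids exactly one access fault and that every unused prefetch is charged as one miss.

```python
from dataclasses import dataclass


@dataclass
class PrefetchStats:
    """Hypothetical counters for one simulated run (names are assumptions)."""
    baseline_faults: int    # access faults observed with no prefetching
    prefetched: int         # pages fetched ahead of use
    useful_prefetches: int  # prefetched pages that were subsequently accessed

    def efficiency(self) -> float:
        # Ratio of prefetched pages that were subsequently accessed.
        return self.useful_prefetches / self.prefetched

    def coverage(self) -> float:
        # Ratio of access faults avoided by prefetches
        # (assumes one avoided fault per useful prefetch).
        return self.useful_prefetches / self.baseline_faults

    def effective_miss_rate(self) -> float:
        # Remaining faults plus unused prefetches, each unused prefetch
        # counted as a miss, relative to the no-prefetch baseline.
        remaining = self.baseline_faults - self.useful_prefetches
        wasted = self.prefetched - self.useful_prefetches
        return (remaining + wasted) / self.baseline_faults


# Illustrative counts chosen to match TReP's averaged 96% / 65% figures.
trep_like = PrefetchStats(baseline_faults=1000, prefetched=677, useful_prefetches=650)
print(f"efficiency          {trep_like.efficiency():.3f}")
print(f"coverage            {trep_like.coverage():.3f}")
print(f"effective miss rate {trep_like.effective_miss_rate():.3f}")
```

Under these assumptions the TReP-like counts leave about 38% of the baseline misses, which is a reduction of about 62%. The same arithmetic with 91% efficiency and 79% coverage gives a reduction of about 71%. Whether the abstract's 63% and 71% figures are meant as reductions of this kind is not stated, and averaging per benchmark could shift the numbers slightly.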
