首页> 外文期刊>International journal of parallel programming >On the correctness of program execution when cache coherence is maintained locally at data-sharing boundaries in distributed shared memory multiprocessors
【24h】

On the correctness of program execution when cache coherence is maintained locally at data-sharing boundaries in distributed shared memory multiprocessors

机译:关于在分布式共享内存多处理器中的数据共享边界本地保持缓存一致性时的程序执行的正确性

获取原文
获取原文并翻译 | 示例
           

摘要

Emerging multiprocessor architectures such as chip Multiprocessors, embedded architectures, and massively parallel architectures, demand faster, more efficient, and more scalable cache coherence schemes. In devising more cost-efficient schemes, formal insights into a system model is deemed useful. We. in this paper, build formalisms for execution in cache based Distributed shared-memory multiprocessors (DSM) obeying Release Consistency model, and derive conditions for cache coherence. A cost-efficient cache coherence scheme without directories is designed. Our approach relies on processor directed coherence actions, which are early in nature. The scheme exploits sharing information provided by a programmer-centric framework. Per-processor coherence buffers (CB) are employed to impose coherence on live shared variables between consecutive release points in the execution. Simulation of 8 entry 4-way associative CB based system achieves a speedup of 1.07-4.31 over full-map 3-hop directory scheme for six of the SPLASH-2 benchmarks.
机译:新兴的多处理器体系结构,例如芯片多处理器,嵌入式体系结构和大规模并行体系结构,需要更快,更有效和更具可伸缩性的缓存一致性方案。在设计更具成本效益的方案时,对系统模型的正式见解被认为是有用的。我们。在本文中,遵循发布一致性模型,建立了在基于缓存的分布式共享内存多处理器(DSM)中执行的形式主义,并推导了缓存一致性的条件。设计了一种不带目录的经济高效的缓存一致性方案。我们的方法依赖于处理器指导的一致性动作,这种动作本质上是早期的。该方案利用了以程序员为中心的框架提供的共享信息。使用每个处理器的一致性缓冲区(CB)对执行中连续释放点之间的实时共享变量施加一致性。对六个SPLASH-2基准测试,基于8入口4路关联CB的系统的仿真比全图3跳目录方案提高了1.07-4.31的速度。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号