首页> 外文会议>IEEE/ACM International Symposium on Microarchitecture >CoreRacer: A practical memory race recorder for multicore x86 TSO processors
【24h】

CoreRacer: A practical memory race recorder for multicore x86 TSO processors

机译:CoreRacer:适用于多核x86 TSO处理器的实用内存竞赛记录器

获取原文

摘要

Shared memory multiprocessors are difficult to program because of the non-deterministic ways in which the memory operations from different threads interleave. To address this issue, many hardware-based memory race recorders have been proposed that efficiently log an ordering of the shared memory interleavings between threads for deterministic replay. These approaches are challenging to integrate into current processors because they change the cache subsystem or the coherence protocol, and they mostly support a sequentially consistent memory model. In this paper, we describe CoreRacer, a chunk-based memory race recorder architecture for multicore x86 TSO processors. CoreRacer does not modify the cache subsystem and yet it still integrates into the x86 TSO memory model. We show that by leveraging a specific x86 feature, the invariant timestamp, CoreRacer maintains ordering among chunks without piggybacking on cache coherence messages. We provide a detailed implementation and evaluation of CoreRacer on a cycle-accurate x86 simulator. We show that its integration cost into x86 is minimal and its overhead has negligible effect on performance.
机译:共享内存多处理器难以编程,因为来自不同线程的内存操作以非确定性方式进行交织。为了解决这个问题,已经提出了许多基于硬件的内存竞争记录器,它们可以有效地记录线程之间共享内存交错的顺序,以进行确定性重放。这些方法很难集成到当前的处理器中,因为它们更改了缓存子系统或一致性协议,并且它们大多支持顺序一致的内存模型。在本文中,我们描述了CoreRacer,这是一种用于多核x86 TSO处理器的基于块的内存竞赛记录器体系结构。 CoreRacer不会修改缓存子系统,但仍集成到x86 TSO内存模型中。我们展示了通过利用特定的x86功能(不变的时间戳记),CoreRacer可以在块之间保持排序,而不会piggy带缓存一致性消息。我们在周期精确的x86模拟器上提供了CoreRacer的详细实现和评估。我们证明了它与x86的集成成本极低,并且其开销对性能的影响可以忽略不计。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号