首页> 外文会议>ACM/IEEE Annual International Symposium on Computer Architecture >Non-Speculative Store Coalescing in Total Store Order
【24h】

Non-Speculative Store Coalescing in Total Store Order

机译:商店总订单中的非投机商店合并

获取原文

摘要

We present a non-speculative solution for a coalescing store buffer in total store order (TSO) consistency. Coalescing violates TSO with respect to both conflicting loads and conflicting stores, if partial state is exposed to the memory system. Proposed solutions for coalescing in TSO resort to speculation-and-rollback or centralized arbitration to guarantee atomicity for the set of stores whose order is affected by coalescing. These solutions can suffer from scalability, complexity, resource-conflict deadlock, and livelock problems. A non-speculative solution that writes out coalesced cachelines, one at a time, over a typical directory-based MESI coherence layer, has the potential to transcend these problems if it can guarantee absence of deadlock in a practical way. There are two major problems for a non-speculative coalescing store buffer: i) how to present to the memory system a group of coalesced writes as atomic, and ii) how to not deadlock while attempting to do so. For this, we introduce a new lexicographical order. Relying on this order, conflicting atomic groups of coalesced writes can be individually performed per cache block, without speculation, rollback, or replay, and without deadlock or livelock, yet appear atomic to conflicting parties and preserve TSO. One of our major contributions is to show that lexicographical orders based on a small part of the physical address (sub-address order) are deadlock-free throughout the system when taking into account resource-conflict deadlocks. Our approach exceeds the performance and energy benefits of two baseline TSO store buffers and matches the coalescing (and energy savings) of a release-consistency store buffer, at comparable cost.
机译:我们为总存储顺序(TSO)一致性提供了一种合并存储缓冲区的非推测性解决方案。如果部分状态暴露给内存系统,则合并在冲突的负载和冲突的存储方面都违反了TSO。在TSO中提出的合并解决方案,采用投机和回滚或集中仲裁的方法,以确保其订单受合并影响的一组商店的原子性。这些解决方案可能会遇到可伸缩性,复杂性,资源冲突死锁和活动锁问题。一种非推测性解决方案,如果可以保证以实际方式避免死锁,则一次可以在典型的基于目录的MESI一致性层上写出合并的高速缓存行,从而有可能克服这些问题。对于非推测性合并存储缓冲区,存在两个主要问题:i)如何将一组合并的写入作为原子提供给内存系统,以及ii)尝试这样做时如何不死锁。为此,我们引入了一个新的词典顺序。依靠此顺序,可以对每个缓存块分别执行合并写入的冲突原子组,而无需推测,回滚或重播,也没有死锁或活动锁,但对于冲突各方却是原子的,并保留了TSO。我们的主要贡献之一是,当考虑到资源冲突死锁时,基于物理地址的一小部分的字典编排顺序(子地址顺序)在整个系统中都是无死锁的。我们的方法超越了两个基准TSO存储缓冲区的性能和能源优势,并以可比较的成本与释放一致性存储缓冲区的合并(和节能)相匹配。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号