首页> 外文会议>2017 IEEE 23rd Symposium on High Performance Computer Architecture >A Split Cache Hierarchy for Enabling Data-Oriented Optimizations
【24h】

A Split Cache Hierarchy for Enabling Data-Oriented Optimizations

机译:拆分缓存层次结构,用于实现面向数据的优化

获取原文
获取原文并翻译 | 示例

摘要

Today's caches tightly couple data with metadata (Address Tags) at the cache line granularity. The co-location of data and its identifying metadata means that they require multiple approaches to locate data (associative way searches and level-by-level searches), evict data (coherent writebacks buffers and associative level-by-level searches) and keep data coherent (directory indirections and associative level-by-level searches). This results in complex implementations with many corner cases, increased latency and energy, and limited flexibility for data optimizations. We propose splitting the metadata and data into two separate structures: a metadata hierarchy and a data hierarchy. Themetadata hierarchy tracks the location of the data in the data hierarchy. This allows us to easily apply many differentoptimizations to the data hierarchy, including smart data placement, dynamic coherence, and direct accesses. The new split cache hierarchy, Direct-to-Master (D2M), provides a unified mechanism for cache searching, eviction, and coherence, that eliminates level-by-level data movement and searches, associative cache address tags comparisons andabout 90% of the indirections through a central directory. Optimizations such as moving LLC slices to the near-side ofthe network and private/shared data classification can easily be built on top off D2M to further improve its efficiency. Thisapproach delivers a 54% improvement in cache hierarchy EDP vs. a mobile processor and 40% vs. a server processor, reducesnetwork traffic by an average of 70%, reduces the L1 miss latency by 30% and is especially effective for workloads with high cache pressure.
机译:当今的高速缓存以高速缓存行的粒度将数据与元数据(地址标签)紧密耦合。数据及其标识元数据的共置意味着它们需要多种方法来定位数据(关联方式搜索和逐级搜索),逐出数据(一致的写回缓冲区和关联的逐级搜索)并保留数据连贯的(目录重定向和逐级关联搜索)。这导致复杂的实现方式有很多极端情况,增加了等待时间和精力,并且数据优化的灵活性受到限制。我们建议将元数据和数据分为两个单独的结构:元数据层次结构和数据层次结构。 Themetadata层次结构跟踪数据在数据层次结构中的位置。这使我们能够轻松地对数据层次结构应用许多不同的优化,包括智能数据放置,动态一致性和直接访问。新的拆分式缓存层次结构,即直接对主设备(D2M),提供了用于缓存搜索,逐出和一致性的统一机制,从而消除了逐级数据移动和搜索,关联的缓存地址标签比较以及大约90%的缓存地址。通过中央目录进行间接访问。可以很容易地在D2M的最顶端建立诸如将LLC切片移动到网络近端以及私有/共享数据分类之类的优化,以进一步提高其效率。与移动处理器相比,此方法可将缓存层次结构的EDP提升54%,与服务器处理器相比,此方法可提升40%,平均减少70%的网络流量,将L1丢失等待时间减少30%,特别适用于具有高缓存的工作负载压力。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号