【24h】

A Unified Compressed Cache Hierarchy With Partial Cache Line Prefetching Used for SMT Processor

机译:用于SMT处理器的具有部分缓存行预取的统一压缩缓存层次结构

获取原文
获取原文并翻译 | 示例

摘要

This paper proposes a novel compressed cache hierarchy (UCCH, Unified Compressed Cache Hierarchy) that uses a unified compression algorithm, called Simple Frequent Pattern Compression (S-FPC), in both LI data cache and L2 cache to improve SMT processor performance. With the data locality of LI D-cache and L2 cache hurted by the competition of on-chip cache resources among threads, SMT technology distinctly increase the data bus bandwidth requirements between levels of caches and cache capacity requirements of LI D-cache and L2 cache. Because UCCH can increase the cache capacity of LI data cache and L2 cache without any sacrifice of the LI cache access latency, and the layout of compressed data in LI D-cache enable partial cache prefetching without introducing prefetch buffer or increasing cache pollution and memory traffic, UCCH is beneficial to improve SMT processor performance. The experiments showed that our unified compressed cache hierarchy can decrease LI D-cache miss rate 17.1%, decrease L1-L2 bus occupancy rate 15.3%, decrease main memory traffic bandwidth demand 49.3%, and improve instruction throughput 4.5% relative to tradition cache configure in SMT processor.
机译:本文提出了一种新颖的压缩缓存层次结构(UCCH,统一压缩缓存层次结构),该结构在LI数据缓存和L2缓存中均使用称为简单频繁模式压缩(S-FPC)的统一压缩算法,以提高SMT处理器的性能。随着线程间片上缓存资源的竞争损害了LI D缓存和L2缓存的数据局部性,SMT技术显着提高了缓存级别之间的数据总线带宽需求以及LI D缓存和L2缓存的缓存容量需求。因为UCCH可以增加LI数据高速缓存和L2高速缓存的高速缓存容量,而不会牺牲LI高速缓存访​​问延迟,并且LI D高速缓存中压缩数据的布局允许部分高速缓存预取,而不会引入预取缓冲区或增加高速缓存污染和内存流量,UCCH有助于提高SMT处理器性能。实验表明,相对于传统的缓存配置,我们统一的压缩缓存层次结构可以降低LI D缓存未命中率17.1%,降低L1-L2总线占用率15.3%,减少主存储器流量带宽需求49.3%,并提高指令吞吐量4.5%。在SMT处理器中。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号