首页> 外文OA文献 >Hardware-Oriented Cache Management for Large-Scale Chip Multiprocessors
【2h】

Hardware-Oriented Cache Management for Large-Scale Chip Multiprocessors

机译:面向大型芯片多处理器的面向硬件的缓存管理

代理获取
本网站仅为用户提供外文OA文献查询和代理获取服务,本网站没有原文。下单后我们将采用程序或人工为您竭诚获取高质量的原文,但由于OA文献来源多样且变更频繁,仍可能出现获取不到、文献不完整或与标题不符等情况,如果获取不到我们将提供退款服务。请知悉。

摘要

One of the key requirements to obtaining high performance from chip multiprocessors (CMPs) is to effectively manage the limited on-chip cache resources shared among co-scheduled threads/processes. This thesis proposes new hardware-oriented solutions for distributed CMP caches. Computer architects are faced with growing challenges when designing cache systems for CMPs. These challenges result from non-uniform access latencies, interference misses, the bandwidth wall problem, and diverse workload characteristics. Our exploration of the CMP cache management problem suggests a CMP caching framework (CC-FR) that defines three main approaches to solve the problem: (1) data placement, (2) data retention, and (3) data relocation. We effectively implement CC-FR's components by proposing and evaluating multiple cache management mechanisms.Pressure and Distance Aware Placement (PDA) decouples the physical locations of cache blocks from their addresses for the sake of reducing misses caused by destructive interferences. Flexible Set Balancing (FSB), on the other hand, reduces interference misses via extending the life time of cache lines through retaining some fraction of the working set at underutilized local sets to satisfy far-flung reuses. PDA implements CC-FR's data placement and relocation components and FSB applies CC-FR's retention approach.To alleviate non-uniform access latencies and adapt to phase changes in programs, Adaptive Controlled Migration (ACM) dynamically and periodically promotes cache blocks towards L2 banks close to requesting cores. ACM lies under CC-FR's data relocation category. Dynamic Cache Clustering (DCC), on the other hand, addresses diverse workload characteristics and growing non-uniform access latencies challenges via constructing a cache cluster for each core and expands/contracts all clusters synergistically to match each core's cache demand. DCC implements CC-FR's data placement and relocation approaches. Lastly, Dynamic Pressure and Distance Aware Placement (DPDA) combines PDA and ACM to cooperatively mitigate interference misses and non-uniform access latencies. Dynamic Cache Clustering and Balancing (DCCB), on the other hand, combines DCC and FSB to employ all CC-FR's categories and achieve higher system performance. Simulation results demonstrate the effectiveness of the proposed mechanisms and show that they compare favorably with related cache designs.
机译:从芯片多处理器(CMP)获得高性能的关键要求之一是有效管理在共同调度的线程/进程之间共享的有限的片上缓存资源。本文提出了一种新的面向硬件的分布式CMP缓存解决方案。在为CMP设计缓存系统时,计算机架构师面临越来越多的挑战。这些挑战是由不均匀的访问等待时间,干扰遗漏,带宽壁垒问题以及各种工作负载特性造成的。我们对CMP缓存管理问题的探索提出了CMP缓存框架(CC-FR),该框架定义了三种解决该问题的方法:(1)数据放置,(2)数据保留和(3)数据重定位。我们通过提出和评估多种缓存管理机制来有效地实现CC-FR的组件。压力和距离感知放置(PDA)将缓存块的物理位置与其地址分离,以减少破坏性干扰造成的遗漏。另一方面,灵活集平衡(FSB)通过将部分工作集保留在未充分利用的本地集上来满足广泛的重用,从而延长了缓存行的寿命,从而减少了干扰遗漏。 PDA实现了CC-FR的数据放置和重定位组件,而FSB应用了CC-FR的保留方法,为缓解不均匀的访问延迟并适应程序的相位变化,自适应控制迁移(ACM)动态地,周期性地将高速缓存块升级为L2库关闭到请求的核心。 ACM属于CC-FR的数据重定位类别。另一方面,动态缓存集群(DCC)通过为每个核心构建一个缓存集群并协同扩展/收缩所有集群以匹配每个核心的缓存需求,来解决各种工作负载特征和日益增长的非均匀访问延迟挑战。 DCC实施CC-FR的数据放置和重定位方法。最后,动态压力和距离感知放置(DPDA)结合了PDA和ACM,以协同缓解干扰遗漏和不均匀的访问延迟。另一方面,动态缓存群集和平衡(DCCB)将DCC和FSB结合起来,以使用所有CC-FR的类别,并获得更高的系统性能。仿真结果证明了所提出机制的有效性,并表明它们与相关的缓存设计相比具有优势。

著录项

  • 作者

    Hammoud Mohammad Hussein;

  • 作者单位
  • 年度 2010
  • 总页数
  • 原文格式 PDF
  • 正文语种 en
  • 中图分类

相似文献

  • 外文文献
  • 中文文献
  • 专利

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号