Set Utilization Based Dynamic Shared Cache Partitioning

机译：基于集合利用率的动态共享缓存分区

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

As the number of processors sharing a cache increases, conflict misses due to interference amongst competing processes have an increasing impact on the individual performance of processes. Cache partitioning is a method of allocating a cache between concurrently executing processes in order to counteract the effects of inter-process conflicts. However, cache partitioning methods commonly divide a shared cache into private partitions dedicated to a single processor, which can lead to underutilized portions of the cache when set accesses are non-uniform. Our proposed method compliments these cache partitioning algorithms by creating an additional shared partition able to be shared amongst all processors. Underutilized areas of the cache are identified by a monitoring circuit and used for the shared partition. Detection of underutilization is based on the number of unique set accesses for a given allocated way. For a 16-way set associative cache, the implementation of our method requires 64 bytes of storage overhead per core in addition to that needed for the method that determines the sizes of the private partitions. For the tested system, our method is able to improve performance over the traditional LRU policy for a number of selected benchmark sets by an average of 1.4% and up to 13.3% for a two core system and an average of 1.4% and up to 7.8% for a four core system, and is able to improve the performance of a conventional cache partitioning method (Utility-Based Cache Partitioning) by an average of 0.1% and up to 0.5% for both a two and four core systems.

机译：随着共享高速缓存的处理器数量的增加，由于竞争进程之间的干扰而导致的冲突遗漏对进程的单个性能产生越来越大的影响。高速缓存分区是一种在并发执行的进程之间分配高速缓存以抵消进程间冲突影响的方法。但是，高速缓存分区方法通常将共享高速缓存划分为专用于单个处理器的专用分区，当集合访问不一致时，这可能导致高速缓存的利用率不足。我们提出的方法通过创建一个能够在所有处理器之间共享的附加共享分区来补充这些缓存分区算法。缓存未充分利用的区域由监视电路标识，并用于共享分区。未充分利用的检测是基于给定分配方式的唯一集合访问的数量。对于16路集关联缓存，我们的方法的实现除确定专用分区大小的方法所需的开销外，每个内核还需要64字节的存储开销。对于经过测试的系统，对于许多选定的基准集，我们的方法能够将性能提高到传统LRU策略之上，对于两核系统，平均提高了1.4％，最高提高了13.3％，而两核系统则提高了1.4％，最高达到7.8对于四核系统，它的性能为％，并且对于两核和四核系统，它能够将传统的高速缓存分区方法（基于实用程序的高速缓存分区）的性能平均提高0.1％，最高可提高0.5％。

著录项

来源
《2011 17th IEEE International Conference on Parallel and Distributed Systems》|2011年|p.284-291|共8页
会议地点
作者
Deayton Peter; Chung Chung-Ping;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类分布式操作系统、并行式操作系统;
关键词
cache partitioning; chip multi-processor; set utilization; shared cache;

机译：缓存分区;芯片多处理器;集利用率;共享缓存;

相似文献

外文文献
中文文献
专利

1. Partition-Based Cache Replacement to Manage Shared L2 Caches [J] . FANG Juan, WANG Jing, LI Chengyan, 电子学报：英文版 . 2014,第003期

机译：基于分区的缓存替换以管理共享L2缓存
2. Dynamic Partition of Shared Cache for Multi-Threaded Application in Multi-Core System [J] . Shuo Li, Feng Wu Key Engineering Materials . 2010,第pta2期

机译：多核系统中多线程应用程序共享缓存的动态分区
3. Dynamic Partitioning of Shared Cache Memory [J] . G. E. SUH, L. RUDOLPH, S. DEVADAS Journal of supercomputing . 2004,第1期

机译：共享缓存的动态分区
4. Set Utilization Based Dynamic Shared Cache Partitioning [C] . Deayton Peter, Chung Chung-Ping IEEE International Conference on Parallel and Distributed Systems . 2011

机译：设置基于使用的动态共享缓存分区
5. Improving fairness and throughput of a CMP processor by optimizing the utilization of the last level shared cache in real-time using a constrained-extended Kalman filter. [D] . Panday, Ashish. 2017

机译：通过使用约束扩展卡尔曼滤波器实时优化最后一级共享缓存的利用率，提高CMP处理器的公平性和吞吐量。
6. Set-based corral control in stochastic dynamical systems: Making almostinvariant sets more invariant [O] . Eric Forgoston, Lora Billings, Philip Yecko, -1

机译：随机动力系统中基于集合的畜栏控制：几乎不变集更不变
7. Set Utilization Based Dynamic Shared Cache Partitioning [O] . Peter Deayton, Chung-ping Chung 2016

机译：设置基于利用率的动态共享缓存分区

Set Utilization Based Dynamic Shared Cache Partitioning

摘要

著录项

相似文献

相关主题

期刊订阅