【24h】

Intra-application cache partitioning

机译:帧内缓存分区

获取原文

摘要

Efficient management of shared on-chip resources such as the shared level 2 (L2) cache has become an important problem with the emergence of chip multiprocessors (CMPs). Partitioning the shared cache in chip multiprocessors (CMPs) among concurrently executing applications can provide important benefits such as throughput improvement, fairness guarantees, and quality of service (QoS) enhancements. In this paper, we pose an interesting related question, which is, if partitioning the shared cache space among concurrently executing threads of the same application can enhance the application performance. We address this problem by identifying and speeding up the slowest thread, also termed as the critical path thread, during each execution interval since the overall performance of a multithreaded application is determined by the critical path thread. To do so, we propose a dynamic, runtime system based, cache partitioning scheme that partitions the shared cache space dynamically among the individual threads of a given application. In a nutshell, we wish to take some cache space away from the faster threads and give it to the critical path thread at each execution interval. We show that speeding up the critical path thread this way, results in overall performance enhancement of the application execution in the long term. Our experimental evaluation indicates that, the proposed dynamic cache partitioning scheme yields benefits up to 15% over a shared cache with no partitions, up to 23% over a statically partitioned cache (private cache) and up to 20% over a throughput-oriented scheme.
机译:有效管理共享的片上资源,例如共享级别2(L2)缓存已成为芯片多处理器(CMP)的出现成为一个重要问题。在同时执行的应用程序中划分芯片多处理器(CMP)中的共享缓存可以提供重要的好处,例如吞吐量改进,公平保证和服务质量(QoS)增强功能。在本文中,我们构成了一个有趣的相关问题,即如果在同一应用程序的同时执行的线程中分区共享缓存空间可以增强应用程序性能。在每个执行间隔期间,通过识别和加速最慢的线程,通过识别和加速作为关键路径线程的最慢的线程来解决此问题,因为多线程应用程序的整体性能由关键路径线程确定。为此,我们提出了一种基于动态的运行时系统的高速缓存分区方案,其在给定应用程序的各个线程中动态地分区共享高速缓存空间。简而言之,我们希望将一些缓存空间远离更快的线程,并在每个执行间隔处将其送到关键路径线程。我们展示了这种方式加快了关键路径螺纹,从长远来看,可以整体性能提高应用程序执行。我们的实验评估表明,建议的动态缓存分区方案在没有分区的共享缓存中产生高达15%的好处,在静态分区的缓存(私有缓存)上,高达23%,高达20%的吞吐量方案。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号