
Operating System Management of Shared Caches on Multicore Processors.


Abstract

Our thesis is that operating systems should manage the on-chip shared caches of multicore processors for the purposes of achieving performance gains. Consequently, this dissertation demonstrates how the operating system can profitably manage these shared caches. Two shared-cache management principles are investigated: (1) promoting shared use of the shared cache, demonstrated by an automated online thread clustering technique, and (2) providing cache space isolation, demonstrated by a software-based cache partitioning technique. In support of providing isolation, cache provisioning is also investigated, demonstrated by an automated online technique called RapidMRC. We show how these software-based techniques are feasible on existing multicore systems with the help of their hardware performance monitoring units and their associated hardware performance counters. On a 2-chip IBM POWER5 multicore system, promoting sharing reduced processor pipeline stalls caused by cross-chip cache accesses by up to 70%, resulting in performance improvements of up to 7%. On a larger 8-chip IBM POWER5+ multicore system, the potential for up to 14% performance improvement was measured. Providing isolation improved performance by up to 50%, using an exhaustive offline search method to determine optimal partition size. On the other hand, up to 27% performance improvement was extracted from the corresponding workload using an automated online approximation technique, made possible by RapidMRC.
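The abstract does not describe how RapidMRC works internally. As general background on the idea of sizing cache partitions from miss rate curves, the following is a minimal sketch, assuming a fully associative LRU cache and an in-memory list of cache-line addresses as the trace. The function names `miss_rate_curve` and `best_split` are hypothetical and are not taken from the dissertation. The sketch uses the classic LRU stack-distance approach and then exhaustively searches two-way splits, in the spirit of the offline search the abstract mentions.

```python
from collections import OrderedDict


def miss_rate_curve(trace, max_lines):
    """Miss rate of a fully associative LRU cache for sizes 1..max_lines.

    Illustrative only: `trace` is a list of cache-line addresses, an
    assumption made for this sketch rather than the thesis's trace format.
    """
    stack = OrderedDict()  # least recently used first, most recent last
    hits_at_distance = [0] * (max_lines + 1)
    for addr in trace:
        if addr in stack:
            keys = list(stack)
            # Stack distance: 1-based position counted from the MRU end.
            distance = len(keys) - keys.index(addr)
            if distance <= max_lines:
                hits_at_distance[distance] += 1
            stack.move_to_end(addr)
        else:
            stack[addr] = True  # cold miss
    total = len(trace)
    curve, hits = [], 0
    for size in range(1, max_lines + 1):
        # A cache of `size` lines hits every access with distance <= size.
        hits += hits_at_distance[size]
        curve.append((total - hits) / total)
    return curve


def best_split(curve_a, curve_b, total_lines):
    """Exhaustively pick (lines_a, lines_b) minimizing the summed miss rate.

    Each workload receives at least one line; both curves must cover
    sizes up to total_lines - 1.
    """
    best_a = min(
        range(1, total_lines),
        key=lambda a: curve_a[a - 1] + curve_b[total_lines - a - 1],
    )
    return best_a, total_lines - best_a


if __name__ == "__main__":
    curve_a = miss_rate_curve([1, 2, 3, 1, 2, 3, 4, 1], 4)
    curve_b = miss_rate_curve([1, 1, 1, 1], 4)
    print(curve_a, curve_b, best_split(curve_a, curve_b, 5))
```

Summing miss rates is only one possible objective. A real system would weight each workload by its access rate or measure performance directly, as the abstract's reported improvements do.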

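Record details

  • Author

    Tam, David.;

  • Author affiliation

    University of Toronto (Canada).;

  • Degree-granting institution University of Toronto (Canada).;
  • Discipline Computer Science.
  • Degree Ph.D.
  • Year 2010
  • Pages 171 p.
  • Total pages 171
  • Original format PDF
  • Language of text eng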
