首页> 外文期刊>Operating systems review >Enhancing Operating System Support for Multicore Processors by Using Hardware Performance Monitoring
【24h】

Enhancing Operating System Support for Multicore Processors by Using Hardware Performance Monitoring

机译:通过使用硬件性能监视来增强对多核处理器的操作系统支持

获取原文
获取原文并翻译 | 示例
       

摘要

Multicore processors contain new hardware characteristics that are different from previous generation single-core systems or traditional SMP (symmetric multiprocessing) multiprocessor systems. These new characteristics provide new performance opportunities and challenges. In this paper, we show how hardware performance monitors can be used to provide a fine-grained, closely-coupled feedback loop to dynamic optimizations done by a multicore-aware operating system. These multicore optimizations are possible due to the advanced capabilities of hardware performance monitoring units currently found in commodity processors, such as execution pipeline stall breakdown and data address sampling. We demonstrate three case studies on how a multicore-aware operating system can use these online capabilities for (1) determining cache partition sizes, which helps reduce contention in the shared cache among applications, (2) detecting memory regions with bad cache usage, which helps in isolating these regions to reduce cache pollution, and (3) detecting sharing among threads, which helps in clustering threads to improve locality. Using realistic applications from standard benchmark suites, the following performance improvements were achieved: (1) up to 27% improvement in IPC (instructions-per-cycle) due to cache partition sizing; (2) up to 10% reduction in cache miss rates due to reduced cache pollution, resulting in up to 7% improvement in IPC; and (3) up to 70% reduction in remote cache accesses due to thread clustering, resulting in up to 7% application-level improvement.
机译:多核处理器包含的新硬件特性与上一代单核系统或传统的SMP(对称多处理)多处理器系统不同。这些新特性提供了新的性能机遇和挑战。在本文中,我们展示了如何使用硬件性能监视器为由多核感知的操作系统完成的动态优化提供细粒度,紧密耦合的反馈回路。由于当前在商用处理器中发现的硬件性能监视单元的高级功能(例如执行管道停顿故障和数据地址采样),这些多核优化成为可能。我们演示了三个案例研究,涉及多核感知操作系统如何使用这些在线功能来(1)确定缓存分区大小,这有助于减少应用程序之间的共享缓存争用;(2)检测缓存使用率不佳的内存区域,这帮助隔离这些区域以减少缓存污染,以及(3)检测线程之间的共享,这有助于对线程进行群集以提高局部性。使用标准基准套件中的实际应用程序,可以实现以下性能改进:(1)由于高速缓存分区的大小,IPC(每周期指令数)提高了27%; (2)由于减少了缓存污染,使缓存未命中率降低了10%,从而使IPC提升了7%。 (3)由于线程群集,远程缓存访问量最多减少了70%,从而使应用程序级别的改善最多达到了7%。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号