Conference: Design, Automation & Test in Europe Conference & Exhibition

The slowdown or race-to-idle question: Workload-aware energy optimization of SMT multicore platforms under process variation

Abstract

Two widely used approaches for reducing the energy consumption of multithreaded workloads are slowdown (using DVFS) and race-to-idle. In this paper, we first demonstrate that the most energy-efficient choice depends on (1) the workload (memory-bound, CPU-bound, etc.), (2) process variation, and (3) support for Simultaneous Multithreading (SMT). We then propose an approach for mapping application threads onto SMT multicore systems at run-time to minimize energy consumption. The proposed approach interfaces with the OS and hardware performance counters to characterize application threads. This characterization captures the effect of process variation on execution time and identifies the break-even operating point, where one strategy (slowdown or race-to-idle) outperforms the other. Thread mapping is performed on these characterization data by iteratively collapsing application threads (SMT), followed by binary programming-based thread mapping. Finally, performance slack is exploited at run-time to select between slowdown and race-to-idle, based on the break-even operating point calculated for each individual thread. This end-to-end approach is implemented as a run-time manager for the Linux OS and is validated across a range of high-performance applications. Results demonstrate up to 13% energy reduction over all state-of-the-art approaches, with an average of 18% improvement over Linux.
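The trade-off between slowdown and race-to-idle described in the abstract can be illustrated with a minimal sketch. It does not reproduce the paper's characterization or its run-time manager. It assumes a textbook power model (constant leakage plus a dynamic term cubic in frequency), a fixed deadline that stands in for performance slack, and made-up parameter values; `PowerModel`, `choose_strategy`, and all numbers are hypothetical. Process variation is mimicked only by giving two cores different leakage power.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class PowerModel:
    """Hypothetical per-core power model (an assumption, not from the paper)."""
    static_w: float   # leakage power while the core is active, in watts
    dyn_coeff: float  # dynamic power coefficient, in W / GHz^3
    idle_w: float     # power drawn in the idle state, in watts

    def active_power(self, freq_ghz: float) -> float:
        # Textbook approximation: leakage plus dynamic power cubic in frequency.
        return self.static_w + self.dyn_coeff * freq_ghz ** 3


def energy_at(model: PowerModel, gcycles: float, deadline_s: float,
              freq_ghz: float) -> float:
    """Energy (J) to run `gcycles` at `freq_ghz`, then idle until the deadline."""
    busy_s = gcycles / freq_ghz
    if busy_s > deadline_s:
        raise ValueError("frequency too low to meet the deadline")
    return model.active_power(freq_ghz) * busy_s + model.idle_w * (deadline_s - busy_s)


def choose_strategy(model: PowerModel, gcycles: float, deadline_s: float,
                    freqs_ghz: list[float]) -> str:
    """Compare racing at the top frequency with the slowest feasible frequency."""
    feasible = [f for f in freqs_ghz if gcycles / f <= deadline_s]
    if not feasible:
        raise ValueError("no available frequency meets the deadline")
    e_race = energy_at(model, gcycles, deadline_s, max(freqs_ghz))
    e_slow = energy_at(model, gcycles, deadline_s, min(feasible))
    return "slowdown" if e_slow < e_race else "race-to-idle"


# Illustrative values only: two cores that differ in leakage.
FREQS_GHZ = [1.0, 1.5, 2.0]
low_leakage_core = PowerModel(static_w=0.2, dyn_coeff=0.5, idle_w=0.05)
high_leakage_core = PowerModel(static_w=6.0, dyn_coeff=0.5, idle_w=0.05)

low_choice = choose_strategy(low_leakage_core, 2.0, 2.0, FREQS_GHZ)
high_choice = choose_strategy(high_leakage_core, 2.0, 2.0, FREQS_GHZ)
print(low_choice, high_choice)
```

Under these assumed numbers the low-leakage core favors slowdown, while the high-leakage core favors race-to-idle because leakage accumulates over the longer run time. The break-even point is the leakage level (or frequency) at which the two energies are equal.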
