International Conference on Computer Design

Using Application-Level Thread Progress Information to Manage Power and Performance

Abstract

Power and thermal limitations make it impossible to run all cores on a multicore system at their maximum frequency. Therefore, modern systems require careful power management. These systems must manage complex tradeoffs between energy, power, and frequency, choosing which cores to accelerate to achieve good performance while maintaining energy efficiency or operating under a power budget. Navigating these tradeoffs is especially hard with multi-threaded applications, where performance depends on the relative progress of parallel worker threads between synchronization points. Prior work on chip-level power management for multi-threaded applications has largely relied on indirect heuristics and metrics calculated from low-level performance counters to estimate each thread's progress. However, these indirect metrics are often inaccurate. Instead, we propose to gather progress information directly from software itself. We present ThreadBeats, a simple application-level annotation framework that directly and accurately conveys thread progress information to hardware. We design DVFS controllers that exploit ThreadBeats information for two purposes: (i) improving performance by equalizing thread progress and (ii) minimizing runtime under a power budget constraint. These controllers reduce wait time at barriers by 77% on average and improve energy-delay product under a power budget by 23% over prior work.
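
To make the first controller goal (equalizing thread progress) concrete, here is a minimal Python sketch. It is not the paper's controller or the ThreadBeats API: the `ThreadProgress` record, the `equalize` function, and the frequency levels in `FREQS_GHZ` are all hypothetical, and the sketch assumes that the time to finish the remaining work scales as remaining work divided by frequency, which ignores memory-bound behavior.

```python
from dataclasses import dataclass

# Hypothetical DVFS levels in GHz; these values are not taken from the paper.
FREQS_GHZ = [1.0, 1.5, 2.0, 2.5, 3.0]


@dataclass
class ThreadProgress:
    """Progress reported by one thread since the last barrier (hypothetical record)."""
    beats_done: int   # progress units completed so far
    beats_total: int  # progress units expected before the next barrier

    @property
    def remaining(self) -> int:
        return self.beats_total - self.beats_done


def equalize(progress: list[ThreadProgress]) -> list[float]:
    """Choose one frequency per thread so all threads reach the barrier together.

    The thread with the most remaining work runs at the top level. Every other
    thread gets the lowest level that still finishes no later than that thread,
    under the assumed model: time = remaining / frequency.
    """
    worst = max(p.remaining for p in progress)
    if worst == 0:
        # Everyone is already at the barrier; idle at the lowest level.
        return [FREQS_GHZ[0]] * len(progress)

    target_time = worst / FREQS_GHZ[-1]
    freqs = []
    for p in progress:
        needed = p.remaining / target_time
        # Small tolerance guards against floating-point rounding at level edges.
        freqs.append(next(f for f in FREQS_GHZ if f >= needed - 1e-9))
    return freqs


if __name__ == "__main__":
    threads = [
        ThreadProgress(90, 100),  # far ahead
        ThreadProgress(50, 100),  # the straggler
        ThreadProgress(75, 100),
    ]
    print(equalize(threads))
```

In this sketch the straggler is accelerated while threads that are ahead are slowed down, so less time is spent waiting at the barrier. A controller operating under a power budget would additionally need a per-level power model, which is left out here.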
