Analyzing Potential Throughput Improvement of Power- and Thermal-Constrained Multicore Processors by Exploiting DVFS and PCPG

Lee J.; Kim N. S.

首页> 外文期刊>Very Large Scale Integration (VLSI) Systems, IEEE Transactions on >Analyzing Potential Throughput Improvement of Power- and Thermal-Constrained Multicore Processors by Exploiting DVFS and PCPG

【24h】

Analyzing Potential Throughput Improvement of Power- and Thermal-Constrained Multicore Processors by Exploiting DVFS and PCPG

机译：通过利用DVFS和PCPG分析功率和散热受限的多核处理器的潜在吞吐量提高

获取原文

获取原文并翻译 | 示例

开具论文收录证明 >>

页面导航

摘要
著录项
引文网络
相似文献
相关主题

摘要

Process variability from a range of sources is growing as technology is scaled below 65 nm, increasing variations of transistor delay and leakage current both within a die and across dies. This, in turn, negatively impacts maximum operating frequency and total power consumption of processors. Meanwhile, manufacturers have integrated more cores in a single die to improve the throughput of processors running highly-parallel workloads. However, many existing workloads do not have high enough parallelism to exploit multiple cores in a processor. First, in this paper, we maximize the throughput of power- and thermal-constrained multicore processors using per-core power gating and dynamic voltage/frequency scaling. When we do not have enough parallelism to effectively use all cores, we turn off some cores using per-core power gates that are already available in commercial multicore processors. This provides extra power and thermal headroom, and allows active cores to run faster through voltage/frequency scaling within power, thermal, and voltage scaling limits. Our analysis using a 32 nm predictive technology model demonstrates that jointly optimizing the number of active cores and maximum operating frequency can improve the throughput of a 16-core processor running workloads with limited parallelism by up to 14%. Second, we extend our throughput analysis and optimization to consider the impact of within-die spatial process variations that lead to considerable core-to-core frequency and leakage power variations in multicore processors. Our analysis shows that exploiting core-to-core frequency variations can improve the throughput of a 16-core processor by up to 57%.

机译：随着技术被缩放到65 nm以下，各种来源的工艺差异都在增加，这增加了芯片内以及芯片间晶体管延迟和漏电流的变化。反过来，这会对处理器的最大工作频率和总功耗产生负面影响。同时，制造商在单个裸片中集成了更多内核，以提高运行高度并行工作负载的处理器的吞吐量。但是，许多现有的工作负载没有足够高的并行性来利用处理器中的多个内核。首先，在本文中，我们使用每核功率门控和动态电压/频率缩放功能，将功率和散热受限的多核处理器的吞吐量最大化。当我们没有足够的并行度来有效使用所有内核时，我们将使用商用多核处理器中已经可用的每核功率门关闭一些内核。这提供了额外的功率和散热空间，并允许有源内核通过在功率，热量和电压缩放限制内的电压/频率缩放更快地运行。我们使用32 nm预测技术模型进行的分析表明，共同优化活动内核的数量和最大工作频率可以将并行性受限的16核处理器运行工作负载的吞吐量提高多达14％。其次，我们扩展吞吐量分析和优化，以考虑芯片内空间过程变化的影响，这些变化会导致多核处理器中相当大的核心到核心频率和泄漏功率变化。我们的分析表明，利用内核之间的频率变化可以将16核处理器的吞吐量提高多达57％。

著录项

来源
《Very Large Scale Integration (VLSI) Systems, IEEE Transactions on》 |2012年第2期|p.225-235|共11页
作者
Lee J.; Kim N. S.;
展开▼
作者单位

展开▼
收录信息
原文格式 PDF
正文语种 eng
中图分类
关键词
Core-to-core frequency and leakage power variations; die-to-die and within-die process variations; multicore processor; power and thermal constrained design; throughput;

机译：核心频率和泄漏功率变化;芯片到芯片和芯片内工艺变化;多核处理器;功率和热约束设计;吞吐量;

相似文献

外文文献
中文文献
专利

1. Thermal-Constrained Task Scheduling on 3-D Multicore Processors for Throughput-and-Energy Optimization [J] . Liao Chien-Hui, Wen Charles H.-P. Very Large Scale Integration (VLSI) Systems, IEEE Transactions on . 2015,第11期

机译：3-D多核处理器上的热约束任务调度，以实现吞吐量和能量优化
2. STEM: A Thermal-Constrained Real-Time Scheduling for 3D Heterogeneous-ISA Multicore Processors [J] . Tsai Ting-Hao, Chen Ya-Shu, He Xue-Xin, Fortschritte der Physik . 2018,第6期

机译：杆：3D异构-SISA多核处理器的热受限实时调度
3. Energy-Efficient Operation of Multicore Processors by DVFS, Task Migration, and Active Cooling [J] . Hanumaiah Vinay, Vrudhula Sarma IEEE Transactions on Computers . 2014,第2期

机译：通过DVFS，任务迁移和主动散热实现多核处理器的节能运行
4. Optimizing throughput of power- and thermal-constrained multicore processors using DVFS and per-core power-gating [C] . Jungseob Lee, Nam Sung Kim Proceedings of the 46th Annual Design Automation Conference . 2009

机译：使用DVFS和每核电源门控优化功率和散热受限的多核处理器的吞吐量
5. Exploiting heterogeneous multicore processors through fine-grained scheduling and low-overhead thread migration. [D] . Sawalha, Lina Hakam. 2012

机译：通过细粒度的调度和低开销的线程迁移来利用异构多核处理器。
6. Source Reconstruction of Brain Potentials Using Bayesian Model Averaging to Analyze Face Intra-Domain vs. Face-Occupation Cross-Domain Processing [O] . Ela I. Olivares, Agustín Lage-Castellanos, María A. Bobes, 2018

机译：使用贝叶斯模型平均的脑势源重构以分析人脸内域与人脸跨域处理
7. Enabling Improved Power Management in Multicore Processors through Clustered DVFS [O] . Tejaswini Kolpe, Antonia Zhai, Sachin S. Sapatnekar 2012

机译：通过集群DVFs在多核处理器中实现改进的电源管理
8. Recent Process and Equipment Improvements to Increase High Level Waste Throughput at the Defense Waste Processing Facility (DWPF)-8366 [R] . Leita, J., Coleman, J., Glover, T., 2008

机译：最近的工艺和设备改进，以提高国防废物处理设施（DWpF）-8366的高水平废物吞吐量

Analyzing Potential Throughput Improvement of Power- and Thermal-Constrained Multicore Processors by Exploiting DVFS and PCPG

摘要

著录项

引文网络

相似文献

相关主题

期刊订阅