Optimizing power allocation to CPU and memory subsystems in overprovisioned HPC systems

机译：在超额配置的HPC系统中优化对CPU和内存子系统的电源分配

获取原文

获取外文期刊封面目录资料

页面导航

摘要
著录项
引文网络
相似文献
相关主题

摘要

Energy consumption and power draw pose two major challenges to the HPC community for designing larger systems. Present day HPC systems consume as much as 10MW of electricity and this is fast becoming a bottleneck. Although energy bills will significantly increase with machine size, power consumption is a hard constraint that must be addressed. Intel's Running Average Power Limit (RAPL) toolkit is a recent feature that enables power capping of CPU and memory subsystems on modern hardware. In this paper, we use RAPL to evaluate the possibility of improving execution time efficiency of an application by capping power while adding more nodes. We profile the strong scaling of an application using different power caps for both CPU and memory subsystems. Our proposed interpolation scheme uses an application profile to optimize the number of nodes and the distribution of power between CPU and memory subsystems to minimize execution time under a strict power budget. We validate these estimates by running experiments on a 20-node (120 cores) Sandy Bridge cluster. Our experimental results closely match the model estimates and show speedups greater than 1.47X for all applications compared to not capping CPU and memory power. We demonstrate that the quality of solution that our interpolation scheme provides matches very closely to results obtained via exhaustive profiling.

机译：能耗和功耗对HPC社区在设计大型系统方面构成了两个主要挑战。如今，HPC系统消耗多达10兆瓦的电力，这正迅速成为瓶颈。尽管随着机器尺寸的增加，电费将大大增加，但是功耗是一个必须解决的硬约束。英特尔的运行平均功率限制（RAPL）工具包是一项最新功能，可对现代硬件上的CPU和内存子系统进行功率限额设置。在本文中，我们使用RAPL来评估在增加更多节点的同时限制功率来提高应用程序执行时间效率的可能性。我们介绍了针对CPU和内存子系统使用不同功率上限的应用程序的强大扩展能力。我们提出的插值方案使用应用程序配置文件来优化节点数量以及CPU和内存子系统之间的电源分配，以在严格的电源预算下最大程度地缩短执行时间。我们通过在20节点（120核）Sandy Bridge集群上运行实验来验证这些估计。我们的实验结果与模型估计值非常吻合，并且与未限制CPU和内存能力相比，所有应用程序的加速都超过1.47倍。我们证明了我们的插值方案提供的解决方案的质量与通过穷举分析获得的结果非常接近。

著录项

来源
《IEEE International Conference on Cluster Computing》|2013年|1-8|共8页
会议地点
作者
Sarood Osman; Langer Akhil; Kale Laxmikant; Rountree Barry;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类
关键词

相似文献

外文文献
中文文献
专利

1. Power-mode-aware Memory Subsystem Optimization for Low-power System-on-Chip Design [J] . Strobel Manuel, Radetzki Martin ACM Transactions on Embedded Computing Systems . 2019,第5期

机译：电源模式感知内存子系统优化低功耗系统上的设计
2. Minimum power consumption in mobile-phone memory subsystems - Choosing and using the right next generation mobile-phone memory can yield dramatic power savings [J] . Odilio Vargas Portable Design: The Engineer's Resource for Portable Applications . 2005,第9期

机译：手机内存子系统中的最低功耗-选择和使用正确的下一代手机内存可以节省大量电量
3. Availability optimization of a series system with multiple repairable load sharing subsystems considering redundancy and repair facility allocation [J] . Ali A. Yahyatabar Arabi, A. Eshraghniaye Jahromi International journal of systems assurance engineering and management . 2013,第3期

机译：考虑冗余和维修设施分配的具有多个可修复负载共享子系统的串联系统的可用性优化
4. Optimizing power allocation to CPU and memory subsystems in overprovisioned HPC systems [C] . Sarood Osman, Langer Akhil, Kale Laxmikant, IEEE International Conference on Cluster Computing . 2013

机译：优化过透视HPC系统中的CPU和内存子系统的功率分配
5. Design and Optimization of Emerging Interconnection and Memory Subsystems for Future Manycore Architectures [D] . Thakkar, Ishan G. 2018

机译：面向未来的Manycore架构的新兴互连和内存子系统的设计和优化
6. Beam Allocation and Power Optimization for Energy-Efficiency in Multiuser mmWave Massive MIMO System [O] . Saidiwaerdi Maimaiti, Gang Chuai, Weidong Gao, 2021

机译：多用户MMWAVE大型MIMO系统能效的光束分配和功率优化
7. Optimizing Power Allocation to CPU and Memory Subsystems in Overprovisioned HPC Systems [O] . Osman Sarood, Akhil Langer, Laxmikant Kalé, 2013

机译：在过度配置的HpC系统中优化CpU和内存子系统的功率分配
8. Memory Subsystem Performance of Programs with Intensive Heap Allocation [R] . Diwan, A., Tarditi, D., Moss, E. 1993

机译：具有密集堆分配的程序的内存子系统性能

Optimizing power allocation to CPU and memory subsystems in overprovisioned HPC systems

摘要

著录项

引文网络

相似文献

相关主题

期刊订阅