Towards Power Efficiency in Deep Learning on Data Center Hardware

IEEE International Conference on Big Data


Abstract

Deep learning (DL) is a computationally intensive workload that is expected to grow rapidly in data centers in the near future. Its high energy demand necessitates finding ways to improve computational efficiency. In this work, we directly measure the power used by the whole system, as well as by the GPU, CPU, and RAM, during DL training to determine their contributions to overall energy consumption. We find that while GPUs draw most of the power (about 70%), the consumption of the other components is also significant, and optimizing them can bring important power savings. Evaluating a multitude of options, we identify the parameters that bring the most power savings. Overall, energy savings of over 20% can be obtained by adjusting system settings alone, without changing the workload, at the cost of a minor increase in runtime. Alternatively, if runtime must stay constant, an 18% energy savings is identified. In distributed multi-server DL, we find that scale-out overhead has only a small energy cost, making distributed training more energy-efficient than expected. Implications for the field and ways to make DL more energy-efficient going forward are also discussed.
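The abstract does not name the measurement tooling. The following is a minimal sketch of the kind of per-component sampling loop it describes, assuming an NVIDIA GPU (polled through the NVML bindings in pynvml) and a Linux system with Intel's RAPL powercap interface exposing CPU-package and DRAM energy counters; paths and domain indices are illustrative and vary by platform.

```python
import time
import pynvml

# Sysfs paths for Intel RAPL cumulative energy counters (microjoules).
# The subdomain index for DRAM varies by platform -- check the matching
# `name` file. Reading these files usually requires root on recent kernels.
RAPL_PKG = "/sys/class/powercap/intel-rapl:0/energy_uj"
RAPL_DRAM = "/sys/class/powercap/intel-rapl:0:0/energy_uj"

def read_energy_uj(path):
    with open(path) as f:
        return int(f.read())

def sample(interval_s=1.0, duration_s=60.0):
    pynvml.nvmlInit()
    gpu = pynvml.nvmlDeviceGetHandleByIndex(0)
    t_prev = time.time()
    pkg_prev = read_energy_uj(RAPL_PKG)
    dram_prev = read_energy_uj(RAPL_DRAM)
    t_stop = t_prev + duration_s
    while time.time() < t_stop:
        time.sleep(interval_s)
        t_now = time.time()
        dt = t_now - t_prev
        # NVML reports instantaneous board power in milliwatts.
        gpu_w = pynvml.nvmlDeviceGetPowerUsage(gpu) / 1000.0
        # RAPL exposes cumulative energy; average power is the delta over dt.
        # (Counters wrap at max_energy_range_uj; wrap handling omitted here.)
        pkg_now = read_energy_uj(RAPL_PKG)
        dram_now = read_energy_uj(RAPL_DRAM)
        cpu_w = (pkg_now - pkg_prev) / 1e6 / dt
        ram_w = (dram_now - dram_prev) / 1e6 / dt
        t_prev, pkg_prev, dram_prev = t_now, pkg_now, dram_now
        print(f"GPU {gpu_w:6.1f} W  CPU pkg {cpu_w:6.1f} W  DRAM {ram_w:6.1f} W")
    pynvml.nvmlShutdown()

if __name__ == "__main__":
    sample()
```

System-level knobs of the kind the 20% figure refers to include, for example, the GPU board power cap, which on NVIDIA hardware can be lowered with `nvidia-smi -pl <watts>`; whether this matches the paper's exact settings is not stated in the abstract.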