JMLR: Workshop and Conference Proceedings

Device Placement Optimization with Reinforcement Learning



Abstract

The past few years have witnessed a growth in size and computational requirements for training and inference with neural networks. Currently, a common approach to address these requirements is to use a heterogeneous distributed environment with a mixture of hardware devices such as CPUs and GPUs. Importantly, the decision of placing parts of the neural models on devices is often made by human experts based on simple heuristics and intuitions. In this paper, we propose a method which learns to optimize device placement for TensorFlow computational graphs. Key to our method is the use of a sequence-to-sequence model to predict which subsets of operations in a TensorFlow graph should run on which of the available devices. The execution time of the predicted placements is then used as the reward signal to optimize the parameters of the sequence-to-sequence model. Our main result is that on Inception-V3 for ImageNet classification, and on RNN LSTM, for language modeling and neural machine translation, our model finds non-trivial device placements that outperform hand-crafted heuristics and traditional algorithmic methods.
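As a concrete illustration of the training loop the abstract describes, below is a minimal sketch of placement optimization with a policy-gradient (REINFORCE) update. It is an assumption-laden simplification: an independent softmax policy per operation stands in for the paper's attentional sequence-to-sequence placer, and measure_runtime is a hypothetical stub replacing the actual execution and timing of a placed TensorFlow graph; the problem sizes and hyperparameters are illustrative only.

```python
import numpy as np

# Toy setting: 6 operations to place on 2 devices (e.g. one CPU, one GPU).
# These sizes and the cost model are illustrative assumptions, not the paper's setup.
NUM_OPS, NUM_DEVICES = 6, 2
rng = np.random.default_rng(0)

# Simplified policy: independent softmax logits per operation, standing in
# for the paper's attentional sequence-to-sequence placer.
logits = np.zeros((NUM_OPS, NUM_DEVICES))

def sample_placement(logits):
    """Sample one device per operation from the current softmax policy."""
    probs = np.exp(logits - logits.max(axis=1, keepdims=True))
    probs /= probs.sum(axis=1, keepdims=True)
    placement = np.array([rng.choice(NUM_DEVICES, p=p) for p in probs])
    return placement, probs

def measure_runtime(placement):
    """Hypothetical stub: in the paper this is the measured execution time of the
    TensorFlow graph under the given placement. Here, a fake cost that favors
    balancing operations across devices, plus noise to mimic timing jitter."""
    counts = np.bincount(placement, minlength=NUM_DEVICES)
    imbalance = np.abs(counts - NUM_OPS / NUM_DEVICES).sum()
    return 1.0 + 0.1 * imbalance + 0.01 * abs(rng.standard_normal())

baseline, lr = None, 0.1
for step in range(500):
    placement, probs = sample_placement(logits)
    reward = -measure_runtime(placement)   # shorter runtime -> higher reward
    baseline = reward if baseline is None else 0.9 * baseline + 0.1 * reward
    advantage = reward - baseline          # baseline reduces gradient variance
    # REINFORCE for a softmax policy: grad log pi(d_i) = one_hot(d_i) - probs[i].
    grad = -probs
    grad[np.arange(NUM_OPS), placement] += 1.0
    logits += lr * advantage * grad

print("learned placement:", sample_placement(logits)[0])
```

The moving-average baseline is one common variance-reduction choice for this kind of policy-gradient objective; in the system the abstract describes, sampled placements are executed on real hardware and their measured runtimes serve as the reward driving the update.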
