ISC High Performance Conference

Distributed Deep Reinforcement Learning: Learn How to Play Atari Games in 21 minutes



Abstract

We present a study in Distributed Deep Reinforcement Learning (DDRL) focused on the scalability of a state-of-the-art Deep Reinforcement Learning algorithm known as Batch Asynchronous Advantage Actor-Critic (BA3C). We show that using the Adam optimization algorithm with a batch size of up to 2048 is a viable choice for carrying out large-scale machine learning computations. This, combined with a careful reexamination of the optimizer's hyperparameters, the use of synchronous training at the node level (while keeping the local, single-node part of the algorithm asynchronous), and a reduction of the model's memory footprint, allowed us to achieve linear scaling for up to 64 CPU nodes. This corresponds to a training time of 21 minutes on 768 CPU cores, as opposed to the 10 hours required by a baseline single-node implementation running on 24 cores.


