
Efficient Model-Based Deep Reinforcement Learning with Variational State Tabulation

JMLR: Workshop and Conference Proceedings


Abstract

Modern reinforcement learning algorithms reach super-human performance on many board and video games, but they are sample-inefficient, i.e. they typically require significantly more playing experience than humans to reach an equal performance level. To improve sample efficiency, an agent may build a model of the environment and use planning methods to update its policy. In this article we introduce Variational State Tabulation (VaST), which maps an environment with a high-dimensional state space (e.g. the space of visual inputs) to an abstract tabular model. Prioritized sweeping with small backups, a highly efficient planning method, can then be used to update state-action values. We show how VaST can rapidly learn to maximize reward in tasks like 3D navigation and efficiently adapt to sudden changes in rewards or transition probabilities.
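The planning step the abstract refers to can be illustrated with classic prioritized sweeping over a learned tabular model. The sketch below is a simplified illustration under assumed settings: deterministic transitions, full backups rather than the small backups used in the paper, and integer indices for the abstract states and actions. Every name in it (`observe`, `sweep`, `GAMMA`, `THETA`) is hypothetical and this is not VaST's actual implementation.

```python
import heapq
from collections import defaultdict

GAMMA = 0.99   # discount factor (assumed value)
THETA = 1e-4   # minimum priority worth queueing (assumed value)
N_SWEEPS = 50  # backups performed per planning call (assumed value)

Q = defaultdict(float)           # Q[(s, a)] -> action-value estimate
model = {}                       # model[(s, a)] -> (reward, next_state)
predecessors = defaultdict(set)  # next_state -> {(s, a) pairs leading to it}
pqueue = []                      # max-heap via negated priorities

def greedy_value(s, actions):
    # Value of the greedy action in abstract (tabular) state s.
    return max(Q[(s, a)] for a in actions)

def observe(s, a, r, s_next, actions):
    # Record one experienced transition and queue it if its
    # temporal-difference error is large enough to matter.
    model[(s, a)] = (r, s_next)
    predecessors[s_next].add((s, a))
    priority = abs(r + GAMMA * greedy_value(s_next, actions) - Q[(s, a)])
    if priority > THETA:
        heapq.heappush(pqueue, (-priority, (s, a)))

def sweep(actions):
    # Propagate value changes backwards through the learned model,
    # highest-priority state-action pair first.
    for _ in range(N_SWEEPS):
        if not pqueue:
            break
        _, (s, a) = heapq.heappop(pqueue)
        r, s_next = model[(s, a)]
        Q[(s, a)] = r + GAMMA * greedy_value(s_next, actions)
        # The value of s may have changed: re-prioritize its predecessors.
        for (sp, ap) in predecessors[s]:
            rp, _ = model[(sp, ap)]
            priority = abs(rp + GAMMA * greedy_value(s, actions) - Q[(sp, ap)])
            if priority > THETA:
                heapq.heappush(pqueue, (-priority, (sp, ap)))
```

Because the model is tabular over the abstract states, each backup is a cheap dictionary lookup, which is what lets planning keep pace with experience; the small backups used in the paper reduce the per-update cost further by backing up through one successor sample at a time rather than a full expectation.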

