Fast and slow curiosity for high-level exploration in reinforcement learning

Bougie Nicolas; Ichise Ryutaro

首页> 外文期刊>Applied Intelligence: The International Journal of Artificial Intelligence, Neural Networks, and Complex Problem-Solving Technologies >Fast and slow curiosity for high-level exploration in reinforcement learning

【24h】

Fast and slow curiosity for high-level exploration in reinforcement learning

机译：加固学习中的高级别探索快速和缓慢的好奇心

获取原文

获取原文并翻译 | 示例

获取外文期刊封面封底 >>

开具论文收录证明 >>

文献代查 >>

团队文献服务 >>

页面导航

摘要
著录项
相似文献
相关主题

摘要

Deep reinforcement learning (DRL) algorithms rely on carefully designed environment rewards that are extrinsic to the agent. However, in many real-world scenarios rewards are sparse or delayed, motivating the need for discovering efficient exploration strategies. While intrinsically motivated agents hold promise of better local exploration, solving problems that require coordinated decisions over long-time horizons remains an open problem. We postulate that to discover such strategies, a DRL agent should be able to combine local and high-level exploration behaviors. To this end, we introduce the concept of fast and slow curiosity that aims to incentivize long-time horizon exploration. Our method decomposes the curiosity bonus into a fast reward that deals with local exploration and a slow reward that encourages global exploration. We formulate this bonus as the error in an agent's ability to reconstruct the observations given their contexts. We further propose to dynamically weight local and high-level strategies by measuring state diversity. We evaluate our method on a variety of benchmark environments, including Minigrid, Super Mario Bros, and Atari games. Experimental results show that our agent outperforms prior approaches in most tasks in terms of exploration efficiency and mean scores.

机译：None

著录项

来源
《Applied Intelligence: The International Journal of Artificial Intelligence, Neural Networks, and Complex Problem-Solving Technologies 》 |2021年第2期| 共22页
作者
Bougie Nicolas; Ichise Ryutaro;
展开▼
作者单位

Natl Inst Informat Tokyo Japan;

Natl Inst Informat Tokyo Japan;

展开▼
收录信息
原文格式 PDF
正文语种 eng
中图分类自动化技术、计算机技术 ;
关键词
Reinforcement learning; Exploration; Autonomous exploration; Curiosity in reinforcement learning;

机译：加强学习;探索;自主探索;钢筋学习的好奇心;

相似文献

外文文献
中文文献
专利

1. A Graph-Based Reinforcement Learning Method with Converged State Exploration and Exploitation [J] . Han Li, Tianding Chen, Hualiang Teng, 工程与科学中的计算机建模(英文) . 2019 ,第002期
2. A Data-driven Method for Fast AC Optimal Power Flow Solutions via Deep Reinforcement Learning [J] . Yuhao Zhou, Bei Zhang, Chunlei Xu, 现代电力系统与清洁能源学报(英文) . 2020 ,第006期
3. Fast Conflict Resolution Based on Reinforcement Learning in Multi-agent System [J] . PIAOSonghao, HONGBingrong, CHUHaitao 电子学报：英文版 . 2004 ,第001期
4. Modified slow-fast analysis method for slow-fast dynamical systems with two scales in frequency domain [J] . Zhengdi Zhang, Zhangyao Chen, Qinsheng Bi 力学快报(英文版) . 2019 ,第006期
5. SLOW MANIFOLD AND PARAMETER ESTIMATION FOR A NONLOCAL FAST-SLOW DYNAMICAL SYSTEM WITH BROWNIAN MOTION [J] . Hina ZULFIQAR, Ziying HE, Meihua YANG, 数学物理学报（英文版） . 2021 ,第004期
6. Random curiosity-driven exploration in deep reinforcement learning [J] . Li Jing, Shi Xinxin, Li Jiehao, Neurocomputing . 2020 ,第Deca22期

机译：深度加固学习中的随机效果驱动探索
7. Stimulus sampling as an exploration mechanism for fast reinforcement learning [J] . Boris B. Vladimirskiy, Eleni Vasilaki, Robert Urbanczik, Biological Cybernetics . 2009 ,第4期

机译：刺激采样作为快速强化学习的探索机制
8. Fast and Robust Learning by Reinforcement Signals: Explorations in the Insect Brain [J] . Ramon Huerta, Thomas Nowotny Neural computation . 2009 ,第8期

机译：通过强化信号快速而稳健的学习：昆虫大脑中的探索
9. Attention-Based Curiosity-Driven Exploration in Deep Reinforcement Learning [C] . Patrik Reizinger, Márton Szemenyei IEEE International Conference on Acoustics, Speech and Signal Processing . 2020

机译：深度强化学习中基于注意力的好奇心驱动探索
10. Intrinsic Curiosity in Reinforcement Learning by Improving Next State Prediction [D] . Lobo, Paul Lewis. 2020

机译：通过改善下一个状态预测来加强学习的内在好奇心
11. Curiosity driven reinforcement learning for motion planning on humanoids [O] . Mikhail Frank, Jürgen Leitner, Marijn Stollenga, 2013

机译：好奇心驱动的强化学习针对类人动物的运动计划
12. Fast and robust learning by reinforcement signals: explorations in the insect brain [O] . Huerta Ramón, Nowotny Thomas 2009

机译：通过强化信号进行快速而强大的学习：昆虫大脑中的探索
13. Learning State Features from Policies to Bias Exploration in Reinforcement Learning [R] . Singer, B. , Veloso, M. 1999

机译：学习国家特色从政策到强化学习中的偏见探索

Fast and slow curiosity for high-level exploration in reinforcement learning

摘要

著录项

相似文献

相关主题

期刊订阅