Efficient Exploration by Novelty-Pursuit

Abstract

Efficient exploration is essential for reinforcement learning in tasks with huge state spaces and long planning horizons. Recent approaches to this issue include intrinsically motivated goal exploration processes (IMGEP) and maximum state entropy exploration (MSEE). In this paper, we propose a goal-selection criterion for IMGEP based on the principle of MSEE, which results in a new exploration method, novelty-pursuit. Novelty-pursuit performs exploration in two stages: first, it selects a seldom-visited state as the target for the goal-conditioned exploration policy, driving the agent to the boundary of the explored region; then, it takes random actions to explore the unexplored region. We demonstrate the effectiveness of the proposed method in environments ranging from simple mazes and MuJoCo tasks to the long-horizon video game SuperMarioBros. Experimental results show that the proposed method outperforms state-of-the-art approaches based on curiosity-driven exploration.
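The two-stage procedure described above can be made concrete with a small sketch. The toy GridWorld, the greedy reach_goal stand-in for the learned goal-conditioned policy, and the count-based goal choice below are illustrative assumptions, not the paper's implementation; they only show the shape of the loop: pick a seldom-visited state as the goal, drive the agent there, then act randomly to push past the explored boundary.

```python
import random
from collections import defaultdict

class GridWorld:
    """Toy deterministic grid; states are (x, y) cells."""
    def __init__(self, size=10):
        self.size = size
        self.state = (0, 0)

    def reset(self):
        self.state = (0, 0)
        return self.state

    def step(self, action):
        x, y = self.state
        dx, dy = [(0, 1), (0, -1), (1, 0), (-1, 0)][action]
        self.state = (min(max(x + dx, 0), self.size - 1),
                      min(max(y + dy, 0), self.size - 1))
        return self.state

def reach_goal(env, goal, counts, max_steps=40):
    """Stage 1: move to the selected goal state.

    A greedy placeholder for the goal-conditioned exploration policy.
    """
    state = env.reset()
    counts[state] += 1
    for _ in range(max_steps):
        if state == goal:
            break
        x, y = state
        gx, gy = goal
        # Step greedily toward the goal (a learned policy in the paper).
        if x != gx:
            action = 2 if gx > x else 3
        else:
            action = 0 if gy > y else 1
        state = env.step(action)
        counts[state] += 1

def novelty_pursuit(episodes=200, random_steps=20, seed=0):
    random.seed(seed)
    env = GridWorld()
    counts = defaultdict(int)
    counts[env.reset()] += 1
    for _ in range(episodes):
        # Goal selection: the least-visited state approximates the
        # boundary of the explored region (count-based surrogate for
        # the MSEE-motivated criterion).
        goal = min(counts, key=counts.get)
        reach_goal(env, goal, counts)
        # Stage 2: random actions explore past the boundary.
        for _ in range(random_steps):
            state = env.step(random.randrange(4))
            counts[state] += 1
    return counts

if __name__ == "__main__":
    visits = novelty_pursuit()
    print(f"visited {len(visits)} distinct states")
```

In the paper, the goal-conditioned policy is trained rather than hard-coded, and goal selection follows the MSEE-based criterion; the visit-count minimum used here is only a simple stand-in for "seldom visited".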
