6th Iberian Conference on Information Systems and Technologies

Hierarchical Reinforcement Learning: Learning sub-goals and state-abstraction



Abstract

In this paper we present a method that allows an agent to discover and create temporal abstractions autonomously. Our method is based on the idea that, to reach the goal, the agent must pass through relevant states that we interpret as subgoals. To detect useful subgoals, our method computes intersections between several paths leading to the goal. Our experiments focus on domains widely used in the study of temporal abstraction: several versions of the room-to-room navigation problem. We found that, in the problems tested, an agent learns more rapidly by automatically discovering subgoals and creating abstractions.
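The core idea in the abstract, treating states shared by many successful paths as subgoal candidates, can be illustrated with a short sketch. The Python snippet below is not the authors' implementation: the function name find_subgoal_candidates, the min_fraction threshold, and the tuple-coordinate grid states are illustrative assumptions. It counts how many of the successful trajectories each state appears in and returns the states (other than the common start and goal) that lie on nearly all of them, such as a doorway cell in a room-to-room grid world.

```python
from collections import Counter
from typing import Hashable, List, Sequence


def find_subgoal_candidates(
    trajectories: List[Sequence[Hashable]],
    min_fraction: float = 0.9,
) -> List[Hashable]:
    """Return states occurring in at least `min_fraction` of the successful
    trajectories, excluding the shared start and goal states.

    This is a sketch of an intersection-style heuristic (not necessarily the
    paper's exact procedure): states that almost every successful path must
    pass through are promising subgoals around which temporally extended
    actions can be built.
    """
    if not trajectories:
        return []

    # Count in how many distinct trajectories each state appears.
    counts: Counter = Counter()
    for traj in trajectories:
        for state in set(traj):
            counts[state] += 1

    # The start and goal appear in every trajectory by construction; skip them.
    excluded = {trajectories[0][0], trajectories[0][-1]}

    threshold = min_fraction * len(trajectories)
    return [s for s, c in counts.items() if c >= threshold and s not in excluded]


if __name__ == "__main__":
    # Toy example: three paths in a two-room grid world that all pass
    # through the doorway cell (2, 3).
    paths = [
        [(0, 0), (1, 0), (2, 1), (2, 2), (2, 3), (3, 3), (4, 4)],
        [(0, 0), (0, 1), (1, 2), (2, 3), (3, 4), (4, 4)],
        [(0, 0), (1, 1), (2, 2), (2, 3), (2, 4), (3, 4), (4, 4)],
    ]
    print(find_subgoal_candidates(paths))  # -> [(2, 3)]
```

In an options-style hierarchical learner, each candidate returned this way could serve as the termination state of a new temporally extended action, which is the role subgoals typically play in hierarchical reinforcement learning.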
