Sequential Association Rule Mining for Autonomously Extracting Hierarchical Task Structures in Reinforcement Learning

Ghazanfari Behzad; Afghah Fatemeh; Taylor Matthew E.

首页> 外文期刊>Quality Control, Transactions >Sequential Association Rule Mining for Autonomously Extracting Hierarchical Task Structures in Reinforcement Learning

【24h】

Sequential Association Rule Mining for Autonomously Extracting Hierarchical Task Structures in Reinforcement Learning

机译：序贯关联规则挖掘加强学习中的分层任务结构

获取原文

获取原文并翻译 | 示例

开具论文收录证明 >>

页面导航

摘要
著录项
引文网络
相似文献
相关主题

摘要

Reinforcement learning (RL) techniques, while often powerful, can suffer from slow learning speeds, particularly in high dimensional spaces or in environments with sparse rewards. The decomposition of tasks into a hierarchical structure holds the potential to significantly speed up learning, generalization, and transfer learning. However, the current task decomposition techniques often cannot extract hierarchical task structures without relying on high-level knowledge provided by an expert (e.g., using dynamic Bayesian networks (DBNs) in factored Markov decision processes), which is not necessarily available in autonomous systems. In this paper, we propose a novel method based on Sequential Association Rule Mining that can extract Hierarchical Structure of Tasks in Reinforcement Learning (SARM-HSTRL) in an autonomous manner for both Markov decision processes (MDPs) and factored MDPs. The proposed method leverages association rule mining to discover the causal and temporal relationships among states in different trajectories and extracts a task hierarchy that captures these relationships among sub-goals as termination conditions of different sub-tasks. We prove that the extracted hierarchical policy offers a hierarchically optimal policy in MDPs and factored MDPs. It should be noted that SARM-HSTRL extracts this hierarchical optimal policy without having dynamic Bayesian networks in scenarios with a single task trajectory and also with multiple tasks & x2019; trajectories. Furthermore, we show theoretically and empirically that the extracted hierarchical task structure is consistent with trajectories and provides the most efficient, reliable, and compact structure under appropriate assumptions. The numerical results compare the performance of the proposed SARM-HSTRL method with conventional HRL algorithms in terms of the accuracy in detecting the sub-goals, the validity of the extracted hierarchies, and the speed of learning in several testbeds. The key capabilities of SARM-HSTRL including handling multiple tasks and autonomous hierarchical task extraction can lead to the application of this HRL method in reusing, transferring, and generalization of knowledge in different domains.

机译：强化学习（RL）技术，而通常强大，可以遭受慢的学习速度，特别是在高维空间或具有稀疏奖励的环境中。任务分解成层级结构的潜力能够显着加速学习，泛化和转移学习。然而，目前的任务分解技术通常不能提取分层任务结构，而不依赖于专家提供的高级知识（例如，使用因子马尔可夫决策过程中的动态贝叶斯网络（DBN）），这不一定在自主系统中可用。在本文中，我们提出了一种基于<斜体>顺序关联规则挖掘的新方法，可以提取增强学习中的任务的分层结构（<斜体> sarm-hstrl ）以马尔可夫决策过程（MDP）和因子MDP为自主方式。该方法利用关联规则挖掘来发现不同轨迹中的状态之间的因果关系和时间关系，并提取一个任务层次结构，该任务层次结构将子目标之间的这些关系捕获到不同子任务的终止条件。我们证明了提取的分层策略在MDP和因子MDP中提供了分层最佳策略。应该注意的是，<斜体> SARM-HSTRL 提取该分层最优策略，而不在方案中具有动态贝叶斯网络，其中单个任务轨迹以及多个任务和X2019;轨迹。此外，我们理论上和经验地显示提取的分层任务结构与轨迹一致，并在适当的假设下提供最有效，可靠和紧凑的结构。数值结果比较了在检测子目标的准确性，提取的层次结构的准确性方面，在传统的HRL算法中与传统的HRL算法的性能进行比较，以及提取的层次结构的有效性以及几个学习的速度试验台。 SARM-HSTRL 包括处理多个任务和自主分层任务提取的关键功能可能导致该HRL方法在重用，转移和泛化不同域中的知识中的应用。

著录项

来源
《Quality Control, Transactions》 |2020年第2020期|11782-11799|共18页
作者
Ghazanfari Behzad; Afghah Fatemeh; Taylor Matthew E.;
展开▼
作者单位

No Arizona Univ Sch Informat Comp & Cyber Secur Flagstaff AZ 86011 USA;

No Arizona Univ Sch Informat Comp & Cyber Secur Flagstaff AZ 86011 USA;

Washington State Univ Sch Elect Engn & Comp Sci Pullman WA 99163 USA;

展开▼
收录信息
原文格式 PDF
正文语种 eng
中图分类
关键词
Association rule mining; extracting task structure; hierarchical reinforcement learning;

机译：协会规则挖掘;提取任务结构;等级加固学习;

相似文献

外文文献
中文文献
专利

1. Neural signature of hierarchically structured expectations predicts clustering and transfer of rule sets in reinforcement learning [J] . Collins Anne Gabrielle Eva, Frank Michael Joshua Cognition: International Journal of Cognitive Psychology . 2016,第Null期

机译：分层结构的期望的神经签名可预测强化学习中规则集的聚类和转移
2. Fuzzy OLAP association rules mining-based modular reinforcement learning approach for multiagent systems [J] . Kaya M., Alhajj R. IEEE transactions on systems, man, and cybernetics. Part B, Cybernetics . 2005,第2期

机译：基于模糊OLAP关联规则挖掘的多主体系统模块化强化学习方法
3. Mining Fuzzy Association Rules on Has-A and Is-A Hierarchical Structures [J] . Been-Chian Chien, Ming-Huang Zhong, Jeng-Jung Wang Journal of Advanced Computatioanl Intelligence and Intelligent Informatics . 2007,第4期

机译：在Has-A和Is-A层次结构上挖掘模糊关联规则
4. Extension of Hierarchical Information Acquirement by Association Rule Mining from Semi-Structured Data [C] . Ryosuke Saga, Chihiro Sugaya, Kodai Kitami International Conference on Data Mining . 2010

机译：从半结构化数据的关联规则挖掘扩展分层信息获取
5. Efficient sequential and parallel algorithms for mining association rules in text databases [D] . Holt, John D. 2003

机译：用于挖掘文本数据库中关联规则的高效顺序和并行算法
6. Disentangling sequential from hierarchical learning in Artificial Grammar Learning: Evidence from a modified Simon Task [O] . Maria Vender, Diego Gabriel Krivochen, Arianna Compostella, 2020

机译：从人工语法学习中的分层学习中解散顺序：来自修改的西蒙任务的证据
7. A Hierarchical Model for Association Rule Mining of Sequential Events: An Approach to Automated Medical Symptom Prediction [O] . McCormick Tyler H., Rudin Cynthia, Madigan David B. 2011

机译：顺序事件的关联规则挖掘的分层模型：一种自动医学症状预测的方法

Sequential Association Rule Mining for Autonomously Extracting Hierarchical Task Structures in Reinforcement Learning

摘要

著录项

引文网络

相似文献

相关主题

期刊订阅