
Tree-Based Hierarchical Reinforcement Learning



Abstract

In this thesis, the author investigates methods for speeding up automatic control algorithms. Specifically, he provides new abstraction techniques for Reinforcement Learning and Semi-Markov Decision Processes (SMDPs). He also introduces the use of policies as temporally abstract actions; this differs from previous definitions of temporally abstract actions in that his policies have no termination criteria. He provides an approach for processing previously solved problems to extract these policies, and contributes a method for using supplied or extracted policies to guide and speed up the solving of new problems. He treats policy extraction as a supervised learning task and introduces the Lumberjack algorithm, which extracts repeated substructure within a decision tree. He then introduces the TTree algorithm, which combines state and temporal abstraction to increase problem-solving speed on new problems. TTree solves SMDPs by using both user- and machine-supplied policies as temporally abstract actions while generating its own tree-based abstract state representation. By combining state and temporal abstraction in this way, TTree is the only known SMDP algorithm able to ignore irrelevant or harmful subregions within a supplied abstract action while still making use of the abstract action's other parts.
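The thesis algorithms themselves (Lumberjack, TTree) are not reproduced here, but the core notion the abstract relies on, treating a supplied policy as a temporally abstract action inside SMDP-style Q-learning, can be sketched. Everything below is an illustrative assumption, not the thesis's method: the corridor MDP, the hand-supplied "always move right" policy, and the fixed k-step cutoff on macro execution (the thesis's abstract actions notably have no termination criteria; the cutoff is purely for a runnable toy).

```python
import random

# Toy corridor MDP: states 0..N-1, goal at N-1, reward 1 on reaching the goal.
# Illustrative sketch only (NOT the thesis's TTree algorithm): a supplied
# policy is offered to the learner as one more action, alongside primitives.
N = 8
GOAL = N - 1
GAMMA = 0.9

def step(s, a):
    """Primitive transition: a is -1 (left) or +1 (right), clamped to the corridor."""
    s2 = max(0, min(GOAL, s + a))
    return s2, (1.0 if s2 == GOAL else 0.0)

supplied_policy = lambda s: +1  # hand-supplied policy: always move right (assumption)

def run_policy(s, k=3):
    """Execute the supplied policy for up to k primitive steps (an SMDP macro-step),
    accumulating discounted reward. Returns (next state, reward, gamma**steps)."""
    total, disc = 0.0, 1.0
    for _ in range(k):
        if s == GOAL:
            break
        s, r = step(s, supplied_policy(s))
        total += disc * r
        disc *= GAMMA
    return s, total, disc

ACTIONS = ["left", "right", "follow"]  # "follow" = use the supplied policy as an action
Q = {(s, a): 0.0 for s in range(N) for a in ACTIONS}

random.seed(0)
for _ in range(2000):
    s = random.randrange(GOAL)          # start anywhere except the goal
    while s != GOAL:
        a = random.choice(ACTIONS)      # pure exploration, for simplicity
        if a == "follow":
            s2, r, disc = run_policy(s)
        else:
            s2, r = step(s, -1 if a == "left" else +1)
            disc = GAMMA
        # SMDP Q-learning backup: discount by gamma**(duration of the action)
        target = r + disc * max(Q[(s2, b)] for b in ACTIONS)
        Q[(s, a)] += 0.5 * (target - Q[(s, a)])
        s = s2

# In this deterministic toy, "follow" and "right" trace the same optimal
# trajectory, so their learned values coincide; "left" is strictly worse.
print(Q[(0, "right")], Q[(0, "follow")], Q[(0, "left")])
```

The point of the sketch is the backup rule: because a macro-step lasts several primitive steps, its bootstrap term is discounted by `gamma**duration` rather than a single `gamma`, which is what lets a planner treat whole policies as actions.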
