First Results with Instance-Based State Identification for Reinforcement Learning


Abstract

When a reinforcement learning agent's next course of action depends on information that is hidden from the sensors because of problems such as occlusion, restricted range, bounded field of view and limited attention, we say the agent suffers from the hidden state problem. State identification techniques use history information to uncover hidden state. Previous approaches to encoding history include finite state machines, recurrent neural networks, and genetic programming with indexed memory. A chief disadvantage of all these techniques is their long training time. This report presents instance-based state identification, a new approach to reinforcement learning with state identification that learns with far fewer training steps. Noting that learning with history and learning in continuous spaces share the property that they begin without knowing the granularity of the state space, the approach applies instance-based (or memory-based) learning to history sequences: instead of recording instances in a continuous geometrical space, we record instances in action-perception-reward sequence space. The first implementation of this approach, called Nearest Sequence Memory, learns with an order of magnitude fewer steps than several previous approaches.
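
To make the idea of recording instances in action-perception-reward sequence space more concrete, here is a minimal sketch of one plausible reading of the abstract. It is not the report's implementation. The `Step` layout, the function names `match_length` and `nearest_sequences`, and the parameter `k` are all hypothetical, and the similarity measure (length of the exactly matching run of preceding steps) is an assumption chosen for illustration.

```python
from typing import List, Tuple

# One recorded instance: (action, observation, reward). Hypothetical layout.
Step = Tuple[str, str, float]


def match_length(history: List[Step], i: int, j: int) -> int:
    """Count how many consecutive steps ending at i and at j are identical."""
    n = 0
    while i - n >= 0 and j - n >= 0 and history[i - n] == history[j - n]:
        n += 1
    return n


def nearest_sequences(history: List[Step], k: int) -> List[int]:
    """Return indices of up to k past steps whose preceding sequence best
    matches the sequence ending at the most recent step."""
    current = len(history) - 1
    scored = [(match_length(history, i, current), i) for i in range(current)]
    # Longest match first; ties go to the more recent instance.
    scored.sort(key=lambda s: (-s[0], -s[1]))
    # Assumption: instances with no matching steps are not neighbours.
    return [i for n, i in scored[:k] if n > 0]
```

In this sketch, two moments that look identical to the sensors are told apart by how far back their histories agree, which is the role the abstract assigns to history information. How the selected neighbours are then used to choose an action is not stated in the abstract and is left out here.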

Bibliographic details

Author: McCallum, R. A.
Year: 1994
Total pages: 28
Original format: PDF
Language: English
Chinese Library Classification: Industrial technology
Keywords
Algorithms; Machine learning; Neural nets; Perception; Problem solving; Reinforcement; Robotics; Self adaptive control systems; Automata theory; Coding; Education; Field of view; Mapping; Self organizing systems; Sensors; State vectors;

Similar literature

1. Hidden state and reinforcement learning with instance-based state identification [J]. McCallum R. A. IEEE Transactions on Systems, Man, and Cybernetics, Part B. 1996, No. 3
2. Improving the Robustness of Instance-Based Reinforcement Learning Robots by Metalearning [J]. Toshiyuki Yasuda, Kousuke Araki, Kazuhiro Ohkura. Journal of Advanced Computational Intelligence and Intelligent Informatics. 2011, No. 8a87
3. Preservation and Application of Acquired Knowledge Using Instance-Based Reinforcement Learning for Multi-Robot Systems [J]. Junki Sakanoue, Toshiyuki Yasuda, Kazuhiro Ohkura. Journal of Advanced Computational Intelligence and Intelligent Informatics. 2011, No. 8a87
4. Reinforcement learning and instance-based learning approaches to modeling human decision making in a prognostic foraging task [C]. Chelian Suhas E., Paik Jaehyon, Pirolli Peter. Joint IEEE International Conference on Development and Learning and Epigenetic Robotics. 2015
5. Reinforcement Learning and Recurrent Reinforcement Learning for Dynamic Portfolio Optimization [D]. Almahdi, Saud. 2019
6. Integration of Instance-Based Learning and Text Mining for Identification of Potential Virus/Bacterium as Bio-terrorism Weapons [O]. Xiaohua Hu, Xiaodan Zhang, Daniel Wu
7. Experiments in Robot Control for an Instance-Based Reinforcement Learning Algorithm based on Prior Information [O]. Carlos H.C. Ribeiro, Elder M. Hemerly. 1999
8. Reinforcement Learning Applications to Combat Identification [R]. Mooren, E. M. 2017
