首页> 外文会议>Machinee learning >Self-improvement Based On Reinforcement Learning, Planning and Teaching

【24h】

Self-improvement Based On Reinforcement Learning, Planning and Teaching

机译：基于强化学习，计划和教学的自我完善

获取原文

获取原文并翻译 | 示例

页面导航

摘要
著录项
相似文献
相关主题

摘要

AHC-learning and Q-learning are slow learning methods. This paper investigates three extensions to the two basic learning algorithms. The three extensions are 1) experience replay, 2) learning action models for planning, and 3) teaching. The basic algorithms and their extensions were evaluated using a dynamic environment as a testbed. The environment is nontrivial and nondeter-ministic. The results show that the extensions can effectively improve the learning rate and in many cases even the asymptotic performance.

机译：AHC学习和Q学习是缓慢的学习方法。本文研究了两种基本学习算法的三个扩展。这三个扩展是1）体验重播，2）学习用于计划的动作模型以及3）教学。使用动态环境作为测试平台评估了基本算法及其扩展。环境是不平凡的，不确定的。结果表明，扩展可以有效地提高学习率，甚至在许多情况下甚至可以提高渐近性能。

著录项

来源
《Machinee learning》|1991年|323-327|共5页
会议地点 Evanston IL(US);Evanston IL(US)
作者
Long-Ji Lin;
展开▼
作者单位

School of Computer Science Carnegie Mellon University Pittsburgh, Pennsylvania 15213;

展开▼
会议组织
原文格式 PDF
正文语种 eng
中图分类计算机的应用;
关键词

相似文献

外文文献
中文文献
专利

1. Acceleration of game learning with prediction-based reinforcement learning - toward the emergence of planning behavior [J] . Yu Ohigashi, Takashi Omori, Koji Morikawa, 電子情報通信学会技術研究報告. ニュ-ロコンピュ-ティング. Neurocomputing . 2002,第627期

机译：通过基于预测的强化学习来加速游戏学习-朝计划行为的方向发展
2. Acceleration of game learning with prediction-based reinforcement learning - toward the emergence of planning behavior [J] . Yu Ohigashi, Takashi Omori, Koji Morikawa, 電子情報通信学会技術研究報告. ニュ-ロコンピュ-ティング. Neurocomputing . 2002,第627期

机译：基于预测的加强学习的游戏学习加速 - 朝向规划行为的出现
3. Automatic Treatment Planning in a Human-like manner: Operating Treatment Planning Systems by a Deep Reinforcement Learning based Virtual Treatment Planner [J] . Shen C., Gonzalez Y., Chen L., International Journal of Radiation Oncology, Biology, Physics . 2019,第1Suppla期

机译：以人类的方式自动治疗规划：由基于深度加强学习的虚拟治疗计划者操作治疗计划系统
4. Motion planning algorithm for non-holonomic autonomous underwater vehicle in disturbance using reinforcement learning and teaching method [C] . Kawano, H., Ura, Robotics and Automation, 2002. Proceedings. ICRA '02. IEEE International Conference on . 2002

机译：基于强化学习与教学方法的非完整自主水下航行器扰动运动规划算法
5. Model-Based Reinforcement Learning for Cooperative Multi-Agent Planning: Exploiting Hierarchies, Bias, and Temporal Sampling [D] . Ma, Aaron. 2020

机译：基于模型的合作多智能经纪人规划的强化学习：利用层次结构，偏见和时间采样
6. A Multitasking-Oriented Robot Arm Motion Planning Scheme Based on Deep Reinforcement Learning and Twin Synchro-Control [O] . Chuzhao Liu, Junyao Gao, Yuanzhen Bi, 2020

机译：基于深度强化学习和双同步控制的面向多任务的机器人手臂运动计划方案
7. Self-improving reactive agents based on reinforcement learning, planning and teaching [O] . Long-ji Lin 1992

机译：基于强化学习，计划和教学的自我改善反应剂

Self-improvement Based On Reinforcement Learning, Planning and Teaching

摘要

著录项

相似文献

相关主题

期刊订阅