Effective Transfer via Demonstrations in Reinforcement Learning: A Preliminary Study

Abstract

There are many successful methods for transferring information from one agent to another. One approach, taken in this work, is to have one (source) agent demonstrate a policy to a second (target) agent, and then have that second agent improve upon the policy. By allowing the target agent to observe the source agent's demonstrations, rather than relying on other types of direct knowledge transfer like Q-values, rules, or shared representations, we remove the need for the agents to know anything about each other's internal representation or have a shared language. In this work, we introduce a refinement to HAT, an existing transfer learning method, by integrating the target agent's confidence in its representation of the source agent's policy. Results show that a target agent can effectively 1) improve its initial performance relative to learning without transfer (jumpstart) and 2) improve its performance relative to the source agent (total reward). Furthermore, both the jumpstart and total reward are improved with this new refinement, relative to learning without transfer and relative to learning with HAT.
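The two evaluation quantities named in the abstract, jumpstart and total reward, can be illustrated with a minimal sketch. Everything specific here is an assumption for illustration and is not taken from the paper: the function names, the averaging window used for "initial performance", and the reward numbers are all hypothetical, and the reference curve could be a no-transfer learner or any other baseline.

```python
from statistics import mean


def jumpstart(transfer_rewards, baseline_rewards, window=5):
    """Gap in mean reward over the first `window` episodes.

    `window` is a hypothetical choice; the abstract only says
    "initial performance".
    """
    return mean(transfer_rewards[:window]) - mean(baseline_rewards[:window])


def total_reward(rewards):
    """Sum of per-episode rewards over the whole run."""
    return sum(rewards)


# Made-up per-episode rewards, for illustration only.
baseline = [1.0, 2.0, 3.0, 4.0, 5.0, 6.0]
with_transfer = [4.0, 5.0, 5.0, 6.0, 7.0, 7.0]

js = jumpstart(with_transfer, baseline, window=2)
gain = total_reward(with_transfer) - total_reward(baseline)
print(f"jumpstart={js}, total-reward gain={gain}")
```

A positive jumpstart means the learner with transfer starts out ahead of the reference curve; a positive total-reward gain means it accumulated more reward over the full run.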

Bibliographic details

Source: Association for the Advancement of Artificial Intelligence Symposium | 2016 | 434p | 7 pages
Authors: Zhaodong Wang; Matthew E. Taylor
Original format: PDF
CLC classification: TP18-53
Indexed: 2022-08-21 05:29:35

Similar literature

1. Transferring knowledge from human-demonstration trajectories to reinforcement learning [J]. Wang Guo-fang, Fang Zhou, Li Ping. Transactions of the Institute of Measurement and Control. 2018, No. 1
2. Shaping in reinforcement learning by knowledge transferred from human-demonstrations of a simple similar task [J]. Wang Guo-Fang, Fang Zhou, Li Ping. Journal of Intelligent & Fuzzy Systems: Applications in Engineering and Technology. 2018, No. 1
3. A Preliminary Study on the Relationship Between Iterative Learning Control and Reinforcement Learning [J]. Yueqing Zhang, Bing Chu, Zhan Shu. IFAC PapersOnLine. 2019, No. 29
4. Effective Transfer via Demonstrations in Reinforcement Learning: A Preliminary Study [C]. Zhaodong Wang, Matthew E. Taylor. Association for the Advancement of Artificial Intelligence Symposium. 2016
5. A Preliminary Study of Leo to Geo Transfers for Inclination Changes Using Libration Point Orbits [D]. Shepard, John Philip. 2020
6. Learning for a Robot: Deep Reinforcement Learning, Imitation Learning, Transfer Learning [O]. Jiang Hua, Liangcai Zeng, Gongfa Li. 2021
7. Learning to Transfer Learn: Reinforcement Learning-Based Selection for Adaptive Transfer Learning [O]. Linchao Zhu, Sercan Ö. Arık, Yi Yang. 2020
