首页> 外文期刊>IEEE Transactions on Cognitive and Developmental Systems >Combining Model-Based $Q$ -Learning With Structural Knowledge Transfer for Robot Skill Learning
【24h】

Combining Model-Based $Q$ -Learning With Structural Knowledge Transfer for Robot Skill Learning

机译:将基于模型的 $ Q $ -学习与结构化知识转移相结合,以进行机器人技能学习

获取原文
获取原文并翻译 | 示例
       

摘要

Learning skills autonomously is a particularly important ability for an autonomous robot. A promising approach is reinforcement learning (RL) where agents learn policy through interaction with its environment. One problem of RL algorithm is how to tradeoff the exploration and exploitation. Moreover, multiple tasks also make a great challenge to robot learning. In this paper, to enhance the performance of RL, a novel learning framework integrating RL with knowledge transfer is proposed. Three basic components are included: 1) probability policy reuse; 2) dynamic model learning; and 3) model-based Q-learning. In this framework, the prelearned skills are leveraged for policy reuse and dynamic learning. In model-based Q-learning, the Gaussian process regression is used to approximate the Q-value function so as to suit for robot control. The prior knowledge retrieved from knowledge transfer is integrated into the model-based Q-learning to reduce the needed learning time. Finally, a human-robot handover experiment is performed to evaluate the learning performance of this learning framework. Experiment results show that fewer exploration is needed to obtain a high expected reward, due to the prior knowledge obtained from knowledge transfer.
机译:自主学习技能是自主机器人的一项特别重要的能力。一种有前途的方法是强化学习(RL),其中代理通过与其环境的交互来学习策略。 RL算法的一个问题是如何权衡探索和开发。此外,多项任务也给机器人学习带来了巨大挑战。为了提高RL的性能,提出了一种将RL与知识转移相结合的新型学习框架。其中包括三个基本组成部分:1)概率策略重用; 2)动态模型学习; 3)基于模型的Q学习。在此框架中,将预先学习的技能用于策略重用和动态学习。在基于模型的Q学习中,高斯过程回归用于近似Q值函数,从而适合机器人控制。从知识转移中获取的先验知识被集成到基于模型的Q学习中,以减少所需的学习时间。最后,执行人机交互实验以评估该学习框架的学习性能。实验结果表明,由于从知识转移中获得了先验知识,因此需要较少的探索来获得较高的期望回报。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号