首页> 外国专利> DEEP REINFORCEMENT LEARNING FOR ROBOTIC MANIPULATION

DEEP REINFORCEMENT LEARNING FOR ROBOTIC MANIPULATION

机译:机器人操纵深度增强学习

摘要

Using large-scale reinforcement learning to train a policy model that can be utilized by a robot in performing a robotic task in which the robot interacts with one or more environmental objects. In various implementations, off-policy deep reinforcement learning is used to train the policy model, and the off-policy deep reinforcement learning is based on self-supervised data collection. The policy model can be a neural network model. Implementations of the reinforcement learning utilized in training the neural network model utilize a continuous-action variant of Q-learning. Through techniques disclosed herein, implementations can learn policies that generalize effectively to previously unseen objects, previously unseen environments, etc.
机译:使用大规模增强学习培训可以通过机器人执行的策略模型进行执行机器人任务,其中机器人与一个或多个环境对象相互作用。在各种实现中,违规的深度加强学习用于培训政策模型,违规的深度加强学习是基于自我监督的数据收集。策略模型可以是神经网络模型。在训练神经网络模型中使用的增强学习的实施利用Q-Learning的连续动作变体。通过本文公开的技术,实现可以学习概括到以前未经看过的对象,以前看不见的环境等的策略。

著录项

相似文献

  • 专利
  • 外文文献
  • 中文文献
获取专利

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号