REINFORCEMENT LEARNING DEVICE, REINFORCEMENT LEARNING SYSTEM, OBJECT MANIPULATION DEVICE, MODEL GENERATION METHOD, AND REINFORCEMENT LEARNING PROGRAM

Abstract

Provided are a reinforcement learning device, a reinforcement learning system, an object manipulation device, a model generation method, and a reinforcement learning program that can increase the probability that a prescribed manipulation of an object succeeds. The reinforcement learning device has at least one memory and at least one processor. The at least one processor is configured so as to be capable of: inputting, to a training model that outputs information for controlling the operation of an end effector, (i) information relating to a captured image taken by an imaging device for which at least one of position and orientation changes, and (ii) information relating to a target object image indicating the object to be manipulated by the end effector; and updating a parameter of the training model on the basis of the result of manipulating the object when the operation of the end effector is controlled on the basis of the information output by the training model.
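
The loop described in the abstract (feed captured-image and target-image information to a model, control the end effector from the model output, then update a model parameter from the manipulation result) can be illustrated with a toy sketch. Everything below is an assumption made for illustration and is not taken from the patent: the joint one-hot feature encoding, the linear softmax policy, the REINFORCE-style update, the binary success reward, and all names such as `encode`, `correct_action`, and `LinearSoftmaxPolicy` are hypothetical.

```python
import math
import random

N_VIEWS, N_TARGETS, N_ACTIONS = 2, 2, 2  # toy sizes (assumed, not from the patent)


def encode(view, target):
    """Joint one-hot over (camera view, target object): a hypothetical stand-in
    for 'information relating to' the captured image and the target object image."""
    features = [0.0] * (N_VIEWS * N_TARGETS)
    features[view * N_TARGETS + target] = 1.0
    return features


def correct_action(view, target):
    """Toy ground truth: the side on which the target appears flips with the view."""
    return (view + target) % N_ACTIONS


def softmax(logits):
    peak = max(logits)
    exps = [math.exp(v - peak) for v in logits]
    total = sum(exps)
    return [e / total for e in exps]


class LinearSoftmaxPolicy:
    """Hypothetical 'training model': features in, action probabilities out."""

    def __init__(self, n_features, n_actions, lr=0.5):
        self.weights = [[0.0] * n_features for _ in range(n_actions)]
        self.lr = lr

    def action_probs(self, features):
        logits = [sum(w * x for w, x in zip(row, features)) for row in self.weights]
        return softmax(logits)

    def update(self, features, probs, action, reward):
        # REINFORCE step: reward * gradient of log pi(action | features).
        for a, row in enumerate(self.weights):
            grad_logit = (1.0 if a == action else 0.0) - probs[a]
            for i, x in enumerate(features):
                row[i] += self.lr * reward * grad_logit * x


def train(episodes=2000, seed=0):
    rng = random.Random(seed)
    policy = LinearSoftmaxPolicy(N_VIEWS * N_TARGETS, N_ACTIONS)
    for _ in range(episodes):
        view = rng.randrange(N_VIEWS)      # imaging device pose changes per episode
        target = rng.randrange(N_TARGETS)  # which object should be manipulated
        features = encode(view, target)
        probs = policy.action_probs(features)
        action = rng.choices(range(N_ACTIONS), weights=probs)[0]
        # Manipulation result: 1.0 on success, 0.0 on failure (assumed reward).
        reward = 1.0 if action == correct_action(view, target) else 0.0
        policy.update(features, probs, action, reward)
    return policy


def greedy_action(policy, view, target):
    probs = policy.action_probs(encode(view, target))
    return max(range(N_ACTIONS), key=lambda a: probs[a])
```

Calling `train()` and then `greedy_action(policy, view, target)` for each view and target pair shows whether the toy policy has learned to pick the action that succeeds from every camera pose. The joint encoding is used here because the correct action depends on the view and the target together, which a linear model over two separately concatenated one-hot vectors could not represent.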

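Bibliographic data

Publication number: WO2022009859A1

Patent type:
Publication date: 2022-01-13

Original format: PDF
Applicant/Patentee: PREFERRED NETWORKS INC.

Application number: WO2021JP25392
Inventor: FUJITA YASUHIRO

Filing date: 2021-07-06
Classification: B25J13/08
Country: JP
Indexed: 2022-08-24 23:22:07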
