GENERATIVE NEURAL NETWORK SYSTEMS FOR GENERATING INSTRUCTION SEQUENCES TO CONTROL AN AGENT PERFORMING A TASK

Abstract

A generative adversarial neural network system to provide a sequence of actions for performing a task. The system comprises a reinforcement learning neural network subsystem coupled to a simulator and a discriminator neural network. The reinforcement learning neural network subsystem includes a policy recurrent neural network to, at each of a sequence of time steps, select one or more actions to be performed according to an action selection policy, each action comprising one or more control commands for the simulator. The simulator is configured to implement the control commands for the time steps to generate a simulator output. The discriminator neural network is configured to discriminate between the simulator output and training data, to provide a reward signal for the reinforcement learning. The simulator may be a non-differentiable simulator, for example a computer program to produce an image or audio waveform, or a program to control a robot or vehicle.
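The loop described in the abstract can be illustrated with a toy sketch. This is not the patented implementation: the 4-cell canvas, the `TARGET` example, the logistic-regression `Discriminator`, and the per-step Bernoulli `Policy` (a stand-in for the policy recurrent neural network) are all hypothetical simplifications, and the update rule is plain REINFORCE with a running baseline, chosen here only because it does not require gradients through the simulator.

```python
import math
import random

STEPS = 4                # hypothetical: one control command per time step
TARGET = [1, 0, 1, 0]    # hypothetical "training data" (a single real example)

def sigmoid(x):
    x = max(-60.0, min(60.0, x))  # clamp to avoid overflow in exp
    return 1.0 / (1.0 + math.exp(-x))

def simulator(commands):
    """Non-differentiable simulator: command t paints cell t (1) or leaves it blank (0)."""
    canvas = [0] * STEPS
    for t, cmd in enumerate(commands):
        if cmd == 1:
            canvas[t] = 1
    return canvas

class Discriminator:
    """Logistic regression over cells; score is P(input came from training data)."""
    def __init__(self):
        self.w = [0.0] * STEPS
        self.b = 0.0

    def score(self, canvas):
        return sigmoid(sum(w * x for w, x in zip(self.w, canvas)) + self.b)

    def update(self, canvas, label, lr=0.1):
        err = label - self.score(canvas)  # gradient of the log-likelihood w.r.t. the logit
        for i, x in enumerate(canvas):
            self.w[i] += lr * err * x
        self.b += lr * err

class Policy:
    """Per-step Bernoulli policy (assumption: replaces the recurrent policy network)."""
    def __init__(self):
        self.logits = [0.0] * STEPS

    def sample(self, rng):
        return [1 if rng.random() < sigmoid(l) else 0 for l in self.logits]

    def reinforce(self, commands, advantage, lr=0.5):
        for t, cmd in enumerate(commands):
            # d/d(logit) of log p(cmd) for a Bernoulli is cmd - sigmoid(logit)
            self.logits[t] += lr * advantage * (cmd - sigmoid(self.logits[t]))

def train(iterations=3000, seed=0):
    rng = random.Random(seed)
    policy, disc = Policy(), Discriminator()
    baseline = 0.0
    for _ in range(iterations):
        commands = policy.sample(rng)       # select actions for all time steps
        output = simulator(commands)        # run the commands in the simulator
        reward = disc.score(output)         # discriminator output is the reward signal
        policy.reinforce(commands, reward - baseline)
        baseline += 0.05 * (reward - baseline)
        disc.update(TARGET, 1.0)            # training data labelled real
        disc.update(output, 0.0)            # simulator output labelled fake
    return policy, disc

if __name__ == "__main__":
    policy, _ = train()
    print([round(sigmoid(l), 2) for l in policy.logits])
```

Because the reward comes only from the discriminator's score on the finished simulator output, nothing in the policy update depends on the simulator being differentiable. Adversarial training of this kind is not guaranteed to converge, so the printed probabilities should be read as illustrative only.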

Bibliographic data

Publication number: EP3698283A1
Publication date: 2020-08-26
Original format: PDF
Applicant/patentee: DEEPMIND TECHNOLOGIES LIMITED
Application number: EP20190704793
Inventors: GANIN IAROSLAV; KULKARNI TEJAS DATTATRAYA; VINYALS ORIOL; ESLAMI SEYED MOHAMMADALI
Filing date: 2019-02-11
Classification: G06N3; G06N3/04; G06N3/08
Country: EP
Date added to database: 2022-08-21 11:40:05
