首页> 外国专利> MULTI-TASK NEURAL NETWORK SYSTEMS WITH TASK-SPECIFIC POLICIES AND A SHARED POLICY

MULTI-TASK NEURAL NETWORK SYSTEMS WITH TASK-SPECIFIC POLICIES AND A SHARED POLICY

机译：具有任务特定策略和共享策略的多任务神经网络系统

页面导航

摘要
著录项
相似文献

摘要

A method is proposed for training a multitask computer system, such as a multitask neural network system. The system comprises a set of trainable workers and a shared module. The trainable workers and shared module are trained on a plurality of different tasks, such that each worker learns to perform a corresponding one of the tasks according to a respective task policy, and said shared policy network learns a multitask policy which represents common behavior for the tasks. The coordinated training is performed by optimizing an objective function comprising, for each task: a reward term indicative of an expected reward earned by a worker in performing the corresponding task according to the task policy; and at least one entropy term which regularizes the distribution of the task policy towards the distribution of the multitask policy.

机译：提出了一种用于训练诸如多任务神经网络系统的多任务计算机系统的方法。该系统包括一组可训练的工人和一个共享模块。可训练的工作人员和共享模块在多个不同的任务上受到训练，以使每个工作人员都学会根据各自的任务策略执行相应的任务之一，并且所述共享策略网络学习一个多任务策略，该策略代表该任务的通用行为。任务。通过优化目标函数来执行协同训练，对于每个任务，该目标函数包括：奖励项，其表示工人根据任务策略执行相应任务所获得的预期奖励;至少一个熵项，其将任务策略的分配朝着多任务策略的分配进行正则化。

著录项

公开/公告号US2020090048A1

专利类型
公开/公告日2020-03-19

原文格式PDF
申请/专利权人 DEEPMIND TECHNOLOGIES LIMITED;
展开▼

申请/专利号US201916689020
发明设计人 RAZVAN PASCANU;RAIA THAIS HADSELL;VICTOR CONSTANT BAPST;WOJCIECH CZARNECKI;JAMES KIRKPATRICK;YEE WHYE TEH;NICOLAS MANFRED OTTO HEESS;
展开▼

申请日2019-11-19
分类号G06N3/08;G06N3/10;G06N5/04;
国家 US
入库时间 2022-08-21 11:23:17

相似文献

专利
外文文献
中文文献