首页> 外国专利> Multi-task neural network systems with task-specific policies and a shared policy

Multi-task neural network systems with task-specific policies and a shared policy

机译：具有任务特定策略和共享策略的多任务神经网络系统

页面导航

摘要
著录项
相似文献

摘要

A method is proposed for training a multitask computer system, such as a multitask neural network system. The system comprises a set of trainable workers and a shared module. The trainable workers and shared module are trained on a plurality of different tasks, such that each worker learns to perform a corresponding one of the tasks according to a respective task policy, and said shared policy network learns a multitask policy which represents common behavior for the tasks. The coordinated training is performed by optimizing an objective function comprising, for each task: a reward term indicative of an expected reward earned by a worker in performing the corresponding task according to the task policy; and at least one entropy term which regularizes the distribution of the task policy towards the distribution of the multitask policy.

机译：提出了一种用于训练多任务计算机系统的方法，例如多任务神经网络系统。该系统包括一组可培训工人和共享模块。可训练的工人和共享模块在多个不同的任务中训练，使得每个工作人员根据相应的任务策略学习执行相应的一个任务，并且所述共享策略网络了解一个多任务策略，该策略表示常见行为任务。通过优化每个任务的目标函数来执行协调培训：指示工作策略执行相应任务时赢得预期奖励的奖励术语; 并且至少一个熵项，它将任务策略的分布进行了规范，以朝多任务策略分发。

著录项

公开/公告号US11132609B2

专利类型
公开/公告日2021-09-28

原文格式PDF
申请/专利权人 DEEPMIND TECHNOLOGIES LIMITED;
展开▼

申请/专利号US201916689020
发明设计人 RAZVAN PASCANU;RAIA THAIS HADSELL;VICTOR CONSTANT BAPST;WOJCIECH CZARNECKI;JAMES KIRKPATRICK;YEE WHYE TEH;NICOLAS MANFRED OTTO HEESS;
展开▼

申请日2019-11-19
分类号G06N3/08;G06N3/10;G06N5/04;
国家 US
入库时间 2022-08-24 21:18:22

相似文献

专利
外文文献
中文文献