首页> 外国专利> MULTI-TASK NEURAL NETWORK SYSTEMS WITH TASK-SPECIFIC POLICIES AND A SHARED POLICY

MULTI-TASK NEURAL NETWORK SYSTEMS WITH TASK-SPECIFIC POLICIES AND A SHARED POLICY

机译:具有任务特定策略和共享策略的多任务神经网络系统

摘要

A method is proposed for training a multitask computer system, such as a multitask neural network system. The system comprises a set of trainable workers and a shared module. The trainable workers and shared module are trained on a plurality of different tasks, such that each worker learns to perform a corresponding one of the tasks according to a respective task policy, and said shared policy network learns a multitask policy which represents common behavior for the tasks. The coordinated training is performed by optimizing an objective function comprising, for each task: a reward term indicative of an expected reward earned by a worker in performing the corresponding task according to the task policy; and at least one entropy term which regularizes the distribution of the task policy towards the distribution of the multitask policy.
机译:提出了一种用于训练诸如多任务神经网络系统的多任务计算机系统的方法。该系统包括一组可训练的工人和一个共享模块。可训练的工作人员和共享模块在多个不同的任务上受到训练,以使每个工作人员都学会根据各自的任务策略执行相应的任务之一,并且所述共享策略网络学习一个多任务策略,该策略代表该任务的通用行为。任务。通过优化目标函数来执行协同训练,对于每个任务,该目标函数包括:奖励项,其表示工人根据任务策略执行相应任务所获得的预期奖励;至少一个熵项,其将任务策略的分配朝着多任务策略的分配进行正则化。

著录项

相似文献

  • 专利
  • 外文文献
  • 中文文献
获取专利

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号