首页> 外国专利> Multi-task neural network systems with task-specific policies and a shared policy

Multi-task neural network systems with task-specific policies and a shared policy

机译:具有任务特定策略和共享策略的多任务神经网络系统

摘要

A method is proposed for training a multitask computer system, such as a multitask neural network system. The system comprises a set of trainable workers and a shared module. The trainable workers and shared module are trained on a plurality of different tasks, such that each worker learns to perform a corresponding one of the tasks according to a respective task policy, and said shared policy network learns a multitask policy which represents common behavior for the tasks. The coordinated training is performed by optimizing an objective function comprising, for each task: a reward term indicative of an expected reward earned by a worker in performing the corresponding task according to the task policy; and at least one entropy term which regularizes the distribution of the task policy towards the distribution of the multitask policy.
机译:提出了一种用于训练多任务计算机系统的方法,例如多任务神经网络系统。 该系统包括一组可培训工人和共享模块。 可训练的工人和共享模块在多个不同的任务中训练,使得每个工作人员根据相应的任务策略学习执行相应的一个任务,并且所述共享策略网络了解一个多任务策略,该策略表示常见行为 任务。 通过优化每个任务的目标函数来执行协调培训:指示工作策略执行相应任务时赢得预期奖励的奖励术语; 并且至少一个熵项,它将任务策略的分布进行了规范,以朝多任务策略分发。

著录项

相似文献

  • 专利
  • 外文文献
  • 中文文献
获取专利

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号