首页> 外文OA文献 >Parallel reinforcement learning for weighted multi-criteria model with adaptive margin
【2h】

Parallel reinforcement learning for weighted multi-criteria model with adaptive margin

机译:自适应余量加权多准则模型的并行强化学习

摘要

Reinforcement learning (RL) for a linear family of tasks is described in this paper. The key of our discussion is nonlinearity of the optimal solution even if the task family is linear; we cannot obtain the optimal policy using a naive approach. Although an algorithm exists for calculating the equivalent result to Q-learning for each task simultaneously, it presents the problem of explosion of set sizes. We therefore introduce adaptive margins to overcome this difficulty.
机译:本文介绍了线性任务系列的强化学习(RL)。我们讨论的重点是即使任务族是线性的,最优解的非线性也是如此。我们不能用幼稚的方法来获得最优政策。尽管存在用于同时计算每个任务的等效于Q学习的结果的算法,但它存在集合大小爆炸的问题。因此,我们引入了自适应余量来克服这一困难。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号