首页> 外文会议>AAAI Conference on Artificial Intelligence >Determinantal Reinforcement Learning
【24h】

Determinantal Reinforcement Learning

机译:决定性加强学习

获取原文

摘要

We study reinforcement learning for controlling multiple agents in a collaborative manner. In some of those tasks, it is insufficient for the individual agents to take relevant actions, but those actions should also have diversity. We propose the approach of using the determinant of a positive semidefinite matrix to approximate the action-value function in reinforcement learning, where we learn the matrix in a way that it represents the relevance and diversity of the actions. Experimental results show that the proposed approach allows the agents to learn a nearly optimal policy approximately ten times faster than baseline approaches in benchmark tasks of multi-agent reinforcement learning. The proposed approach is also shown to achieve the performance that cannot be achieved with conventional approaches in partially observable environment with exponentially large action space.
机译:我们以协同方式研究加固学习,用于控制多个代理。 在其中一些任务中,个人代理人不足以采取相关行动,但这些行动也应该具有多样性。 我们提出了使用正半纤维矩阵的决定因素的方法来近似于加强学习中的动作值函数,在那里我们以代表行动的相关性和多样性的方式学习矩阵。 实验结果表明,该拟议的方法允许代理人在多智能体加固学习基准任务中的基准方法速度快大约最佳政策大约最佳的政策。 还显示了所提出的方法,以实现具有常规方法的常规方法无法实现的性能,其具有指数大的动作空间。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号