首页> 外文期刊>電子情報通信学会技術研究報告. 人工知能と知識処理. Artificial Intelligence and Knowledge Based Processing >Multi-agent reinforcement learning method for Markov games: an approach based on the estimation of the environmental model
【24h】

Multi-agent reinforcement learning method for Markov games: an approach based on the estimation of the environmental model

机译:马尔可夫博弈的多主体强化学习方法:一种基于环境模型估计的方法

获取原文
获取原文并翻译 | 示例
       

摘要

In this article, we propose a multi-agent reinforcement learning method for Markov games. In our multi-agent reinforcement learning method, each agent infers the environmental model which consists of the other agents' policies and the state transition function, and estimates the future states by using the inferred environmental model. Each agent conducts its reinforcement learning based on the estimated future states. In order to evaluate our multi-agent reinforcement learning method, we employ the variant of the pursuit problem as a task. Through experiments, we demonstrate that our multi-agent reinforcement learning method is effective.
机译:在本文中,我们提出了一种用于Markov游戏的多主体强化学习方法。在我们的多主体强化学习方法中,每个主体都推断出由其他主体的策略和状态转换函数组成的环境模型,并使用推断的环境模型来估计未来状态。每个代理根据估计的未来状态进行强化学习。为了评估我们的多主体强化学习方法,我们采用了追随问题的变体作为任务。通过实验,我们证明了我们的多主体强化学习方法是有效的。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号