Journal: 《系统工程与电子技术:英文版》 (Journal of Systems Engineering and Electronics, English edition)

Multi-agent reinforcement learning based on policies of global objective

         

Abstract

In general-sum games, taking the collective rationality of all agents into account, we define a global objective for the agents and propose a novel multi-agent reinforcement learning (RL) algorithm based on a global policy. In each learning step, all agents commit to selecting the global policy in order to achieve the global goal. We prove that the algorithm converges under certain restrictions on the stage games defined by the learned Q-values, and show that its computational time complexity is considerably lower than that of previously developed multi-agent learning algorithms for general-sum games. An example is analyzed to illustrate the algorithm's merits.
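The abstract does not state how the global objective is defined, so the following is only an illustrative sketch under a loud assumption: the global objective is taken here to be the sum of all agents' Q-values for a joint action. The function names `global_joint_action` and `q_update`, the tabular Q representation, and the parameters `alpha` and `gamma` are hypothetical choices for illustration, not the paper's own formulation.

```python
from collections import defaultdict
from itertools import product


def global_joint_action(q_tables, state, joint_actions):
    """Return the joint action that maximizes the ASSUMED global objective:
    the sum of every agent's Q-value for that joint action in `state`."""
    return max(joint_actions,
               key=lambda a: sum(q[(state, a)] for q in q_tables))


def q_update(q_tables, state, joint_action, rewards, next_state,
             joint_actions, alpha=0.1, gamma=0.9):
    """One tabular learning step for all agents.

    Every agent bootstraps from the same globally chosen joint action in
    `next_state`, so no per-agent equilibrium of the stage game is computed.
    """
    next_joint = global_joint_action(q_tables, next_state, joint_actions)
    for q, reward in zip(q_tables, rewards):
        target = reward + gamma * q[(next_state, next_joint)]
        q[(state, joint_action)] += alpha * (target - q[(state, joint_action)])


# Two agents, two actions each; one Q-table per agent over joint actions.
joint_actions = list(product([0, 1], repeat=2))
q_tables = [defaultdict(float), defaultdict(float)]
q_update(q_tables, "s0", (0, 1), rewards=(1.0, 0.5),
         next_state="s1", joint_actions=joint_actions)
```

Under this assumption, choosing the joint action costs a single pass over the joint-action set per step, which is one plausible reading of why a global-policy method could be cheaper than methods that solve for an equilibrium of each stage game.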
