首页> 中文期刊> 《控制理论与应用:英文版》 >Distributed multi-agent temporal-difference learning with full neighbor information

Distributed multi-agent temporal-difference learning with full neighbor information

摘要

This paper presents a novel distributed multi-agent temporal-difference learning framework for value function approximation,which alows agents using all the neighbor information instead of the information from only one neighbor.With full neighbor information,the proposed framework(1)has a faster convergence rate,and(2)is more robust compared to the state of-the art approaches.Then we propose a distributed multi-agent discounted temporal dfferene algorithm and a distributed muli-agent average cost temporal diference leaming algorithm based on th framework.Moreover,the two proposed algorthms'theoretical convergence proofs are provided.Numerical simulation resuts show that our proposed algorihms are superior to the gossip-based algorithm in convergence speed,robustness to noise and time-varying network topology.

著录项

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号