Distributed multi-agent temporal-difference learning with full neighbor information

Zhinan Peng; Jiangping Hu; Rui Luo; Bijoy K.Ghosh

首页> 中文期刊> 《控制理论与应用：英文版》 >Distributed multi-agent temporal-difference learning with full neighbor information

Distributed multi-agent temporal-difference learning with full neighbor information

页面导航

摘要
著录项
引文网络
相关主题

摘要

This paper presents a novel distributed multi-agent temporal-difference learning framework for value function approximation,which alows agents using all the neighbor information instead of the information from only one neighbor.With full neighbor information,the proposed framework(1)has a faster convergence rate,and(2)is more robust compared to the state of-the art approaches.Then we propose a distributed multi-agent discounted temporal dfferene algorithm and a distributed muli-agent average cost temporal diference leaming algorithm based on th framework.Moreover,the two proposed algorthms'theoretical convergence proofs are provided.Numerical simulation resuts show that our proposed algorihms are superior to the gossip-based algorithm in convergence speed,robustness to noise and time-varying network topology.

著录项

来源
《控制理论与应用：英文版》 |2020年第4期|379-389|共11页
作者
Zhinan Peng; Jiangping Hu; Rui Luo; Bijoy K.Ghosh;
展开▼
作者单位

School of Automation Engineering;

University of Electronic Science and Technology of China;

Chengdu 611731;

Sichuan;

China;

Department of Mathematics and Statistics;

Texas Tech University;

Lubbock;

TX 79409-1042;

USA;

展开▼
原文格式 PDF
正文语种 chi
中图分类计算技术、计算机技术;
关键词
Distributed algorithm; Reinforcement learning; Temprel-lifferene learning; Multi-agent systems;

Distributed multi-agent temporal-difference learning with full neighbor information

摘要

著录项

引文网络

相关主题

期刊订阅