Multi-agent reinforcement learning method for Markov games: an approach based on the estimation of the environmental model

Yasuo Nagayuki; Minoru Ito

首页> 外文期刊>電子情報通信学会技術研究報告. オフィスシステム >Multi-agent reinforcement learning method for Markov games: an approach based on the estimation of the environmental model

【24h】

Multi-agent reinforcement learning method for Markov games: an approach based on the estimation of the environmental model

机译：Markov Games的多功能增强学习方法：一种基于环境模型估计的方法

获取原文

获取原文并翻译 | 示例

掌桥外文数据库（机构版） >>

开具论文收录证明 >>

文献代查 >>

页面导航

摘要
著录项
相似文献
相关主题

摘要

In this article, we propose a multi-agent reinforcement learning method for Markov games. In our multi-agent reinforcement learning method, each agent infers the environmental model which consists of the other agents' policies and the state transition function, and estimates the future states by using the inferred environmental model. Each agent conducts its reinforcement learning based on the estimated future states. In order to evaluate our multi-agent reinforcement learning method, we employ the variant of the pursuit problem as a task. Through experiments, we demonstrate that our multi-agent reinforcement learning method is effective.

机译：在本文中，我们为马尔可夫游戏提出了一种多功能加强学习方法。在我们的多功能增强学习方法中，每个代理商都是由其他代理商的政策和国家转型函数组成的环境模型，并通过使用推断的环境模型估计未来状态。每个代理基于估计的未来州进行其强化学习。为了评估我们的多功能增强学习方法，我们采用了追求问题的变体作为任务。通过实验，我们证明了我们的多功能加强学习方法是有效的。

著录项

来源
《電子情報通信学会技術研究報告. オフィスシステム》 |2001年第208期|共8页
作者
Yasuo Nagayuki; Minoru Ito;
展开▼
作者单位

展开▼
收录信息
原文格式 PDF
正文语种 jpn
中图分类计算机软件;
关键词
Multi-agent reinforcement learning; TD learning; Environmental model; Markov game; Pursuit problem;

机译：多功能钢筋学习;TD学习;环境模型;马尔可夫游戏;追求问题;

相似文献

外文文献
中文文献
专利

1. Multi-agent reinforcement learning method for Markov games: an approach based on the estimation of the environmental model [J] . Yasuo Nagayuki, Minoru Ito 電子情報通信学会技術研究報告. オフィスシステム . 2001,第208期

机译：马尔可夫博弈的多主体强化学习方法：一种基于环境模型估计的方法
2. Multi-agent reinforcement learning method for Markov games: an approach based on the estimation of the environmental model [J] . Yasuo Nagayuki, Minoru Ito 電子情報通信学会技術研究報告. オフィスシステム . 2001,第208期

机译：Markov Games的多功能增强学习方法：一种基于环境模型估计的方法
3. Multi-agent reinforcement learning method for Markov games: an approach based on the estimation of the environmental model [J] . Yasuo Nagayuki, Minoru Ito 電子情報通信学会技術研究報告. 人工知能と知識処理. Artificial Intelligence and Knowledge Based Processing . 2001,第210期

机译：Markov Games的多功能增强学习方法：一种基于环境模型估计的方法
4. Extended Markov Games to Learn Multiple Tasks in Multi-Agent Reinforcement Learning [C] . Borja G. Leon, Francesco Belardinelli European Conference on Artificial Intelligence;Conference on Prestigious Applications of Intelligent Systems . 2020

机译：扩展马尔可夫游戏以了解多智能经纪增强学习中的多项任务
5. Multi-agent reinforcement learning in Markov games. [D] . Sheppard, John Wilbur. 1997

机译：马尔可夫游戏中的多主体强化学习。
6. Multi-agent reinforcement learning with approximate model learning for competitive games [O] . Young Joon Park, Yoon Sang Cho, Seoung Bum Kim 2012

机译：多主体强化学习和近似模型学习的竞技游戏
7. A zero-sum Markov Defender-attacker Game for Modeling False Pricing in Smart Grids and its Solution by Multi-agent Reinforcement Learning [O] . Daogui Tang, Yi-Ping Fang, Enrico Zio 2019

机译：一种零级马尔可夫防御者攻击者，用于通过多智能经纪增强学习在智能电网和解决方案中建模虚假定价

Multi-agent reinforcement learning method for Markov games: an approach based on the estimation of the environmental model

摘要

著录项

相似文献

相关主题

期刊订阅