Adaptation Method of the Exploration Ratio Based on the Orientation of Equilibrium in Multi-Agent Reinforcement Learning Under Non-Stationary Environments

Takuya Okano; Itsuki Noda


Abstract

In this paper, we propose a method to adapt the exploration ratio in multi-agent reinforcement learning. Adapting the exploration ratio is important in multi-agent learning because it is one of the key parameters that affect learning performance. In our observation, the adaptation method can adjust the exploration ratio suitably (but not optimally) according to the characteristics of the environment. We investigated the evolutionary adaptation of the exploration ratio in multi-agent learning. We conducted several experiments that adapt the exploration ratio in a simple evolutionary way, namely, mimicking advantageous exploration ratio (MAER), and confirmed that MAER always acquires an exploration ratio that is relatively lower than the optimal value for the change ratio of the environment. We therefore propose a second evolutionary adaptation method, namely, win or update exploration ratio (WoUE). The experimental results showed that WoUE can acquire a more suitable exploration ratio than MAER, and that the obtained ratio was near-optimal.
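The abstract does not give the update rules of MAER or WoUE, so the following is only an illustrative sketch of the general idea of adapting an exploration ratio by imitation. It assumes that the exploration ratio is the epsilon of an epsilon-greedy policy, and the function `mimic_step`, its `noise` parameter, and the "copy from a randomly chosen better-performing agent" rule are hypothetical choices made for this example, not the authors' algorithm.

```python
import random


def epsilon_greedy(q_values, epsilon, rng):
    """Pick a random action with probability epsilon, otherwise a greedy one.

    Assumption: the 'exploration ratio' is modelled as epsilon here.
    """
    if rng.random() < epsilon:
        return rng.randrange(len(q_values))
    return max(range(len(q_values)), key=lambda a: q_values[a])


def mimic_step(epsilons, rewards, rng, noise=0.01):
    """Hypothetical imitation update (not the paper's MAER rule).

    Each agent compares itself with one randomly chosen agent and, if that
    agent earned strictly more reward, copies its epsilon plus a small
    uniform perturbation, clipped to [0, 1].
    """
    new_epsilons = list(epsilons)
    n = len(epsilons)
    for i in range(n):
        j = rng.randrange(n)
        if rewards[j] > rewards[i]:
            candidate = epsilons[j] + rng.uniform(-noise, noise)
            new_epsilons[i] = min(1.0, max(0.0, candidate))
    return new_epsilons


if __name__ == "__main__":
    rng = random.Random(0)
    epsilons = [0.05, 0.2, 0.5]
    rewards = [1.0, 3.0, 2.0]  # made-up rewards, for illustration only
    print(mimic_step(epsilons, rewards, rng))
```

Under this kind of rule, exploration ratios that happen to earn more reward spread through the population. The abstract reports that a simple mimicking scheme of this flavor settles on a ratio lower than the optimal one, which is the motivation it gives for WoUE.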


Bibliographic details

Source: Journal of Advanced Computational Intelligence and Intelligent Informatics, 2017, Issue 125, 9 pages
Authors: Takuya Okano; Itsuki Noda
Affiliations: Fujitsu Limited; National Institute of Advanced Industrial Science and Technology (AIST)
Original format: PDF
Language: English
CLC classification: Other computing
Keywords: Reinforcement learning; Exploration ratio; Multi-agent learning

