首页> 外文期刊>Journal of Advanced Computatioanl Intelligence and Intelligent Informatics >Adaptation Method of the Exploration Ratio Based on the Orientation of Equilibrium in Multi-Agent Reinforcement Learning Under Non-Stationary Environments
【24h】

Adaptation Method of the Exploration Ratio Based on the Orientation of Equilibrium in Multi-Agent Reinforcement Learning Under Non-Stationary Environments

机译:基于非静止环境下多智能体增强学习中均衡方向的勘探率的适应方法

获取原文
获取原文并翻译 | 示例
       

摘要

In this paper, we propose a method to adapt the exploration ratio in multi-agent reinforcement learning. The adaptation of exploration ratio is important in multi-agent learning, as this is one of key parameters that affect the learning performance. In our observation, the adaptation method can adjust the exploration ratio suitably (but not optimally) according to the characteristics of environments. We investigated the evolutionarily adaptation of the exploration ratio in multi-agent learning. We conducted several experiments to adapt the exploration ratio in a simple evolutionary way, namely, mimicking advantageous exploration ratio (MAER), and confirmed that MAER always acquires relatively lower exploration ratio than the optimal value for the change ratio of the environments. In this paper, we propose a second evolutionary adaptation method, namely, win or update exploration ratio (WoUE). The results of the experiments showed that WoUE can acquire a more suitable exploration ratio than MAER, and the obtained ratio was near-optimal.
机译:在本文中,我们提出了一种方法来调整多助理强化学习中的勘探比。探索比的适应在多智能体学习中很重要,因为这是影响学习性能的关键参数之一。在我们观察中,适应方法可以根据环境的特性适当地(但未最佳地)调整勘探比。我们调查了多助理学习中勘探比的进化调整。我们进行了几个实验,以简单的进化方式调整勘探比,即模仿有利的探索比(MAER),并确认MAER总是比环境变化比的最佳值获得相对较低的勘探比。在本文中,我们提出了第二个进化适应方法,即赢或更新探索比(Woue)。实验结果表明,Woue可以获得比MAER更合适的勘探比,并且获得的比率近乎最佳。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号