首页> 外文期刊>Advances in complex systems >LEARNING FROM ACTIONS NOT TAKEN IN MULTIAGENT SYSTEMS
【24h】

LEARNING FROM ACTIONS NOT TAKEN IN MULTIAGENT SYSTEMS

机译:从多主体系统中未采取的行动中学习

获取原文
获取原文并翻译 | 示例
       

摘要

In large cooperative multiagent systems, coordinating the actions of the agents is critical to the overall system achieving its intended goal. Even when the agents aim to cooperate, ensuring that the agent actions lead to good system level behavior becomes increasingly difficult as systems become larger. One of the fundamental difficulties in such multiagent systems is the slow learning process where an agent not only needs to learn how to behave in a complex environment, but also needs to account for the actions of other learning agents. In this paper, we present a multiagent learning approach that significantly improves the learning speed in multiagent systems by allowing an agent to update its estimate of the rewards (e.g. value function in reinforcement learning) for all its available actions, not just the action that was taken. This approach is based on an agent estimating the counterfactual reward it would have received had it taken a particular action. Our results show that the rewards on such "actions not taken" are beneficial early in training, particularly when only particular "key" actions are used. We then present results where agent teams are leveraged to estimate those rewards. Finally, we show that the improved learning speed is critical in dynamic environments where fast learning is critical to tracking the underlying processes.
机译:在大型协作多代理系统中,协调代理的动作对于整个系统实现其预期目标至关重要。即使当代理程序旨在合作时,随着系统的变大,确保代理程序行为导致良好的系统级行为也变得越来越困难。这种多主体系统中的基本困难之一是学习过程缓慢,其中主体不仅需要学习如何在复杂的环境中进行行为,还需要考虑其他学习主体的行为。在本文中,我们提出了一种多主体学习方法,该方法通过允许主体更新其对所有可用动作的奖励估算(例如强化学习中的价值函数),而不仅是以前的动作,从而显着提高了多主体系统的学习速度。采取。这种方法是基于代理估算的,如果采取了特定的行动,它将获得反事实的奖励。我们的结果表明,对此类“未采取的行动”的奖励在训练初期是有益的,尤其是在仅使用特定的“关键”行动时。然后,我们在代理团队被用来估计这些奖励的情况下给出结果。最后,我们证明了提高的学习速度在动态环境中至关重要,在动态环境中,快速学习对于跟踪基础过程至关重要。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号