首页> 外文会议>International Conference on Autonomous Agents and Multiagent Systems >Using Reward/Utility Based Impact Scores in Partitioning
【24h】

Using Reward/Utility Based Impact Scores in Partitioning

机译:在分区中使用基于奖励/实用程序的影响分数

获取原文

摘要

Reinforcement learning with reward shaping is a well-established but often computationally expensive approach to multiagent problems. Agent partitioning can assist in this computational complexity by treating each partition of agents as an independent problem. We introduce a novel agent partitioning approach called Reward/Utility-Based Impact (RUBI). RUBI finds an effective partitioning of agents while requiring no prior domain knowledge, provides better performance by discovering a non-trivial agent partitioning, and leads to faster simulations. We test RUBI in the Air Traffic Flow Management Problem, where there are simultaneously tens of thousands of aircraft affecting the system and no intuitive similarity metric between agents. When partitioning with RUBI in the ATFMP, there is a 37% increase in performance, with a 510x speed up per simulation step over non-partitioning approaches.
机译:奖励塑造的加固学习是一种很好的,但经常是多层问题的计算昂贵的方法。代理分区可以通过将每个代理分区视为独立问题来帮助这种计算复杂度。我们介绍一种名为奖励/实用的影响的新代理分区方法(RUBI)。 RUBI在不需要现有域知识的同时找到有效分区,同时要求未经证实的域知识提供更好的性能,并导致更快的模拟。我们在空中交通流量管理问题中测试鲁西,其中有几万飞机影响系统,代理之间没有直观相似度量。当在ATFMP中使用RUBI进行分区时,性能增加37%,每次模拟步骤速度为510倍,通过非分区方法加速。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号