【24h】

Game-theoretic learning algorithm for a spatial coverage problem

机译:空间覆盖问题的博弈论学习算法

获取原文

摘要

In this paper we consider a class of dynamic vehicle routing problems, in which a number of mobile agents in the plane must visit target points generated over time by a stochastic process. It is desired to design motion coordination strategies in order to minimize the expected time between the appearance of a target point and the time it is visited by one of the agents. We cast the problem as a spatial game in which each agent's objective is to maximize the expected value of the "time spent alone" at the next target location and show that the Nash equilibria of the game correspond to the desired agent configurations. We propose learning-based control strategies that, while making minimal or no assumptions on communications between agents as well as the underlying distribution, provide the same level of steady-state performance achieved by the best known decentralized strategies.
机译:在本文中,我们考虑了一类动态车辆路径问题,其中飞机中的许多移动代理必须访问由随机过程随时间产生的目标点。期望设计运动协调策略,以最小化目标点的出现与其中一个特工访问该目标点的时间之间的预期时间。我们将该问题视为一个空间博弈,其中每个特工的目标是在下一个目标位置最大化“单独花费的时间”的预期值,并证明游戏的纳什均衡与所需特工配置相对应。我们提出了基于学习的控制策略,该策略在对代理之间的通信以及底层分布进行最少或没有假设的同时,提供了由最著名的分散策略所达到的相同水平的稳态性能。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号