首页> 外文期刊>International Journal of Artificial Intelligence Tools: Architectures, Languages, Algorithms >A Multiagent Planning Approach for Cooperative Patrolling with Non-Stationary Adversaries
【24h】

A Multiagent Planning Approach for Cooperative Patrolling with Non-Stationary Adversaries

机译:非静止对手的合作巡逻的多元规划方法

获取原文
获取原文并翻译 | 示例
           

摘要

Multiagent patrolling is the problem faced by a set of agents that have to visit a set of sites to prevent or detect some threats or illegal actions. Although it is commonly assumed that patrollers share a common objective, the issue of cooperation between the patrollers has received little attention. Over the last years, the focus has been put on patrolling strategies to prevent a one-shot attack from an adversary. This adversary is usually assumed to be fully rational and to have full observability of the system. Most approaches are then based on game theory and consists in computing a best response strategy. Nonetheless, when patrolling frontiers, detecting illegal fishing or poaching; patrollers face multiple adversaries with limited observability and rationality. Moreover, adversaries can perform multiple illegal actions over time and space and may change their strategies as time passes. In this paper, we propose a multiagent planning approach that enables effective cooperation between a team of patrollers in uncertain environments. Patrolling agents are assumed to have partial observability of the system. Our approach allows the patrollers to learn a generic and stochastic model of the adversaries based on the history of observations. A wide variety of adversaries can thus be considered with strategies ranging from random behaviors to fully rational and informed behaviors. We show that the multiagent planning problem can be formalized by a non-stationary DEC- POMDP. In order to deal with the non-stationary, we introduce the notion of context. We then describe an evolutionary algorithm to compute patrolling strategies on-line, and we propose methods to improve the patrollers performance.
机译:Multi8gent Parolling是一组代理面临的问题,必须访问一组网站以防止或检测某些威胁或非法行动。虽然普遍认为巡逻队共享共同目标,但巡逻队之间的合作问题收到了很少的关注。在过去几年中,重点是巡逻策略,以防止对手的一次性攻击。通常假设这种对手完全是合理的,并且具有系统的完全可观性。然后,大多数方法都是基于博弈论,并包括计算最好的反应策略。尽管如此,在巡逻边境时,检测非法钓鱼或偷猎;巡逻队面临着有限的可观察性和合理性的对手。此外,对手可以随着时间和空间对多重非法行动进行,并且可以随着时间的推移来改变他们的策略。在这篇论文中,我们提出了一种多元指的规划方法,可以在不确定的环境中实现巡逻队团队之间的有效合作。假设巡逻剂具有系统的部分可观察性。我们的方法允许巡逻器根据观察史学习对手的通用和随机模型。因此,可以考虑各种各样的对手,从随机行为到完全理性和知情行为。我们表明,可以通过非静止的DECDP来形式化多算规划问题。为了处理非静止,我们介绍了上下文的概念。然后,我们描述了一种进化算法来在线计算巡逻策略,我们提出了改善巡逻器性能的方法。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号