首页> 外文会议>Unmanned Systems Technology Conference >Game Theory Based Framework for Agent Decision-Making in Communication Constrained Environments
【24h】

Game Theory Based Framework for Agent Decision-Making in Communication Constrained Environments

机译:基于博弈论的经理决策在通信受限环境中的框架

获取原文

摘要

We have designed a game theory based framework in order to compute effective agent asset laydown and courses of action (COAs) for adversarial scenarios. Our technical approach is based on Stackelberg security game theory, which is a specialization of game theory for adversarial situations and deterrence. Security games approaches provide a scalable optimization framework to determine geospatial COAs for agents. Specifically it can exploit intelligence about adversaries by constraining courses of action search and eliminating dominated COAs. Several issues arise in providing a game theory for a communication-constrained environment. Current minimax payoff models for sensing an adversary/obstacles consider the probability for the defender to sense an adversary. For limited acoustic sensing, this term now has path dependence such as building interiors and areas with transmission issues. Next, an extension to account for loss or degradation of defender assets is required. A candidate solution being considered is to have an agent update/choose degraded contingency strategies at each communication. We are also evaluating providing refined strategies as a function of time if communications are out and how to account for effect of uncertainty in our knowledge of agent member loss for updated strategies. We are employing simulators at NRL DC to model multiagent trajectories and allowing the testing of the game theory approach based on environmental conditions.
机译:我们设计了一种基于博弈论的博物馆,以计算有效的代理资产裁员和对抗方案的行动课程(COAS)。我们的技术方法是基于Stackelberg安全游戏理论,这是对抗性情况和威慑的博弈论的专业化。安全游戏方法提供可扩展的优化框架,以确定代理的地理空间COA。具体而言,它可以通过限制行动搜索和消除占主导地位的COA的课程来利用对对手的智能。提供若干问题在为通信受限环境提供博弈论时出现。目前,用于传感对手/障碍的最低资金支付模型考虑了捍卫者的概率来感知对手。对于有限的声学感应,现在该术语现在具有路径依赖性,例如构建内部和具有传输问题的区域。接下来,需要一个延期,以偿还后卫资产的损失或退化。被认为的候选解决方案是在每个通信中具有代理更新/选择退化的应急策略。如果通信已出版以及如何在我们对更新策略的知识中的了解情况下,我们还评估提供完整的策略。我们在NRL DC采用模拟器来模拟多轴轨迹,并根据环境条件测试博弈论方法。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号