首页> 外文会议>2011 5th International Conference on Network Games, Control and Optimization >Sensor scheduling for hunting elusive hiding targets via whittle's restless bandit index policy
【24h】

Sensor scheduling for hunting elusive hiding targets via whittle's restless bandit index policy

机译:通过惠特勒的躁动不安的土匪指数策略来搜寻难以捉摸的躲藏目标的传感器调度

获取原文

摘要

We consider a sensor scheduling model where a set of identical sensors are used to hunt a larger set of heterogeneous targets, each of which is located at a corresponding site. Target states change randomly over discrete time slots between “exposed” and “hidden,” according to Markovian transition probabilities that depend on whether sites are searched or not, so as to make the targets elusive. Sensors are imperfect, failing to detect an exposed target when searching its site with a positive misdetection probability. We formulate as a partially observable Markov decision process the problem of scheduling the sensors to search the sites so as to maximize the expected total discounted value of rewards earned (when targets are hunted) minus search costs incurred. Given the intractability of finding an optimal policy, we introduce a tractable heuristic search policy of priority-index type based on the Whittle index for restless bandits. Preliminary computational results are reported showing that such a policy is nearly optimal and can substantially outperform the myopic policy and other simple heuristics.
机译:我们考虑一个传感器调度模型,其中使用一组相同的传感器来搜寻更大的一组异构目标,每个目标位于相应的位置。根据取决于是否搜索站点的马尔可夫转换概率,目标状态在“暴露”和“隐藏”之间的离散时隙上随机变化,从而使目标难以捉摸。传感器是不完善的,当以正误检测概率搜索目标时,无法检测到暴露的目标。我们将调度传感器以搜索站点以便最大化获得的预期总折现价值(当目标被追捕时)减去所产生的搜索成本的问题,拟定为可观察的马尔可夫决策过程。鉴于找到最佳策略的难处理性,我们针对不安定的土匪引入了基于Whittle指数的优先指数类型的可处理启发式搜索策略。据初步计算结果表明,这种策略几乎是最佳的,并且可以大大胜过近视策略和其他简单的启发式算法。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号