首页> 外文OA文献 >Sensor scheduling for hunting elusive hiding targets: a restless bandit index policy
【2h】

Sensor scheduling for hunting elusive hiding targets: a restless bandit index policy

机译:用于搜寻难以捉摸的隐藏目标的传感器调度:不安全的强盗指数策略

摘要

We consider a sensor scheduling model where a set of identical sensors are used to hunt a larger set of heterogeneous targets, each of which is located at a corresponding site. Target states change randomly over discrete time slots between “exposed” and ‘hidden,” according to Markovian transition probabilities that depend on whether sites are searched or not, so as to make the targets elusive. Sensors are imperfect, failing to detect an exposed target when searching its site with a positive misdetection probability. We formulate as a partially observable Markov decision process the problem of scheduling the sensors to search the sites so as to maximize the expected total discounted value of rewards earned (when targets are hunted) minus search costs incurred. Given the intractability of finding an optimal policy, we introduce a tractable heuristic search policy of priority index type based on the Whittle’s index for restless bandits. Preliminary computational results are reported showing that such a policy is nearly optimal and can substantially outperform the myopic policy and other simple heuristics.
机译:我们考虑一个传感器调度模型,其中使用一组相同的传感器来搜寻更大的一组异构目标,每个目标位于相应的位置。根据马尔可夫变迁概率,目标状态在“暴露”和“隐藏”之间的离散时隙上随机变化,这取决于是否搜索站点,从而使目标难以捉摸。传感器是不完善的,当以正误检测概率搜索目标时,无法检测到暴露的目标。我们将调度传感器以搜索站点的问题公式化为部分可观察到的马尔可夫决策过程,以使获得的奖励(当目标被追捕时)减去所产生的搜索费用的预期总折现值最大化。鉴于找到最佳策略的难处理性,我们基于惠特尔针对不安定土匪的索引,引入了优先索引类型的可处理启发式搜索策略。据初步计算结果表明,这种策略几乎是最佳的,并且可以大大胜过近视策略和其他简单的启发式算法。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号