首页> 外文会议>International Conference on Network Games, Control and Optimization >Sensor scheduling for hunting elusive hiding targets via whittle's restless bandit index policy
【24h】

Sensor scheduling for hunting elusive hiding targets via whittle's restless bandit index policy

机译:传感器调度,用于狩猎难以捉摸的隐藏目标通过薄片的不安的强盗指数政策

获取原文

摘要

We consider a sensor scheduling model where a set of identical sensors are used to hunt a larger set of heterogeneous targets, each of which is located at a corresponding site. Target states change randomly over discrete time slots between “exposed” and “hidden,” according to Markovian transition probabilities that depend on whether sites are searched or not, so as to make the targets elusive. Sensors are imperfect, failing to detect an exposed target when searching its site with a positive misdetection probability. We formulate as a partially observable Markov decision process the problem of scheduling the sensors to search the sites so as to maximize the expected total discounted value of rewards earned (when targets are hunted) minus search costs incurred. Given the intractability of finding an optimal policy, we introduce a tractable heuristic search policy of priority-index type based on the Whittle index for restless bandits. Preliminary computational results are reported showing that such a policy is nearly optimal and can substantially outperform the myopic policy and other simple heuristics.
机译:我们考虑一种传感器调度模型,其中使用一组相同的传感器来寻找更大的异构目标,每个目标位于相应的位点。目标状态随机更改在“暴露”和“隐藏”之间的离散时隙,根据Markovian转换概率,这取决于是否搜索站点,以使目标难以捉摸。传感器是不完美的,在以积极的误差概率搜索其网站时,无法检测到暴露的目标。我们作为一个部分可观察的马尔可夫决策过程调度传感器搜索网站的问题,以便最大化所获得的奖励的预期总折扣价值(当目标被捕时)减去搜索费。鉴于找到最佳政策的诡计,我们根据不安的匪徒的薄片指数介绍了优先级索引类型的贸易启发式搜索策略。据报道,初步计算结果表明,这种政策几乎是最佳的,并且可以大大倾向于近视政策和其他简单的启发式。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号