Sensor scheduling for hunting elusive hiding targets via Whittle's restless bandit index policy


Abstract

We consider a sensor scheduling model where a set of identical sensors are used to hunt a larger set of heterogeneous targets, each of which is located at a corresponding site. Target states change randomly over discrete time slots between “exposed” and “hidden,” according to Markovian transition probabilities that depend on whether sites are searched or not, so as to make the targets elusive. Sensors are imperfect, failing to detect an exposed target when searching its site with a positive misdetection probability. We formulate as a partially observable Markov decision process the problem of scheduling the sensors to search the sites so as to maximize the expected total discounted value of rewards earned (when targets are hunted) minus search costs incurred. Given the intractability of finding an optimal policy, we introduce a tractable heuristic search policy of priority-index type based on the Whittle index for restless bandits. Preliminary computational results are reported showing that such a policy is nearly optimal and can substantially outperform the myopic policy and other simple heuristics.
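The mechanics described in the abstract can be illustrated with a minimal Python sketch. It is not the authors' formulation: the abstract gives no formulas, so the `Site` fields, the order of events within a slot (search, then transition), the assumption that a hidden target is never detected, and the rule of always searching exactly `n_sensors` sites are all assumptions made here for illustration. The function passed as `index_fn` is a placeholder; computing the actual Whittle index is not shown.

```python
from dataclasses import dataclass
from typing import Callable, List, Sequence


@dataclass(frozen=True)
class Site:
    """Hypothetical per-site parameters (names are illustrative, not from the paper)."""
    p_he_idle: float    # P(hidden -> exposed) when the site is not searched
    p_ee_idle: float    # P(exposed -> exposed) when the site is not searched
    p_he_search: float  # P(hidden -> exposed) when the site is searched
    p_ee_search: float  # P(exposed -> exposed) when the site is searched
    miss: float         # probability a search misses an exposed target


def next_belief(site: Site, belief: float, searched: bool) -> float:
    """Probability the target is exposed next slot, given no detection this slot.

    Assumes the search outcome is observed before the state transition and
    that a hidden target can never be detected.
    """
    if not searched:
        return belief * site.p_ee_idle + (1.0 - belief) * site.p_he_idle
    # Bayes update on "searched and found nothing".
    denom = belief * site.miss + (1.0 - belief)
    posterior = belief * site.miss / denom if denom > 0 else 0.0
    return posterior * site.p_ee_search + (1.0 - posterior) * site.p_he_search


def select_sites(
    beliefs: Sequence[float],
    index_fn: Callable[[int, float], float],
    n_sensors: int,
) -> List[int]:
    """Priority-index rule: search the n_sensors sites with the largest index."""
    ranked = sorted(
        range(len(beliefs)),
        key=lambda i: index_fn(i, beliefs[i]),
        reverse=True,
    )
    return sorted(ranked[:n_sensors])
```

For example, with `site = Site(0.2, 0.9, 0.1, 0.5, 0.25)`, calling `next_belief(site, 0.5, searched=True)` first lowers the belief to the posterior after an unsuccessful search and then applies the searched-site transition probabilities. Passing `lambda i, b: b` as `index_fn` ranks sites by current belief alone, which is only a stand-in for the Whittle index the paper uses.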


Bibliographic record

Source
2011 5th International Conference on Network Games, Control and Optimization | 2011 | pp. 1-8 | 8 pages
Authors
Nino-Mora Jose; Villar Sofia S.
Original format: PDF
Chinese Library Classification: Computer networks

Similar literature

1. Optimal Policies for a Class of Restless Multiarmed Bandit Scheduling Problems with Applications to Sensor Management [J]. R. Washburn, M. Schneider. Journal of Advances in Information Fusion. 2008, No. 1
2. Opportunistic Scheduling Revisited Using Restless Bandits: Indexability and Index Policy [J]. Wang Kehao, Yu Jihong, Chen Lin. IEEE Transactions on Wireless Communications. 2019, No. 10
3. Indexability of Restless Bandit Problems and Optimality of Whittle Index for Dynamic Multichannel Access [J]. Liu K., Zhao Q. IEEE Transactions on Information Theory. 2010, No. 11
4. Sensor scheduling for hunting elusive hiding targets via Whittle's restless bandit index policy [C]. Nino-Mora Jose, Villar Sofia S. International Conference on Network Games, Control and Optimization. 2011
5. Stochastic optimization over parallel queues: Channel-blind scheduling, restless bandit, and optimal delay [D]. Li, Chih-ping. 2011
6. Indexability and Optimal Index Policies for a Class of Reinitialising Restless Bandits [O]. Sofía S. Villar
7. Sensor scheduling for hunting elusive hiding targets: a restless bandit index policy [O]. Niño-Mora José, Villar Sofía S. 2012
8. Myopic Policy for a Class of Restless Bandit Problems with Applications in Dynamic Multichannel Access [R]. Liu, K., Zhao, Q. 2009
