Sensor scheduling for hunting elusive hiding targets via whittle's restless bandit index policy

机译：传感器调度，用于狩猎难以捉摸的隐藏目标通过薄片的不安的强盗指数政策

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

We consider a sensor scheduling model where a set of identical sensors are used to hunt a larger set of heterogeneous targets, each of which is located at a corresponding site. Target states change randomly over discrete time slots between “exposed” and “hidden,” according to Markovian transition probabilities that depend on whether sites are searched or not, so as to make the targets elusive. Sensors are imperfect, failing to detect an exposed target when searching its site with a positive misdetection probability. We formulate as a partially observable Markov decision process the problem of scheduling the sensors to search the sites so as to maximize the expected total discounted value of rewards earned (when targets are hunted) minus search costs incurred. Given the intractability of finding an optimal policy, we introduce a tractable heuristic search policy of priority-index type based on the Whittle index for restless bandits. Preliminary computational results are reported showing that such a policy is nearly optimal and can substantially outperform the myopic policy and other simple heuristics.

机译：我们考虑一种传感器调度模型，其中使用一组相同的传感器来寻找更大的异构目标，每个目标位于相应的位点。目标状态随机更改在“暴露”和“隐藏”之间的离散时隙，根据Markovian转换概率，这取决于是否搜索站点，以使目标难以捉摸。传感器是不完美的，在以积极的误差概率搜索其网站时，无法检测到暴露的目标。我们作为一个部分可观察的马尔可夫决策过程调度传感器搜索网站的问题，以便最大化所获得的奖励的预期总折扣价值（当目标被捕时）减去搜索费。鉴于找到最佳政策的诡计，我们根据不安的匪徒的薄片指数介绍了优先级索引类型的贸易启发式搜索策略。据报道，初步计算结果表明，这种政策几乎是最佳的，并且可以大大倾向于近视政策和其他简单的启发式。

著录项

来源
《International Conference on Network Games, Control and Optimization》|2011年||共8页
会议地点
作者
Nino-Mora Jose; Villar Sofia S.;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类 TP393-53;
关键词

相似文献

外文文献
中文文献
专利

1. Optimal Policies for a Class of Restless Multiarmed Bandit Scheduling Problems with Applications to Sensor Management [J] . R. Washburn, M. Schneider Journal of Advances in Information Fusion . 2008,第1期

机译：一类不安定多臂土匪调度问题的最优策略及其在传感器管理中的应用
2. Opportunistic Scheduling Revisited Using Restless Bandits: Indexability and Index Policy [J] . Wang Kehao, Yu Jihong, Chen Lin, IEEE transactions on wireless communications . 2019,第10期

机译：使用不安的匪徒重新探讨机会调度：可索引性和索引策略
3. Indexability of Restless Bandit Problems and Optimality of Whittle Index for Dynamic Multichannel Access [J] . Liu K.Zhao Q. Information Theory, IEEE Transactions on . 2010,第11期

机译：动态多通道访问的不安定匪问题的可索引性和Whittle索引的最优性
4. Sensor scheduling for hunting elusive hiding targets via whittle's restless bandit index policy [C] . Nino-Mora Jose, Villar Sofia S. 2011 5th International Conference on Network Games, Control and Optimization . 2011

机译：通过惠特勒的躁动不安的土匪指数策略来搜寻难以捉摸的躲藏目标的传感器调度
5. Stochastic optimization over parallel queues: Channel-blind scheduling, restless bandit, and optimal delay. [D] . Li, Chih-ping. 2011

机译：并行队列上的随机优化：信道盲调度，躁动的匪徒和最佳延迟。
6. INDEXABILITY AND OPTIMAL INDEX POLICIES FOR A CLASS OF REINITIALISING RESTLESS BANDITS [O] . Sofía S. Villar -1

机译：一类可恢复初始化的强盗的可失性和最佳索引策略
7. Sensor scheduling for hunting elusive hiding targets: a restless bandit index policy [O] . Niño-Mora José, Villar Sofía S. 2012

机译：用于搜寻难以捉摸的隐藏目标的传感器调度：不安全的强盗指数策略
8. Myopic Policy for a Class of Restless Bandit Problems with Applications in Dynamic Multichannel Access [R] . Liu, K., Zhao, Q. 2009

机译：一类不安全强盗问题的近视策略及其在动态多通道接入中的应用

Sensor scheduling for hunting elusive hiding targets via whittle's restless bandit index policy

摘要

著录项

相似文献

相关主题

期刊订阅