Partially Observable Reinforcement Learning for Sustainable Active Surveillance

机译：部分可观察的强化学习以实现持续的主动监视

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

Active surveillance is the most effective strategy in the applications of infectious disease prevention, road network optimization, crime reconnaissance, etc. However, the incomplete data collected from partially monitored regions by active surveillance disables existing models to maintain a sustainable performance in the future. To address this issue, this article presents a sustainable active surveillance framework (SAS), which consists of a predictor, a classifier, and a planner, by developing a novel partially observable reinforcement learning algorithm. The predictor estimates priorities of candidate regions for monitoring. The classifier assigns candidate regions with similar features into the same groups, so that the data collected from monitored regions can be shared with unmonitored regions within the group. The planner determines where and when to allocate limited resources, considering the outcomes of available resources and model sustainability. An empirical case study on infectious disease prevention showed that the proposed SAS method significantly outperforms the state-of-the-art methods.

机译：在传染病预防，道路网络优化，犯罪侦查等应用中，主动监视是最有效的策略。但是，通过主动监视从部分监视区域收集的不完整数据将使现有模型无法维持未来的可持续性能。为了解决这个问题，本文通过开发一种新颖的可部分观察的强化学习算法，提出了一个可持续的主动监视框架（SAS），该框架由预测器，分类器和计划器组成。预测变量估计要监视的候选区域的优先级。分类器将具有相似特征的候选区域分配到同一组中，以便可以从监视区域中收集的数据与该组中的未监视区域共享。计划者考虑可用资源的结果并为可持续性建模，确定何时何地分配有限的资源。一项关于传染病预防的经验案例研究表明，提出的SAS方法显着优于最新方法。

著录项

来源
《International conference on knowledge science, engineering and management》|2018年|425-437|共13页
会议地点
作者
Hechang Chen; Bo Yang; Jiming Liu;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类
关键词
Sustainable active surveillance; Resources allocation; Reinforcement learning; Neural networks;

机译：可持续主动监测;资源分配;强化学习;神经网络;

相似文献

外文文献
中文文献
专利

1. PALO bounds for reinforcement learning in partially observable stochastic games [J] . Ceren Roi, He Keyang, Doshi Prashant, Neurocomputing . 2021,第Jana8期

机译：Palo界限为部分可观察到的随机游戏中的加固学习
2. Partially observable environment estimation with uplift inference for reinforcement learning based recommendation [J] . Shang Wenjie, Li Qingyang, Qin Zhiwei, Machine Learning . 2021,第9期

机译：基于学习的强化推论的部分可观察环境估算
3. Reinforcement Learning-Based Autonomous Navigation and Obstacle Avoidance for USVs under Partially Observable Conditions [J] . Nan Yan, Subin Huang, Chao Kong Mathematical Problems in Engineering: Theory, Methods and Applications . 2021,第a期

机译：在部分可观察条件下，加强基于学习的自主导航和USV的避免
4. Partially Observable Reinforcement Learning for Sustainable Active Surveillance [C] . Hechang Chen, Bo Yang, Jiming Liu International Conference on Knowledge Science, Engineering and Management . 2018

机译：可持续积极监测的部分可观察的加强学习
5. A Partially Observable Markov Decision Process for Optimal Design of Surveillance Policies for Bladder Cancer. [D] . Zhang, Yuan. 2012

机译：膀胱癌监测策略优化设计的部分可观察马尔可夫决策过程。
6. Reinforcement Learning with Limited Reinforcement: Using Bayes Risk for Active Learning in POMDPs [O] . Finale Doshi, Joelle Pineau, Nicholas Roy -1

机译：通过有限的强化进行强化学习：使用Bayes风险在POMDP中进行主动学习
7. Learning in a State of Confusion: Employing active perception and reinforcement learning in partially observable worlds [O] . Crook Paul A 2007

机译：处于混乱状态的学习：在部分可观察的世界中采用主动感知和强化学习

Partially Observable Reinforcement Learning for Sustainable Active Surveillance

摘要

著录项

相似文献

相关主题

期刊订阅