首页> 外文会议>Asian Conference on Defense Technology >A Reinforcement Learning for Criminal’s Escape Path Prediction

【24h】

A Reinforcement Learning for Criminal’s Escape Path Prediction

机译：犯罪分子逃生路径预测的强化学习

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

A real-time decision support system with the capability to provide information related to possible criminal's escape path can be very useful for a law enforcement to pursue a perpetrator after a crime has been committed. Typically, the exact escape path is unknown, and pursuers must relied on a predicted path based on available information about the environment. In static environment, a perpetrator may escape through an optimal path that is predicted using any existing optimal path finding algorithms. However, the path can be dynamic when environment is changed. The perpetrator may decide to change path when there is information about foremost changes in environment. This paper models the perpetrator's path selection as a Markov Decision Process (MDP) and apply Q-learning to solve for a perpetrator's escape path. The experiment results shows that our algorithm can find most probable escape path in the dynamic environment, which can be significant reference in a real-time decision support system for law enforcement applications.

机译：能够提供与可能的犯罪分子的逃生路径相关的信息的实时决策支持系统对于执法人员在犯罪后追捕肇事者非常有用。通常，确切的逃生路径是未知的，追赶者必须基于有关环境的可用信息依赖于预测的路径。在静态环境中，犯罪者可能会通过使用任何现有的最佳路径查找算法预测的最佳路径逃逸。但是，当环境更改时，路径可以是动态的。当存在有关环境中最重要变化的信息时，犯罪者可以决定更改路径。本文将犯罪者的路径选择建模为马尔可夫决策过程（MDP），并应用Q学习来解决犯罪者的逃生路径。实验结果表明，该算法能够在动态环境中找到最可能的逃生路径，这对于执法应用实时决策支持系统具有重要的参考意义。

著录项

来源
《Asian Conference on Defense Technology 》|2018年|26-30|共5页
会议地点 Hanoi(VN)
作者
Pakamaj Wongsai; Wichai Pawgasame;
展开▼
作者单位

Data Communication Division Defence Technology Institute Pakkret Nonthaburi Thailand;

展开▼
会议组织
原文格式 PDF
正文语种
中图分类
关键词
Roads; Junctions; Computational modeling; Markov processes; Law enforcement; Decision support systems; Reinforcement learning;

机译：道路；交界处；计算建模；马尔可夫过程；执法;决策支持系统；强化学习;

相似文献

外文文献
中文文献
专利

1. Situation-Aware Deep Reinforcement Learning Link Prediction Model for Evolving Criminal Networks [J] . Lim Marcus, Abdullah Azween, Jhanjhi N. Z., Quality Control, Transactions . 2020 ,第期

机译：不知情的深度加强学习链接预测预测模型，实现刑事网络
2. Hidden Link Prediction in Criminal Networks Using the Deep Reinforcement Learning Technique [J] . Marcus Lim, Azween Abdullah, NZ Jhanjhi, Computers . 2019 ,第1期

机译：使用深度强化学习技术的犯罪网络中的隐藏链接预测
3. A Path-Integral-Based Reinforcement Learning Algorithm for Path Following of an Autoassembly Mobile Robot [J] . Zhu Wei, Guo Xian, Fang Yongchun, Neural Networks and Learning Systems, IEEE Transactions on . 2020 ,第11期

机译：基于基于路径的路径跟踪跟踪后面的自动装配移动机器人
4. A Reinforcement Learning for Criminal’s Escape Path Prediction [C] . Pakamaj Wongsai, Wichai Pawgasame Asian Conference on Defence Technologys . 2018

机译：犯罪逃生路径预测的加强学习
5. A study of collaborative distributed intelligent multi-agent reinforcement learning via multi goals for dynamic agent shortest path-planning [D] . Kim, Minsuk. 2016

机译：通过多目标进行动态代理最短路径规划的协同分布式智能多功能智能多功能多智能智能多功能
6. Subjective and model-estimated reward prediction: Association with the feedback-related negativity (FRN) and reward prediction error in a reinforcement learning task [O] . Naho Ichikawa, Greg J. Siegle, Alexandre Y. Dombrovski, -1

机译：主观和模型估计奖励预测：与反馈相关的消极性（FRN）关联并在加固学习任务中奖励预测误差
7. Emergent Escape-based Flocking behavior using Multi-Agent Reinforcement Learning [O] . Carsten Hahn, Thomy Phan, Thomas Gabor, 2019

机译：基于逃避的植物植入行为，采用多档强化学习
8. Complexity Analysis of Real-Time Reinforcement Learning Applied to FindingShortest Paths in Deterministic Domains [R] . Koenig, S., Simmons, R. G. 1992

机译：实时强化学习的复杂性分析应用于确定性域中寻找最短路径

A Reinforcement Learning for Criminal’s Escape Path Prediction

摘要

著录项

相似文献

相关主题

期刊订阅