A Reinforcement Learning for Criminal’s Escape Path Prediction


Abstract

A real-time decision support system that provides information about a criminal's possible escape paths can be very useful to law enforcement pursuing a perpetrator after a crime has been committed. Typically, the exact escape path is unknown, and pursuers must rely on a path predicted from the available information about the environment. In a static environment, a perpetrator may escape along an optimal path, which can be predicted using any existing optimal path-finding algorithm. However, the path can change when the environment changes: the perpetrator may decide to switch paths on learning of major changes in the environment. This paper models the perpetrator's path selection as a Markov Decision Process (MDP) and applies Q-learning to solve for the perpetrator's escape path. The experimental results show that our algorithm can find the most probable escape path in a dynamic environment, which can serve as a significant reference in a real-time decision support system for law enforcement applications.

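The approach summarized in the abstract (path selection as an MDP, solved with Q-learning) can be illustrated with a minimal tabular sketch. Everything specific here is an assumption made for illustration, since the abstract does not give the paper's state, action, or reward design: the toy road network `ROADS`, the reward values, the hyperparameters, and the helper names `q_learning` and `greedy_path` are all hypothetical. States are junctions, actions are moves to a neighbouring junction, and a change in the environment is modelled as a set of blocked road segments.

```python
import random

# Hypothetical road network (an assumption, not from the paper):
# junction -> junctions reachable by one road segment.
ROADS = {
    "A": ["B", "C"],
    "B": ["A", "D"],
    "C": ["A", "E"],
    "D": ["B", "F"],
    "E": ["C", "G"],
    "G": ["E", "F"],
    "F": [],  # exit point, treated as the terminal state
}
START, EXIT = "A", "F"


def q_learning(roads, start, goal, blocked=frozenset(), episodes=3000,
               alpha=0.5, gamma=0.9, epsilon=0.3, seed=0):
    """Tabular Q-learning over junctions; returns (Q-table, usable moves)."""
    rng = random.Random(seed)
    # Remove blocked segments: this models a change in the environment.
    moves = {s: [n for n in nbrs if (s, n) not in blocked]
             for s, nbrs in roads.items()}
    q = {(s, n): 0.0 for s, nbrs in moves.items() for n in nbrs}
    for _ in range(episodes):
        state = start
        for _ in range(50):  # cap on episode length
            if state == goal or not moves[state]:
                break
            # Epsilon-greedy action selection.
            if rng.random() < epsilon:
                nxt = rng.choice(moves[state])
            else:
                nxt = max(moves[state], key=lambda n: q[(state, n)])
            # Assumed reward: +10 for reaching the exit, -1 per road segment.
            reward = 10.0 if nxt == goal else -1.0
            future = max((q[(nxt, n)] for n in moves[nxt]), default=0.0)
            # Standard Q-learning update.
            q[(state, nxt)] += alpha * (reward + gamma * future - q[(state, nxt)])
            state = nxt
    return q, moves


def greedy_path(q, moves, start, goal, max_steps=20):
    """Follow the highest-valued action from each junction."""
    path = [start]
    while path[-1] != goal and len(path) <= max_steps:
        state = path[-1]
        if not moves[state]:
            break
        path.append(max(moves[state], key=lambda n: q[(state, n)]))
    return path


# Static environment, then the same network with segment D -> F blocked.
q_static, moves_static = q_learning(ROADS, START, EXIT)
print(greedy_path(q_static, moves_static, START, EXIT))
q_dyn, moves_dyn = q_learning(ROADS, START, EXIT, blocked={("D", "F")})
print(greedy_path(q_dyn, moves_dyn, START, EXIT))
```

In this sketch the Q-table is simply relearned from scratch after the road closure. Whether the paper retrains, updates incrementally, or handles changes some other way is not stated in the abstract.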

Bibliographic details

Source
Asian Conference on Defence Technology | 2018 | v 102 p. : | 5 pages
Authors
Pakamaj Wongsai; Wichai Pawgasame
Original format: PDF
Chinese Library Classification: TP393-532
Keywords
Roads; Junctions; Computational modeling; Markov processes; Law enforcement; Decision support systems; Reinforcement learning


Similar literature


1. Situation-Aware Deep Reinforcement Learning Link Prediction Model for Evolving Criminal Networks [J] . Lim Marcus, Abdullah Azween, Jhanjhi N. Z., Quality Control, Transactions . 2020
2. Hidden Link Prediction in Criminal Networks Using the Deep Reinforcement Learning Technique [J] . Marcus Lim, Azween Abdullah, NZ Jhanjhi, Computers . 2019, No. 1
3. A Path-Integral-Based Reinforcement Learning Algorithm for Path Following of an Autoassembly Mobile Robot [J] . Zhu Wei, Guo Xian, Fang Yongchun, IEEE Transactions on Neural Networks and Learning Systems . 2020, No. 11
4. A Reinforcement Learning for Criminal’s Escape Path Prediction [C] . Pakamaj Wongsai, Wichai Pawgasame, Asian Conference on Defense Technology . 2018
5. A study of collaborative distributed intelligent multi-agent reinforcement learning via multi goals for dynamic agent shortest path-planning [D] . Kim, Minsuk. 2016
6. Subjective and model-estimated reward prediction: Association with the feedback-related negativity (FRN) and reward prediction error in a reinforcement learning task [O] . Naho Ichikawa, Greg J. Siegle, Alexandre Y. Dombrovski
7. Emergent Escape-based Flocking behavior using Multi-Agent Reinforcement Learning [O] . Carsten Hahn, Thomy Phan, Thomas Gabor, 2019
8. Complexity Analysis of Real-Time Reinforcement Learning Applied to Finding Shortest Paths in Deterministic Domains [R] . Koenig, S., Simmons, R. G. 1992
