首页> 外文会议>Asian Conference on Defence Technologys >A Reinforcement Learning for Criminal’s Escape Path Prediction
【24h】

A Reinforcement Learning for Criminal’s Escape Path Prediction

机译:犯罪逃生路径预测的加强学习

获取原文

摘要

A real-time decision support system with the capability to provide information related to possible criminal's escape path can be very useful for a law enforcement to pursue a perpetrator after a crime has been committed. Typically, the exact escape path is unknown, and pursuers must relied on a predicted path based on available information about the environment. In static environment, a perpetrator may escape through an optimal path that is predicted using any existing optimal path finding algorithms. However, the path can be dynamic when environment is changed. The perpetrator may decide to change path when there is information about foremost changes in environment. This paper models the perpetrator's path selection as a Markov Decision Process (MDP) and apply Q-learning to solve for a perpetrator's escape path. The experiment results shows that our algorithm can find most probable escape path in the dynamic environment, which can be significant reference in a real-time decision support system for law enforcement applications.
机译:实时决策支持系统具有提供与可能的犯罪逃生路径相关的信息的能力对于执法者在犯罪犯下后,可以非常有用。通常,精确的转义路径是未知的,并且追求者必须基于有关环境的可用信息依赖于预测路径。在静态环境中,肇事者可以通过使用任何现有最佳路径查找算法预测的最佳路径来逃逸。但是,当环境发生变化时,路径可以是动态的。当有关于环境变化的信息时,犯罪者可以决定改变路径。本文模拟了犯罪者的路径选择作为Markov决策过程(MDP),并应​​用Q-Leach,以解决犯罪者的逃生路径。实验结果表明,我们的算法可以在动态环境中找到最可能的逃生路径,这可以是执法应用的实时决策支持系统中的重要引用。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号