首页> 外文会议>Asian Conference on Defense Technology >A Reinforcement Learning for Criminal’s Escape Path Prediction
【24h】

A Reinforcement Learning for Criminal’s Escape Path Prediction

机译:犯罪分子逃生路径预测的强化学习

获取原文

摘要

A real-time decision support system with the capability to provide information related to possible criminal's escape path can be very useful for a law enforcement to pursue a perpetrator after a crime has been committed. Typically, the exact escape path is unknown, and pursuers must relied on a predicted path based on available information about the environment. In static environment, a perpetrator may escape through an optimal path that is predicted using any existing optimal path finding algorithms. However, the path can be dynamic when environment is changed. The perpetrator may decide to change path when there is information about foremost changes in environment. This paper models the perpetrator's path selection as a Markov Decision Process (MDP) and apply Q-learning to solve for a perpetrator's escape path. The experiment results shows that our algorithm can find most probable escape path in the dynamic environment, which can be significant reference in a real-time decision support system for law enforcement applications.
机译:能够提供与可能的犯罪分子的逃生路径相关的信息的实时决策支持系统对于执法人员在犯罪后追捕肇事者非常有用。通常,确切的逃生路径是未知的,追赶者必须基于有关环境的可用信息依赖于预测的路径。在静态环境中,犯罪者可能会通过使用任何现有的最佳路径查找算法预测的最佳路径逃逸。但是,当环境更改时,路径可以是动态的。当存在有关环境中最重要变化的信息时,犯罪者可以决定更改路径。本文将犯罪者的路径选择建模为马尔可夫决策过程(MDP),并应​​用Q学习来解决犯罪者的逃生路径。实验结果表明,该算法能够在动态环境中找到最可能的逃生路径,这对于执法应用实时决策支持系统具有重要的参考意义。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号