首页> 外文会议>International Conference on Robotics and Automation Engineering >An Improved Method Based on Deep Reinforcement Learning for Target Searching
【24h】

An Improved Method Based on Deep Reinforcement Learning for Target Searching

机译:一种基于深度加强学习的目标搜索的改进方法

获取原文

摘要

Unmanned Aerial Vehicle (UAV), due to their high mobility and the ability to cover areas of different heights and locations at relatively low cost, are increasingly used for disaster monitoring and detecting. However, developing and testing UAVs in real world is an expensive task, especially in the domain of search and rescue, most of the previous systems are developed on the basis of greedy or potential-based heuristics without neural network. On the basis of the recent development of deep neural network architecture and deep reinforcement learning (DRL), in this research we improved the probability of success rate of searching target in an unstructured environment by combining image processing algorithms and reinforcement learning methods (RL). This paper aims at the deficiency of target tracking in unstructured environment, trying to propose an algorithm of stationary target positioning of UAV based on computer vision system. Firstly, a new input source is formed by acquiring depth information image of current environment and combining segmentation image. Secondly, the DQN algorithm is used to regulate the reinforcement learning model, and the specific flight response can be independently selected by the UAV through training. This paper utilizes open-source Microsoft UAV simulator AirSim as training and test environment based with Keras a machine learning framework. The main approach investigated in this research is modifying the network of Deep Q-Network, which designs the moving target tracking experiment of UAV in simulation scene. The experimental results demonstrate that this method has better tracking effect.
机译:无人驾驶飞行器(UAV),由于它们的高迁移率和覆盖不同高度的区域和以相对较低的成本覆盖不同高度和位置的能力,越来越多地用于灾害监测和检测。然而,现实世界中的开发和测试无人机是一个昂贵的任务,特别是在搜索和救援领域,最先前的系统是在没有神经网络的贪婪或基于潜在的启发式的基础上开发的。在最近的深度神经网络架构和深度加强学习(DRL)的基础上,在本研究中,我们通过组合图像处理算法和增强学习方法(RL)来改善在非结构化环境中搜索目标的成功率的概率。本文旨在非结构化环境中的目标跟踪缺陷,试图基于计算机视觉系统提出无人机静止目标定位算法。首先,通过获取当前环境的深度信息图像和组合分割图像来形成新的输入源。其次,DQN算法用于调节增强学习模型,并且可以通过训练独立地选择特定的飞行响应。本文利用开源Microsoft UAV Simulator AIRSIM作为基于Keras机器学习框架的培训和测试环境。本研究中调查的主要方法正在修改深度Q-Network网络,该网络设计了仿真场景中的无人机的移动目标跟踪实验。实验结果表明,该方法具有更好的跟踪效果。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号