首页> 外文会议>International Conference on Robotics and Automation Engineering >An Improved Method Based on Deep Reinforcement Learning for Target Searching
【24h】

An Improved Method Based on Deep Reinforcement Learning for Target Searching

机译:一种基于深度强化学习的目标搜索改进方法

获取原文

摘要

Unmanned Aerial Vehicle (UAV), due to their high mobility and the ability to cover areas of different heights and locations at relatively low cost, are increasingly used for disaster monitoring and detecting. However, developing and testing UAVs in real world is an expensive task, especially in the domain of search and rescue, most of the previous systems are developed on the basis of greedy or potential-based heuristics without neural network. On the basis of the recent development of deep neural network architecture and deep reinforcement learning (DRL), in this research we improved the probability of success rate of searching target in an unstructured environment by combining image processing algorithms and reinforcement learning methods (RL). This paper aims at the deficiency of target tracking in unstructured environment, trying to propose an algorithm of stationary target positioning of UAV based on computer vision system. Firstly, a new input source is formed by acquiring depth information image of current environment and combining segmentation image. Secondly, the DQN algorithm is used to regulate the reinforcement learning model, and the specific flight response can be independently selected by the UAV through training. This paper utilizes open-source Microsoft UAV simulator AirSim as training and test environment based with Keras a machine learning framework. The main approach investigated in this research is modifying the network of Deep Q-Network, which designs the moving target tracking experiment of UAV in simulation scene. The experimental results demonstrate that this method has better tracking effect.
机译:由于无人飞行器(UAV)的高机动性和能够以相对较低的成本覆盖不同高度和位置的区域的能力,越来越多地用于灾害监测和检测。但是,在现实世界中开发和测试无人机是一项昂贵的任务,特别是在搜索和救援领域,以前的大多数系统都是在没有神经网络的情况下基于贪婪或基于潜能的启发式技术开发的。在深度神经网络架构和深度强化学习(DRL)的最新发展的基础上,本研究通过结合图像处理算法和强化学习方法(RL)提高了非结构化环境中目标搜索成功率的可能性。针对非结构化环境下目标跟踪的不足,试图提出一种基于计算机视觉系统的无人机固定目标定位算法。首先,通过获取当前环境的深度信息图像并组合分割图像来形成新的输入源。其次,DQN算法用于调节强化学习模型,具体的飞行响应可以由无人机通过训练独立选择。本文利用开源的Microsoft UAV模拟器AirSim作为基于Keras机器学习框架的培训和测试环境。本研究研究的主要方法是修改深度Q网络,设计模拟场景中无人机的运动目标跟踪实验。实验结果表明,该方法具有较好的跟踪效果。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号