首页> 外文期刊>Expert Systems with Application >Reward-driven U-Net training for obstacle avoidance drone
【24h】

Reward-driven U-Net training for obstacle avoidance drone

机译:奖励驱动的U-Net培训,用于避障无人机

获取原文
获取原文并翻译 | 示例

摘要

Along with the fast progress in deep learning, an autonomous drone with obstacle avoidance capability has been studied mainly by two machine learning paradigms, Le. supervised learning, and reinforcement learning. The former has some advantages since the trained network is light and fast, but it needs a large amount of data that requires laborious manual labeling. With the latter, such a drawback can be overcome as an agent learns by itself in a simulated environment, although the gap between the real and simulated one has to be minimized in the end. This study proposes a new framework where a supervised segmentation network is trained with labels made by an actor-critic network in a reward-driven manner, wherein this U-Net based network infers the next moving direction from the sequence of input images. For the actor-critic part, several recent policy gradient algorithms have been tested for controlling the drone with the continuous action space. After training in the Airsim simulation environment, the model is transferred to a Bebop drone flying in the real environment, built as a reconfigurable maze using panels and a hoop. The result suggests that our network enables the drone to navigate through the obstacles using only monocular RGB input in the trained environment as well as in the reconfigured ones without retraining. (C) 2019 Elsevier Ltd. All rights reserved.
机译:随着深度学习的快速发展,主要通过两个机器学习范式Le研究了具有避障能力的自主无人机。监督学习和强化学习。前者具有一些优点,因为受过训练的网络轻巧,快速,但是它需要大量数据,需要费力的手动标记。对于后者,可以克服这种缺陷,因为代理程序可以在模拟环境中自行学习,尽管最终必须最小化真实人与模拟人之间的差距。这项研究提出了一个新的框架,其中有监督的分割网络使用行为者批评网络以奖励驱动的方式来训练标签,其中基于U-Net的网络从输入图像序列中推断出下一个运动方向。对于演员批评者部分,已经测试了几种最新的策略梯度算法,用于控制具有连续动作空间的无人机。在Airsim模拟环境中训练后,模型被转移到在实际环境中飞行的Bebop无人机,并使用面板和箍将其构建为可重新配置的迷宫。结果表明,我们的网络使无人机能够在经过训练的环境以及经过重新配置的环境中,仅使用单眼RGB输入就可以穿越障碍物,而无需进行重新训练。 (C)2019 Elsevier Ltd.保留所有权利。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号