Robotics and Autonomous Systems

Fixed-Wing UAVs flocking in continuous spaces: A deep reinforcement learning approach

Abstract

Fixed-Wing UAVs (Unmanned Aerial Vehicles) flocking is still a challenging problem due to the kinematics complexity and environmental dynamics. In this paper, we solve the leader-followers flocking problem using a novel deep reinforcement learning algorithm that can generate roll angle and velocity commands by training an end-to-end controller in continuous state and action spaces. Specifically, we choose CACLA (Continuous Actor-Critic Learning Automation) as the base algorithm and we use the multi-layer perceptron to represent both the actor and the critic. Besides, we further improve the learning efficiency by using the experience replay technique that stores the training data in the experience memory and samples from the memory as needed. We have compared the performance of the proposed CACER (Continuous Actor-Critic with Experience Replay) algorithm with benchmark algorithms such as DDPG and double DQN in numerical simulation, and we have demonstrated the performance of the learned optimal policy in semi-physical simulation without any parameter tuning. (C) 2020 Elsevier B.V. All rights reserved.
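Below is a minimal sketch of what a CACLA-style actor-critic with experience replay (the CACER idea described in the abstract) could look like, assuming PyTorch, a small multi-layer perceptron for both the actor and the critic, and a 2-D action consisting of roll-angle and velocity commands. The class name, state dimension, layer sizes, exploration noise, and all hyperparameters are illustrative assumptions, not the authors' implementation.

```python
# Illustrative CACER-style sketch (assumptions throughout, not the paper's code).
import random
from collections import deque

import torch
import torch.nn as nn


def mlp(in_dim, out_dim, hidden=64):
    """Small multi-layer perceptron used for both the actor and the critic."""
    return nn.Sequential(
        nn.Linear(in_dim, hidden), nn.ReLU(),
        nn.Linear(hidden, hidden), nn.ReLU(),
        nn.Linear(hidden, out_dim),
    )


class CACERAgent:
    def __init__(self, state_dim=8, action_dim=2, gamma=0.99, sigma=0.1,
                 buffer_size=100_000, batch_size=64, lr=1e-3):
        self.actor = mlp(state_dim, action_dim)   # outputs [roll command, velocity command]
        self.critic = mlp(state_dim, 1)           # state value V(s)
        self.actor_opt = torch.optim.Adam(self.actor.parameters(), lr=lr)
        self.critic_opt = torch.optim.Adam(self.critic.parameters(), lr=lr)
        self.buffer = deque(maxlen=buffer_size)   # experience replay memory
        self.gamma, self.sigma, self.batch_size = gamma, sigma, batch_size

    def act(self, state):
        # Gaussian exploration around the deterministic actor output.
        with torch.no_grad():
            mean = self.actor(torch.as_tensor(state, dtype=torch.float32))
            return (mean + self.sigma * torch.randn_like(mean)).numpy()

    def store(self, s, a, r, s_next, done):
        # Store one transition in the experience memory.
        self.buffer.append((s, a, r, s_next, float(done)))

    def update(self):
        if len(self.buffer) < self.batch_size:
            return
        batch = random.sample(self.buffer, self.batch_size)  # sample from replay memory
        s, a, r, s2, d = (torch.as_tensor(x, dtype=torch.float32)
                          for x in map(list, zip(*batch)))

        # One-step TD target and TD error, as in CACLA.
        v = self.critic(s).squeeze(-1)
        with torch.no_grad():
            target = r + self.gamma * (1.0 - d) * self.critic(s2).squeeze(-1)
        delta = (target - v).detach()

        # Critic: regress V(s) toward the TD target.
        critic_loss = nn.functional.mse_loss(v, target)
        self.critic_opt.zero_grad()
        critic_loss.backward()
        self.critic_opt.step()

        # Actor (CACLA rule): only on transitions with positive TD error,
        # pull the actor output toward the exploratory action actually taken.
        positive = delta > 0
        if positive.any():
            actor_loss = nn.functional.mse_loss(self.actor(s)[positive], a[positive])
            self.actor_opt.zero_grad()
            actor_loss.backward()
            self.actor_opt.step()
```

In a training loop under these assumptions, each leader-follower transition would be stored with store() and update() called once per step; after training, act() with the exploration noise set to zero maps the flocking state directly to roll and velocity commands.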