On Developing a UAV Pursuit-Evasion Policy Using Reinforcement Learning

Abstract

We present an approach for learning a reactive maneuver policy for a UAV involved in a close-quarters one-on-one aerial engagement. Specifically, UAVs with behaviors learned through reinforcement learning can match or improve upon simple, but effective behaviors for intercept. In this paper, a framework for developing reactive policies that can learn to exploit behaviors is discussed. In particular, the A3C algorithm with a deep neural network is applied to the aerial combat domain. The efficacy of the learned policy is demonstrated in Monte Carlo experiments. An architecture that can transfer the learned policy from simulation to an actual aircraft and its effectiveness in live-flight are also demonstrated.
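The abstract names A3C but gives no details of the network, reward, or hyperparameters. As a rough illustration only, the sketch below shows the per-rollout quantities a standard A3C worker computes: bootstrapped n-step returns, advantages, and the policy and value losses. The function names, the two-action uniform policy, the reward values, and the coefficients are all hypothetical and are not taken from the paper.

```python
import math


def n_step_returns(rewards, bootstrap_value, gamma=0.99):
    """Discounted returns R_t = r_t + gamma * R_{t+1}, seeded with the critic's
    value estimate for the state that follows the last reward."""
    returns = []
    running = bootstrap_value
    for r in reversed(rewards):
        running = r + gamma * running
        returns.append(running)
    returns.reverse()
    return returns


def a3c_losses(log_probs, entropies, values, returns, entropy_coef=0.01):
    """Return (policy_loss, value_loss) summed over one rollout segment."""
    policy_loss = 0.0
    value_loss = 0.0
    for logp, ent, v, ret in zip(log_probs, entropies, values, returns):
        advantage = ret - v  # treated as a constant when differentiating the policy term
        policy_loss += -logp * advantage - entropy_coef * ent
        value_loss += 0.5 * (ret - v) ** 2
    return policy_loss, value_loss


# Hypothetical 3-step rollout that ends in a terminal state (bootstrap value 0).
rewards = [0.0, 0.0, 1.0]
values = [0.5, 0.6, 0.8]             # critic estimates V(s_t), made up for illustration
log_probs = [math.log(0.5)] * 3      # uniform policy over two actions
entropies = [math.log(2.0)] * 3      # entropy of that uniform policy

returns = n_step_returns(rewards, bootstrap_value=0.0, gamma=0.9)
policy_loss, value_loss = a3c_losses(log_probs, entropies, values, returns)
print(returns, policy_loss, value_loss)
```

In a full A3C setup, several workers would compute these losses in parallel on their own copies of the environment and apply the resulting gradients asynchronously to shared network parameters; that part is omitted here.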

Bibliographic record

Source
IEEE International Conference on Machine Learning and Applications, 2018, pp. 859-864 (6 pages)
Authors
Bogdan Vlahov; Eric Squires; Laura Strickland; Charles Pippin
Original format
PDF
Keywords
Aircraft; Atmospheric modeling; Training; Testing; Neural networks; Reinforcement learning; Games

Similar literature

1. Learning to Fly: Computational Controller Design for Hybrid UAVs with Reinforcement Learning [J]. Xu Jie, Du Tao, Foshey Michael, ACM Transactions on Graphics. 2019, No. 4CD
2. Cooperative control for multi-player pursuit-evasion games with reinforcement learning [J]. Wang Yuanda, Dong Lu, Sun Changyin, Neurocomputing. 2020, No. Octa28
3. Context matters: using reinforcement learning to develop human-readable, state-dependent outbreak response policies [J]. Probert W. J. M., Lakkur S., Fonnesbeck C. J., Philosophical Transactions of the Royal Society of London, Series B, Biological Sciences. 2019, No. 1776
4. On Developing a UAV Pursuit-Evasion Policy Using Reinforcement Learning [C]. Bogdan Vlahov, Eric Squires, Laura Strickland, IEEE International Conference on Machine Learning and Applications. 2018
5. Learning in Pursuit-Evasion Differential Games Using Reinforcement Fuzzy Learning [D]. Al Faiya, Badr. 2012
6. Context matters: using reinforcement learning to develop human-readable state-dependent outbreak response policies [O]. W. J. M. Probert, S. Lakkur, C. J. Fonnesbeck, 2019
7. UAV Autonomous Aerial Combat Maneuver Strategy Generation with Observation Error Based on State-Adversarial Deep Deterministic Policy Gradient and Inverse Reinforcement Learning [O]. Weiren Kong, Deyun Zhou, Zhen Yang, 2020
