首页> 外国专利> NAVIGATING AERIAL VEHICLES USING DEEP REINFORCEMENT LEARNING

NAVIGATING AERIAL VEHICLES USING DEEP REINFORCEMENT LEARNING

机译：使用深钢筋学习导航航空公司

页面导航

摘要
著录项
相似文献

摘要

The technology relates to navigating aerial vehicles using deep reinforcement learning techniques to generate flight policies. A simulator may produce simulations of a flight occurring in a region of an atmosphere, a replay buffer may store frames of the simulations, and a learning module having a deep reinforcement learning architecture may process a set of frames and output a neural network encoding a learned flight policy. A meta-learning system may include stacks of learning systems, a coordinator to provide an instruction to the learning systems, and an evaluation server to evaluate resulting rewards from learned flight policies. An operational system for controlling flight of an aerial vehicle may include a computing system to process an input vector representing a state of the aerial vehicle and output an action, an operation-ready policies server to store a trained neural network encoding a learned flight policy, and a controller.

机译：该技术涉及使用深增强学习技术导航航空公司来产生飞行政策。模拟器可以在大气区域中产生发生的飞行的模拟，重放缓冲器可以存储模拟的帧，并且具有深增强学习架构的学习模块可以处理一组帧并输出编码学习的神经网络飞行政策。元学习系统可以包括堆栈的学习系统，协调器为学习系统提供指令，以及评估服务器，以评估来自学习的飞行策略的结果奖励。用于控制空中车辆飞行的操作系统可以包括计算系统来处理表示航空车辆状态的输入矢量并输出动作，操作就绪策略服务器，以存储编码学习飞行政策的训练的神经网络，和一个控制器。

著录项

公开/公告号WO2021086532A1

专利类型
公开/公告日2021-05-06

原文格式PDF
申请/专利权人 LOON LLC;
展开▼

申请/专利号WO2020US53345
发明设计人 CANDIDO SALVATORE J.;GONG JUN;GENDRON-BELLEMARE MARC;MACHADO MARLOS CHOLODOVSKIS;
展开▼

申请日2020-09-29
分类号G08G5;G06N3;G06N3/08;
国家 US
入库时间 2022-08-24 18:36:33

相似文献

专利
外文文献
中文文献