Terminal adaptive guidance via reinforcement meta-learning: Applications to autonomous asteroid close-proximity operations

首页> 外文期刊>Acta astronautica >Terminal adaptive guidance via reinforcement meta-learning: Applications to autonomous asteroid close-proximity operations

【24h】

Terminal adaptive guidance via reinforcement meta-learning: Applications to autonomous asteroid close-proximity operations

机译：通过强化元学习进行终端自适应制导：在小行星近距离自主操作中的应用

获取原文

获取原文并翻译 | 示例

掌桥外文数据库（机构版） >>

开具论文收录证明 >>

文献代查 >>

页面导航

摘要
著录项
相似文献
相关主题

摘要

Current practice for asteroid close proximity maneuvers requires extremely accurate characterization of the environmental dynamics and precise spacecraft positioning prior to the maneuver. This creates a delay of several months between the spacecraft's arrival and the ability to safely complete close proximity maneuvers. In this work we develop an adaptive integrated guidance, navigation, and control system that can complete these maneuvers in environments with unknown dynamics, with initial conditions spanning a large deployment region, and without a shape model of the asteroid. The system is implemented as a policy optimized using reinforcement meta-learning. The lander is equipped with an optical seeker that locks to either a terrain feature, reflected light from a targeting laser, or an active beacon, and the policy maps observations consisting of seeker angles and LIDAR range readings directly to engine thrust commands. The policy implements a recurrent network layer that allows the deployed policy to adapt real time to both environmental forces acing on the agent and internal disturbances such as actuator failure and center of mass variation. We validate the guidance system through simulated landing maneuvers in a six degrees-of-freedom simulator. The simulator randomizes the asteroid's characteristics such as solar radiation pressure, density, spin rate, and nutation angle, requiring the guidance and control system to adapt to the environment. We also demonstrate robustness to actuator failure, sensor bias, and changes in the lander's center of mass and inertia tensor. Finally, we suggest a concept of operations for asteroid close proximity maneuvers that is compatible with the guidance system.

机译：小行星近距离操纵的当前实践要求在操纵之前对环境动力学进行极为精确的表征，并要求航天器进行精确定位。这在航天器到达与安全完成近距离操纵的能力之间造成了几个月的延迟。在这项工作中，我们开发了一种自适应的集成制导，导航和控制系统，该系统可以在动力学未知的环境中完成这些操作，初始条件跨越较大的部署区域，并且没有小行星的形状模型。该系统被实现为使用强化元学习优化的策略。着陆器配备了一个光学导引器，该导引器可锁定地形特征，来自目标激光的反射光或活动信标，并且该策略将包括导引器角度和LIDAR范围读数的观测结果直接映射到发动机推力命令。该策略实现了循环网络层，该层允许已部署的策略使实时适应代理上施加的环境力以及内部干扰（例如执行器故障和质心变化）。我们通过六自由度模拟器中的模拟着陆演习来验证制导系统。该模拟器随机化小行星的特性，例如太阳辐射压力，密度，旋转速率和章动角，需要引导和控制系统以适应环境。我们还展示了对执行器故障，传感器偏置以及着陆器质心和惯性张量变化的鲁棒性。最后，我们提出了与制导系统兼容的小行星近距离操纵的操作概念。

著录项

来源
《Acta astronautica》 |2020年第6期|1-13|共13页
作者

展开▼
作者单位

Univ Arizona Dept Syst & Ind Engn Tucson AZ 85721 USA;

MIT Dept Aeronaut & Astronaut Cambridge MA 02139 USA;

展开▼
收录信息美国《科学引文索引》(SCI);美国《工程索引》(EI);
原文格式 PDF
正文语种 eng
中图分类
关键词
Reinforcement learning; Asteroid missions; Guidance; Navigation artificial intelligence; Autonomous maneuvers;

机译：强化学习;小行星任务;指导;导航人工智能;自主演习;

相似文献

外文文献
中文文献
专利

1. Six degree-of-freedom body-fixed hovering over unmapped asteroids via LIDAR altimetry and reinforcement meta-learning [J] . Gaudet Brian, Linares Richard, Furfaro Roberto Acta astronautica . 2020,第Jula期

机译：通过激光雷达高度和钢筋元学习，六种自由度悬停在未映射的小行星上。
2. Iterative-Learning-Control-Based Tracking for Asteroid Close-Proximity Operations [J] . Long Jiateng, Wu Fen Journal of guidance, control, and dynamics . 2019,第5期

机译：小行星近距离操作的基于迭代学习控制的跟踪
3. Robust Adaptive Tracking of Rigid-Body Motion With Applications to Asteroid Proximity Operations [J] . George Vukovich, Haichao Gui IEEE Transactions on Aerospace and Electronic Systems . 2017,第1期

机译：刚体运动的鲁棒性自适应跟踪及其在小行星近场应用中的应用
4. Development of Non-Linear Guidance Algorithms for Asteroids Close-Proximity Operations [C] . Roberto Furfaro, Brian Gaudet, Daniel R. Wibben, AIAA guidance, navigation, and control conference . 2013

机译：小行星近距离操作的非线性制导算法的开发
5. Bio-inspired Method for Close-proximity Operations on a Near-earth Asteroid [D] . Valenzuela Najera, Rene Alberto. 2020

机译：关于近地球小行星的近距离操作的生物启发方法
6. A Computationally Inexpensive Optimal Guidance via Radial-Basis-Function Neural Network for Autonomous Soft Landing on Asteroids [O] . Peng Zhang, Keping Liu, Bo Zhao, -1

机译：通过径向基函数神经网络的计算廉价最优制导用于小行星自主软着陆
7. Terminal adaptive guidance via reinforcement meta-learning: Applications to autonomous asteroid close-proximity operations [O] . Brian Gaudet, Richard Linares, Roberto Furfaro 2020

机译：终端自适应引导通过钢筋元学习：自主小行星近距离操作的应用

Terminal adaptive guidance via reinforcement meta-learning: Applications to autonomous asteroid close-proximity operations

摘要

著录项

相似文献

相关主题

期刊订阅