Low-energy transfers reduce the total ΔV in deep space missions by exploiting multibody dynamics. However, searching for initial guesses in highly nonlinear systems is often a demanding task and it has been heavily dependent on professionals. In this paper, we aim to automate this initial guess search by using deep reinforcement learning (RL) algorithms. Minimum fuel transfer problems from low-Earth orbit to three-body orbits are redefined in the RL framework with the ΔV vector as an action. Since a common RL algorithm, deep deterministic policy gradient, fails to find long time-of-flight transfers, we extend it by explicitly separating a null action from the continuous action space. We demonstrate our algorithm successfully finds long transfers from a wide variety of initial states in both two and three dimensional problems.
展开▼