Leveraging optimal control demonstrations in reinforcement learning for powered descent

Tools

Wilson, Callum and Riccardi, Annalisa (2021) Leveraging optimal control demonstrations in reinforcement learning for powered descent. In: 8th International Conference on Astrodynamics Tools and Techniques, 2021-06-22 - 2021-06-25, Virtual.

[thumbnail of Wilson-Riccardi-ICATT-2021-Leveraging-optimal-control-demonstrations-in-reinforcement-learning-for-powered-descent]

Preview

Text. Filename: Wilson_Riccardi_ICATT_2021_Leveraging_optimal_control_demonstrations_in_reinforcement_learning_for_powered_descent.pdf
Accepted Author Manuscript
Download (416kB)| Preview

Abstract

This work presents an approach to deriving a controller for spacecraft powered descent using reinforcement learning. To assist in the learning process, our approach uses optimal control demonstrations which provide open-loop control for optimal trajectories. Combining these approaches to use the optimal trajectories as demonstrations helps to overcome issues with convergence on desirable policies in the reinforcement learning problem. We demonstrate the applicability of this approach on a simulated 3-DOF Mars lander. The results show that the learned controller is capable of achieving a pinpoint soft landing from a range of initial conditions. Compared to the open-loop optimal trajectories alone, this controller generalises to more initial conditions and can cope with environmental uncertainties.

ORCID iDs

Wilson, Callum

and Riccardi, Annalisa

;

Share and Export

Item metadata

Item type:	Conference or Workshop Item(Paper)
ID code:	77284
Dates:	Date Event 25 June 2021 Published 25 June 2021 Published Online 21 March 2021 Accepted
Subjects:	Technology > Mechanical engineering and machinery
Department:	Faculty of Engineering > Mechanical and Aerospace Engineering Strategic Research Themes > Ocean, Air and Space
Depositing user:	Pure Administrator
Date deposited:	04 Aug 2021 13:17
Last modified:	16 Nov 2024 01:40
URI:	https://strathprints.strath.ac.uk/id/eprint/77284

CORE (COnnecting REpositories)