Enabling intelligent onboard guidance, navigation, and control using reinforcement learning on near-term flight hardware

Wilson, Callum and Riccardi, Annalisa (2022) Enabling intelligent onboard guidance, navigation, and control using reinforcement learning on near-term flight hardware. Acta Astronautica, 199. pp. 374-385. ISSN 0094-5765 (https://doi.org/10.1016/j.actaastro.2022.07.013)

Text. Filename: Wilson_Riccardi_AA_2022_Enabling_intelligent_onboard_guidance_navigation_and_control_using_reinforcement_learning.pdf
Final Published Version

Abstract

Future space missions require technological advances to meet more stringent requirements. Next generation guidance, navigation, and control systems must safely operate autonomously in hazardous and uncertain environments. While these developments often focus on flight software, spacecraft hardware also creates computational limitations for onboard algorithms. Intelligent control methods combine theories from automatic control, artificial intelligence, and operations research to derive control systems capable of handling large uncertainties. While this can be beneficial for spacecraft control, such control systems often require substantial computational power. Recent improvements in single board computers have created physically lighter and less power-intensive processors that are suitable for spaceflight and purpose built for machine learning. In this study, we implement a reinforcement learning based controller on NVIDIA Jetson Nano hardware and apply this controller to a simulated Mars powered descent problem. The proposed approach uses optimal trajectories and guidance laws under nominal environment conditions to initialise a reinforcement learning agent. This agent learns a control policy to cope with environmental uncertainties and updates its control policy online using a novel update mechanism called Extreme Q-Learning Machine. We show that this control system performs well on flight suitable hardware, which demonstrates the potential for intelligent control onboard spacecraft.

Item metadata

Item type:	Article
ID code:	81480
Dates:	Published: 31 October 2022; Published Online: 15 July 2022; Accepted: 7 July 2022
Subjects:	Technology > Motor vehicles. Aeronautics. Astronautics; Science > Mathematics > Electronic computers. Computer science
Department:	Faculty of Engineering > Mechanical and Aerospace Engineering; Strategic Research Themes > Ocean, Air and Space
Depositing user:	Pure Administrator
Date deposited:	19 Jul 2022 09:38
Last modified:	06 May 2025 15:55
URI:	https://strathprints.strath.ac.uk/id/eprint/81480
