Reinforcement Learning in Categorical Cybernetics
Hedges, Jules and Rodríguez Sakamoto, Riu (2025) Reinforcement Learning in Categorical Cybernetics. Electronic Proceedings in Theoretical Computer Science, 429. pp. 270-286. ISSN 2075-2180 (https://doi.org/10.4204/eptcs.429.15)
Preview |
Text.
Filename: Hedges-and-Sakamoto-2025-Reinforcement-Learning-in-Categorical-Cybernetics.pdf
Final Published Version License: Strathprints license 1.0 Download (257kB)| Preview |
Abstract
We show that several major algorithms of reinforcement learning (RL) fit into the framework of categorical cybernetics, that is to say, parametrised bidirectional processes. We build on our previous work in which we show that value iteration can be represented by precomposition with a certain optic. The outline of the main construction in this paper is: (1) We extend the Bellman operators to parametrised optics that apply to action-value functions and depend on a sample. (2) We apply a representable contravariant functor, obtaining a parametrised function that applies the Bellman iteration. (3) This parametrised function becomes the backward pass of another parametrised optic that represents the model, which interacts with an environment via an agent. Thus, parametrised optics appear in two different ways in our construction, with one becoming part of the other. As we show, many of the major classes of algorithms in RL can be seen as different extremal cases of this general setup: dynamic programming, Monte Carlo methods, temporal difference learning, and deep RL. We see this as strong evidence that this approach is a natural one and believe that it will be a fruitful way to think about RL in the future.
ORCID iDs
Hedges, Jules and Rodríguez Sakamoto, Riu
ORCID: https://orcid.org/0000-0002-3356-7412;
-
-
Item type: Article ID code: 94389 Dates: DateEvent25 September 2025Published3 April 2024AcceptedSubjects: Science > Mathematics
Science > Science (General) > CyberneticsDepartment: Faculty of Science > Computer and Information Sciences
Faculty of Engineering > Electronic and Electrical EngineeringDepositing user: Pure Administrator Date deposited: 08 Oct 2025 16:05 Last modified: 02 Feb 2026 08:36 Related URLs: URI: https://strathprints.strath.ac.uk/id/eprint/94389
Tools
Tools





