Thumb ticker md aalto 2019 by matti ahlgren2

Primal-Dual Methods for Reinforcement Learning

Antoine Moulin (Ph.D. Student)

Empirical research in reinforcement learning has achieved impressive results over the past few years. However, many questions remain open regarding the theoretical guarantees of the algorithms used in practice. The PhD project aims to gain a deeper understanding of the challenges posed by large-scale reinforcement learning by identifying and exploiting structural properties of Markov decision processes that make large-scale learning statistically and computationally feasible. In particular, we aim to develop efficient and theoretically grounded reinforcement learning algorithms from the linear programming formulation of optimal control in Markov decision processes, which has been one of the most promising research directions explored in the past few years.

Primary Host: Gergely Neu (Universitat Pompeu Fabra)
Exchange Host: Arthur Gretton (University College London)
PhD Duration: 01 December 2021 - 01 July 2025
Exchange Duration: 01 October 2022 - 31 January 2023 01 July 2024 - 01 October 2024