Adrian Müller
PhD
Swiss Federal Institute of Technology Lausanne (EPFL)
Reinforcement Learning Through the Lens of Optimization

Reinforcement learning offers a solution to learning problems that require planning, and it has led to several breakthroughs in recent years. However, many of these breakthroughs were achieved in controlled setups, where it is common that (a) a theoretical understanding of the algorithms is not required and (b) only the final trained policy matters, not the performance during learning. This PhD project aims to provide reinforcement learning algorithms that come with the mathematical guarantees needed outside such setups. Crucially, these algorithms should also provably scale to large Markov decision processes. The key idea is to study the learning problems from the viewpoint of online optimization theory.

Track:
Academic Track
PhD Duration:
September 1st, 2023 - September 1st, 2027
First Exchange:
September 1st, 2025 - March 1st, 2026