Jan Schneider
PhD
Max Planck Institute for Intelligent Systems (MPI-IS)
Exploring Inductive Biases in Reinforcement Learning Through Action Spaces

When applying reinforcement learning to robotics, there is a multitude of choices for the action representations. Our experiments demonstrate that this choice generally has a significant impact on learning performance. In this project, we conduct an in-depth analysis of the causes of these effects for policy gradient algorithms. Particularly, we study effects on the optimization landscape and the variance of the gradient estimator through visualization and analysis techniques. Our experiments reveal significant structure in the learning process and identify effects that impact the optimization performance. We will use the insights gained from these analyses to find optimized action representations to improve learning performance. Furthermore, we will investigate to which extent our findings generalize across similar tasks and to learning on a real robotic platform.

Track:
Academic Track
PhD Duration:
July 1st, 2022 - June 30th, 2026
First Exchange:
April 1st, 2025 - September 30th, 2025
ELLIS Edge Newsletter
Join the 6,000+ people who get the monthly newsletter filled with the latest news, jobs, events and insights from the ELLIS Network.