ELLIS fosters international collaboration across domains, connecting top researchers while investing in the next generation of AI talent.

ELLIS Members are leading scientists in machine learning and AI, shaping Europe's global position in these fields.

ELLIS is a pan-European AI network of excellence built upon machine learning as the driver for modern AI.

Home
› The Perils of Optimizing Learned Reward Functions: Low Training Error Does Not Guarantee Low Regret

The Perils of Optimizing Learned Reward Functions: Low Training Error Does Not Guarantee Low Regret

2025

raw.githubusercontent.com

Lukas Fluri, Leon Lang, Alessandro Abate, Patrick Forré, David Krueger, Joar Max Viktor Skalse

ELLIS Authors

No location data available for the ELLIS authors of this paper.

Research

Members

About

ELLIS Edge Newsletter

Join the 6,000+ people who get the monthly newsletter filled with the latest news, jobs, events and insights from the ELLIS Network.