Visual Data-Type Understanding does not emerge from scaling Vision-Language Models

Vishaal Udandarao, Max F Burg, Samuel Albanie, Matthias Bethge

