Michael Dorkenwald
PhD
University of Amsterdam (UvA)
Generalizeable Video Representation Learning

Video data is a treasure trove for AI models, providing a lens to the intricate dynamics and mechanisms that define our world. The key to unlocking the tremendous amount of data available on the web is to bypass the laborious and expensive process of annotating each video. Yet, extracting knowledge and understanding from these videos without labels poses a significant challenge. This research project aims to address this problem by developing innovative self-supervised methods that leverage multi-modalities (e.g. audio) to achieve a more comprehensive and causally-informed understanding of video data and its reflection on the world. The primary objective is to generate robust representations that can be applied to various video-related tasks, such as video scene understanding or long-term video understanding.

Track:
Industry Track
PhD Duration:
June 1st, 2022 - June 1st, 2027
ELLIS Edge Newsletter
Join the 6,000+ people who get the monthly newsletter filled with the latest news, jobs, events and insights from the ELLIS Network.