Multimodal Foundation Models: from Research to Innovation Workshop

This half-day workshop will explore the latest advances in multimodal artificial intelligence, with a focus on how cutting-edge research can be translated into real-world innovation. Bringing together leading researchers, industry representatives and European AI initiatives, the event will create a space for scientific exchange, applied perspectives and discussion on the future of AI-driven innovation.

The workshop is open to the public and will take place at the Institut d’Estudis Catalans (IEC) in Barcelona, with the possibility to attend either in person or online.

About the workshop:
Multimodal foundation models are becoming central to the next generation of artificial intelligence. By learning from and connecting different types of data — such as text, images, video, audio, 3D, sensors and other data streams — these models open new possibilities for more generalisable, robust and adaptable AI systems.

The workshop will feature invited talks by top researchers in the field, presenting recent developments in multimodal foundation model research. These scientific contributions will be complemented by an invited industry talk and a panel discussion on the role of AI-driven innovation in connecting research breakthroughs with practical applications.

The event is hosted by the Computer Vision Center (CVC), co-organised by ELLIOT, ELLIS Unit Barcelona, the ELLIS Program on Multimodal Learning Systems, ELIAS, and XARXA RDI-IA, and supported by the city council of Barcelona.