Gabriele Goletto

Efficient egocentric vision models for the edge

Gabriele Goletto (Ph.D. Student)

With the advent of machines capable, in principle, to cooperate with humans in performing tasks of daily living, it becomes crucial the development of methods to quickly and easily infer human actions from common data. Egocentric vision is rapidly becoming one of the most promising source of information for this purpose, demonstrating impressing capabilities in pose estimation, action recognition, anticipation, retrieval, and many more. However, often these models suppose very controlled scenarios, while the major challenges for bringing egocentric vision into the wild are seldom considered overall. The aim of this project is to work on the boundaries that egocentric vision models should consider for realistic applications, namely the dimensionality of the model, the real-time of inference, the potential to easily generalize across conditions (i.e. new spaces, users, tasks), and the capability to work on continuous stream of data.

Primary Host: Barbara Caputo (Politecnico di Torino & Italian Institute of Technology)
Exchange Host: Dima Damen (University of Bristol)
PhD Duration: 01 January 2022 - 31 December 2024
Exchange Duration: 01 February 2023 - 31 July 2023 - Ongoing