Srishti Yadav
PhD
University of Copenhagen
Visual Understanding with Fine-Grained Language and Complimentary Cues

Visual data is informative, but they are also confusing, intentionally or unintentionally. Images that are visually similar, confusing, and manipulative to the human eye can benefit from the image pattern identification and associated description of these images at a fine-grained level. Understanding which token, words, phrases, or sentences evoke the best meaning, intention, and motivation of an image captured in real-life can have wide applications. Our research will attempt to understand this use of the objects and complimentary cues like motivation or feelings behind descriptions (as seen in the real world e.g. in news articles, video interviews with transcription, etc.) to find images that best match the fine-grained descriptions. These language-based heuristics, we contend, will not always result in an unequivocal interpretation of images, but will at least explain at what point and why interpretations differ.

Track:
Academic Track
PhD Duration:
November 1st, 2022 - October 31st, 2025
First Exchange:
January 1st, 2025 - June 30th, 2025
ELLIS Edge Newsletter
Join the 6,000+ people who get the monthly newsletter filled with the latest news, jobs, events and insights from the ELLIS Network.