Foundational and language models for 3D scene understanding

University of Trento

PhD Programme in Information Engineering and Computer Science
Cycle: 40

3D scene understanding is an area of vision research with applications ranging from augmented reality to autonomous navigation. This PhD position is focused on research into foundational models for 3D scene understanding. 3D scenes can be created from image collections (Structure from Motion, Simultaneous Localisation and Mapping), thus allowing for the extraction of foundational representations from images and their transfer to the 3D domain using pixel-to-point correspondences. These representations can then be interacted with via language model prompting. However, these representations have been optimised for 2D reasoning. The PhD candidate will be tasked with exploring novel approaches to disentangle object-level information in the 2D domain and to fuse it in 3D, thereby enabling 3D reasoning capabilities.

FBK Contact

Are you ready to join FBK international community?

We welcome motivated applicants who are passionate about research, eager to learn, and driven by curiosity to explore new ideas.

Six reasons to become a PhD student at FBK

At FBK, our PhD program is designed to develop highly specialized researchers in a unique, stimulating environment

RESEARCH
AT FBK​

A Hub of innovation and collaboration​

TOWARD PHD EXCELLENCE

FBK stands out as one of Italy’s leading research institutions

international
network

National and international
companies and universities

learning opportunities

Explore a world of learning
at FBK

Discover Trento

One of the most Italy’s
livable city

Join FBK

A truly international
community