The rise of multimodal large language models (MLLMs) is transforming language, speech, and vision technologies, enabling unprecedented capabilities in translation, summarization, and content generation more broadly. This PhD project will focus on personalization, exploring how models can adapt to individual users’ preferences, context, and communication style. Research directions include adaptive speech-to-speech translation, context-aware description generation, text simplification, and user preference modeling. By integrating these approaches into MLLMs, the project aims to create more natural, context-sensitive, and user-centric multilingual experiences, pushing the boundaries of how AI can serve people at the individual level.