southampton

About the project

This project aims to explore a Multimodal Large Language Model framework that enables Social Robots to interpret interaction contexts from various modality inputs, such as vision, language or audio, and provide interactions to users through multiple communication channels, such as speech, gestures or images.

Social robots are rapidly integrating into many aspects of daily life, from education and healthcare to workplaces and personal settings. All these practical applications require that robots collaborate effectively with humans in shared environments, where social interaction is essential. For socially assistive robots, it is crucial to be context-aware, enabling them to interact and deliver services that align with users' customs and needs, much like a human would.

This project aims to develop a multimodal input-output framework based on large language model, and specifically designed for human-robot interaction. The framework will empower robots to perceive multiple social signals collected from environments and users during daily interaction, to form a deep understanding of the interaction context, and respond appropriately to users via multiple communication channels.

You will have the opportunity to collaborate with other researchers in the Agent, Interaction, and Complexity group, and the Responsible AI project team, contributing to high-impact journals and conferences. You will have access to high-performance computers and multiple robot platforms to support your research.

Potential supervisors

Lead supervisor

Dr Tan Viet Tuyen Nguyen

New Frontiers Fellow

Research interests

Human-centered Artificial Intelligence
Social Human-Robot Interaction
Multimodal Perception and Interaction

Supervisors

Professor Gopal Ramchurn

Professor of Artificial Intelligence

Entry requirements

You must have a UK 2:1 honours degree, or its international equivalent.

You must have solid programming skills in Python and machine learning, and a strong interest in the intersection of machine learning and robotics. Prior experience with large language models is desirable.

Fees and funding

We offer a range of funding opportunities for both UK and international students. Horizon Europe fee waivers automatically cover the difference between overseas and UK fees for qualifying students.

Competition-based Presidential Bursaries from the University cover the difference between overseas and UK fees for top-ranked applicants.

Competition-based studentships offered by our schools typically cover UK-level tuition fees and a stipend for living costs (minimum of £19,237 in 2024-25) for top-ranked applicants.

Funding will be awarded on a rolling basis, so apply early for the best opportunity to be considered.

How to apply

Apply now

You need to:

choose programme type (Research), 2025/26, Faculty of Engineering and Physical Sciences
select Full time or Part time
choose the relevant PhD in Computer Science
add name of the supervisor in section 2

Applications should include:

personal statement
your CV (resumé)
2 academic references
degree transcripts to date

Contact us

Faculty of Engineering and Physical Sciences

If you have a general question, email our doctoral college ([email protected]).

Project leader

For an initial conversation, email Dr Tan Viet Tuyen Nguyen ([email protected]).

Postgraduate research project

Multimodal large language model in human-robot interaction