southampton

About the project

This project explores the integration of calculus of variations with verification-guided representation learning for reinforcement learning.

We aim to develop optimal, physically consistent representations that enhance sample efficiency, stabilize learning, and facilitate formal verification. This research addresses critical needs in safe and reliable AI, with applications in robotics, autonomous systems, and beyond.

As reinforcement learning (RL) applications expand into safety-critical domains, the need for models that are both high-performing and verifiable has become paramount.

This project proposes a novel approach to representation learning that combines calculus of variations with verification-guided learning to address this challenge. By leveraging the calculus of variations, we aim to optimize the representations learned by RL agents to be not only effective but also compliant with underlying physical laws and constraints, thereby enhancing sample efficiency and stability in learning. This principled approach ensures that representations capture the essential dynamics of the environment, allowing RL agents to generalize better with less data and to learn more robust policies.

Additionally, the project incorporates formal verification techniques into the representation learning process, guiding the agent to create representations that are inherently verifiable. This ensures that the resulting policies are safer and more reliable, which is critical in applications such as autonomous vehicles, robotics, and industrial control. Students working on this project will explore cutting-edge intersections between machine learning, optimization, and formal methods, developing a framework that addresses both performance and verifiability.

Potential research directions include multi-dimensional calculus of variations for high-dimensional state spaces, efficient integration of physical laws as constraints in representation learning, and the application of verification feedback to refine learned representations.

This project offers PhD students the opportunity to contribute to foundational advancements in RL and to push the boundaries of safe, efficient, and explainable AI.

Potential supervisors

Lead supervisor

Dr Chao Huang

Associate Professor

Research interests

Reinforcement Learning
Formal Methods
Design Automation for Cyber Physical Systems

Entry requirements

You must have a UK 2:1 honours degree, or its international equivalent.

You need to have a strong foundation in:

the calculus of variations, optimization, and reinforcement learning (RL),
proficiency in Python
deep learning frameworks like PyTorch or TensorFlow

Familiarity with formal verification or control theory is highly desirable, especially for safety and verification aspects.

Fees and funding

We offer a range of funding opportunities for both UK and international students. Horizon Europe fee waivers automatically cover the difference between overseas and UK fees for qualifying students.

Competition-based Presidential Bursaries from the University cover the difference between overseas and UK fees for top-ranked applicants.

Competition-based studentships offered by our schools typically cover UK-level tuition fees and a stipend for living costs (minimum of £19,237 in 2024-25) for top-ranked applicants.

Funding will be awarded on a rolling basis, so apply early for the best opportunity to be considered.

How to apply

Apply now

You need to:

choose programme type (Research), 2025/26, Faculty of Engineering and Physical Sciences
select Full time or Part time
choose the relevant PhD in Computer Science
add name of the supervisor Dr. Chao Huang in section 2 of the application form

Applications should include:

personal statement
your CV (resumé)
2 academic references
degree transcripts to date

Contact us

Faculty of Engineering and Physical Sciences

If you have a general question, email our doctoral college ([email protected]).

Project leader

For an initial conversation, please email
Dr. Chao Huang: [email protected]

Postgraduate research project

Optimal and verifiable representation learning for safe and efficient reinforcement learning systems