Latest News | Yau Research Group

The fundamentals of artificial intelligence and education in 2026

This blog is part of a short series of articles I will be writing to welcome the start of 2026. All opinions expressed are my own and do not represent the views of any individuals or organisations mentioned in the articles. Further, I am a UK-based academic scientist and these articles reflect the UK context.

Jan 3, 2026 12 min read AI, PhD

Building PhD Programmes

This blog is part of a short series of articles I will be writing to welcome the start of 2026. All opinions expressed are my own and do not represent the views of any individuals or organisations mentioned in the articles. Further, I am a UK-based academic scientist and these articles reflect the UK context.

Dec 31, 2025 12 min read PhD, Academia

Workshop presentations

The group was represented at a number of workshops over the weekend:

Multimodal Survival Analysis with Locally Deployable Large Language Models at the NeurIPS 2025 2nd Workshop on Multi-modal Foundation Models and Large Language Models for Life Sciences

Dec 7, 2025 1 min read Machine Learning, Health

Improving maternity care for all

We are very pleased to have contributed an article to the Association for Improvements in the Maternity Services (AIMS) journal entitled “Women, Pregnancy and Artificial Intelligence: Opportunities and Cautions in the Age of Digital Maternity” as part of the MuM-Predict consortium. The article demonstrates our commitment to connecting AI research to those it affects.

Dec 5, 2025 1 min read Machine Learning, Health

AAAI26: Hybrid restricted master problem for Boolean matrix factorisation

Congratulations to Ellen Visscher whose paper “Hybrid restricted master problem for Boolean matrix factorisation” was accepted for presentation at AAAI 2026. A preprint can be found on arXiv.

Nov 18, 2025 1 min read Machine Learning

An unofficial guide to NIHR-MRC Better Methods, Better Research applications

This guide reflects my own personal views and opinions and not those of the National Institute for Health and Care Research (NIHR), UK Research and Innovation (UKRI) or the Medical Research Council (MRC). For official information about the funding schemes discussed please refer to the NIHR website.

Nov 13, 2025 9 min read General

TMLR: Continual learning via probabilistic exchangeable sequence modelling

Congratulations to Hanwen Xing whose paper “Continual learning via probabilistic exchangeable sequence modelling” was accepted in Transactions on Machine Learning Research. A copy of the paper can be found on OpenReview.

Oct 19, 2025 1 min read Machine Learning

NeurIPS25: DoseSurv: Predicting Personalized Survival Outcomes under Continuous-Valued Treatments

Congratulations to Moritz Gogl whose paper “DoseSurv: Predicting Personalized Survival Outcomes under Continuous-Valued Treatments” was accepted for NeurIPS 2025. A copy of the paper can be found on the NeurIPS25 website.

Sep 17, 2025 1 min read Machine Learning, Health

SurvivEHR preprint now available

A preprint of work by Charles Gadd describing “SurvivEHR: a competing risks, time-to-event foundation model for multiple long-term conditions from primary care electronic health records” is now available on medRxiv.

“Multiple long-term conditions (MLTCs) or multimorbidity – the co-occurrence of multiple chronic conditions – presents a growing challenge for primary care. Current predictive models often target single outcomes and overlook the complexities of time-to-event risk in real-world, longitudinal health data. Here, we present SurvivEHR, a generative transformer-based foundation model trained on over 7.6 billion coded events from 23 million patients in UK primary care. SurvivEHR introduces a competing risk time-to-event pretraining objective that enables accurate forecasting of future diagnoses, investigations, medications, and mortality. We demonstrate that SurvivEHR achieves strong risk stratification performance, captures clinically meaningful trajectories, and outperforms benchmark survival models across multiple tasks. The model also transfers effectively to fine-tuned prognostic tasks, particularly in low-resource settings. By learning patient trajectories directly from routine health records, SurvivEHR offers a scalable and privacy-preserving approach for building generalisable clinical risk tools that address the complexity of MLTCs in primary care.”

Aug 7, 2025 1 min read Machine Learning, Electronic Health Records

GPerturb in Nature Communications

Congratulations to Hanwen Xing on having his paper “GPerturb: Gaussian process modelling of single-cell perturbation data” accepted in Nature Communications:

“Single-cell RNA sequencing and CRISPR screening enable high-throughput analysis of genetic perturbations at single-cell resolution. Understanding combinatorial perturbation effects is essential but challenging due to data sparsity and complex biological mechanisms. We present GPerturb, a Gaussian process-based sparse perturbation regression model designed to estimate gene-level perturbation effects. GPerturb employs an additive structure to separate signal from noise and captures sparse, interpretable effects from both discrete and continuous responses. It also provides uncertainty estimates for the presence and strength of perturbation effects on individual genes. We demonstrate the use of GPerturb on both simulated and real-world datasets, characterising its competitive performance with current state-of-the-art methods. Furthermore, the model reveals meaningful gene-perturbation interactions and identifies effects consistent with known biology. GPerturb offers a novel approach for uncovering complex dependencies between gene expression and perturbations and advancing our understanding of gene regulation at the single-cell level.”

Jul 11, 2025 1 min read Machine Learning, Cancer