Health Data Research UK Doctoral Immersion Weeks
Course Item | Detail |
---|---|
Code | HDR002 |
Title | Structural causal modelling and inference |
Leader | Max Little |
Duration | 3 days |
Most health data researchers appreciate that “association is not causation”, but in practice make heavy use of classical statistical techniques and machine learning methods without considering that these are blind to causal effects. This has serious implications in practice: for instance, a predictive algorithm which attempts to rank patients by their risk of being diagnosed with some disease in the future, is misleading if it merely picks up strong but spurious cause-effect signals relating the specific way in which blood tests are ordered by hospital staff, to eventual diagnosis. Therefore, causal modelling and inference are critical to the design and implementation of effective and reliable predictive modelling.
This workshop aims to give the participants a thorough and rigorous advanced introduction to the key modern theory of causality, structural causal modelling, and causal inference. By the end of the course, participants should be able to have confidence in applying the concepts of causal modelling and inference, and be able to use and/or implement causal inference algorithms, for their own health data science applications.
The training will consist of technical lectures, directed reading group discussions, practical group tasks and enrichment such as guest lectures.
THIS IS NOT AN NON-TECHNICAL INTRODUCTORY COURSE
Participants are expected to have sufficient mathematical and computational as outlined below. Participants will be required to have undertaken the pre-course reading. If you are interested in a non-technical introduction to this area, you may wish to consider alternative providers such as the Leeds Causal Inference Training Course.
In order to take part in the group tasks, students will be expected to have access to a laptop with the Python language installed. Familiarity with Python is an advantage, but experience in programming in any imperative language (e.g. R, C++, Java) will likely suffice for the course.
Day 1: Concepts and tools
Time | Description |
---|---|
9.30-10.20 | Lecture 1: Structural causal models and the Pearl hierarchy |
10.30-11.20 | Directed reading group 1: Bareinboim et al., 2022 |
11.30-12.30 | Lecture 2: Interventional distributions and the identifiability problem |
12.30-13.50 | Lunch |
14.00-14.50 | Directed reading group 2: Shpitser, 2020 |
15.00-15.50 | Lecture 3: GRAPL: A computational Python library for nonparametric structural causal modelling, analysis and inference |
16.00-16.30 | Group task descriptions and discussion |
Day 2: Group tasks and guest lectures
Time | Description |
---|---|
9.30-10.50 | Group work |
11.00-11.50 | External Seminar [tbc] |
12.00-13.20 | Lunch |
13.30-14.50 | Group work |
15.00-15.30 | External Seminar [Dhurim Caqiki] |
15.30-16.30 | Group work |
Task A: Application: modelling and structural inference problem
Task B: Algorithm implementation: Identification of conditional interventional distributions (IDC algorithm, Shpitser et al.)
Task C: Algorithm implementation: Recovering an interventional distribution when only biased data is available (RC algorithm, Correa et al.)
Day 3: Task presentation and wrap-up
Time | Description |
---|---|
9.30-11.00 | Finalise task work |
11.00-12.30 | Presentations on results from each group |
12.30-13.00 | Summary |