Support
← Back to ML modules

Module 2

Supervised Learning for Clinical and Health Data

This module moves from foundational prediction thinking into supervised learning methods used for clinical and health-data problems: regression, logistic classification, k-nearest neighbours, decision trees and modelling pipelines.

Module aim

Learn core supervised models for health prediction.

The purpose of this module is to connect familiar statistical models and beginner machine learning methods to prediction tasks in medical and public health datasets.

5

Lessons

R

Coding labs

Preparing

Module status

Supervised

Learning focus

Regression for prediction

Students learn to treat regression models as tools for estimating outcomes and risks for new patients.

Clinical classification

The module introduces probability-based classification, thresholds and supervised learning for health outcomes.

Model workflow

Each method is placed inside a practical clinical-data pipeline rather than taught as an isolated algorithm.

Module lessons

Study the supervised learning sequence in order.

Each lesson introduces a different supervised learning idea, then connects it back to clinical prediction, model interpretation and responsible use with health data.

Learning route

Use Module 2 before studying validation and performance.

Module 3 assumes that students understand how supervised models are trained, how they generate predictions and why prediction pipelines must be built before performance is judged.

Continue to Module 3 →

Course pathway

Return to the full ML in Biostatistics course.

Use the course homepage to move between modules, case studies, datasets, scripts and the full medical machine learning pathway.

Back to course →