Honest validation
Students learn why apparent performance is often too optimistic when models are assessed on the same data used to build them.
Module 3
This module focuses on how medical machine learning models should be assessed: train/test splits, resampling, cross-validation, bootstrap validation, classification metrics, calibration, clinical usefulness, leakage and reproducibility.
Module aim
The purpose of this module is to show that performance is not a single number. A useful clinical model must be validated, calibrated, reproducible and relevant to the decision setting.
5
Lessons
R
Coding labs
Preparing
Module status
Validation
Core focus
Students learn why apparent performance is often too optimistic when models are assessed on the same data used to build them.
The module explains how discrimination, sensitivity, specificity, calibration and usefulness answer different clinical questions.
Evaluation is treated as a transparent workflow involving resampling, leakage checks, reporting discipline and reproducibility.
Module lessons
Each lesson adds a different evaluation layer: data splitting, resampling, classification metrics, calibration, clinical usefulness, leakage prevention and reproducible reporting.
3.1
Lesson 3.1
Learn why model evaluation needs data separation, repeated resampling and a clear distinction between model fitting and model assessment.
Open lesson →
3.2
Lesson 3.2
Understand cross-validation and bootstrap validation as tools for estimating model performance more honestly in limited health datasets.
Open lesson →
3.3
Lesson 3.3
Study classification performance using sensitivity, specificity, predictive values, ROC curves and AUC in clinical prediction settings.
Open lesson →
3.4
Lesson 3.4
Learn why good discrimination is not enough, and how calibration and clinical usefulness shape responsible model assessment.
Open lesson →
3.5
Lesson 3.5
Bring validation together by recognising bias, leakage, reproducibility problems and reporting weaknesses in medical machine learning.
Open lesson →
Learning route
Module 4 assumes students understand how to evaluate models honestly. Regularisation, forests and boosting are only useful when their performance is judged with careful validation.
Continue to Module 4 →Course pathway
Use the course homepage to review modules, scripts, datasets, case studies and the full clinical prediction learning pathway.
Back to course →