- Douglas G Altman, professor of statistics in medicine1,
- Yvonne Vergouwe, assistant professor of clinical epidemiology2,
- Patrick Royston, senior statistician3,
- Karel G M Moons, professor of clinical epidemiology2
- 1Centre for Statistics in Medicine, University of Oxford, Oxford OX2 6UD
- 2Julius Centre for Health Sciences and Primary Care, University Medical Centre Utrecht, Utrecht, Netherlands
- 3MRC Clinical Trials Unit, London NW1 2DA
- Correspondence to: D G Altman doug.altman{at}csm.ox.ac.uk
- Accepted 6 October 2008
Prognostic models, like the one we developed in the previous article in this series,1 yield scores to enable the prediction of the risk of future events in individual patients or groups and the stratification of patients by these risks.2 A good model may allow the reasonably reliable classification of patients into risk groups with different prognoses. To show that a prognostic model is valuable, however, it is not sufficient to show that it successfully predicts outcome in the initial development data. We need evidence that the model performs well for other groups of patients.1 3 In this article, we discuss how to evaluate the performance of a prognostic model in new data.4 5
Summary points
Unvalidated models should not be used in clinical practice
When validating a prognostic model, calibration and discrimination should be evaluated
Validation should be done on a different data from that used to develop the model, preferably from patients in other centres
Models may not perform well in practice because of deficiencies in the development methods or because the new sample is too different from the original
Why prognostic models may not predict well
Various statistical or clinical factors may lead a prognostic model to perform poorly when applied to other patients.4 6 The model’s predictions may not be reproducible because of deficiencies in the design or modelling methods used in the study to derive the model, if the model was overfitted, or if an important predictor is absent from the model (which may be hard to know).1 Poor performance in new patients can also arise from differences between the setting of patients in the new and derivation …
Sign in
Personal subscribers, sign in here:
Article access
Article access for 1 day
Purchase this article for £20 $30 €32*
The PDF version can be downloaded as your personal record
CiteULike
Connotea
Del.icio.us
Digg
Facebook
Reddit
Technorati
Twitter
Stumbleupon
Rapid responses
Latest Responses
The decline in the breast cancer incidence is 1.2% and it is not significant.
Published 10 February 2012
'twas ever thus
Published 10 February 2012
The value of historic human remains
Published 10 February 2012
In Praise of British Literature
Published 10 February 2012
Is real shared decision making possible?
Published 10 February 2012
Most responses
Does anyone understand the government’s plan for the NHS? (17 responses)
Published 17 Jan 2012
Bad medicine: medical nutrition (15 responses)
Published 18 Jan 2012
Shared decision making: really putting patients at the centre of healthcare (7 responses)
Published 27 Jan 2012
Why legislation is necessary for my health reforms (7 responses)
Published 1 Feb 2012
Search for evidence goes on (5 responses)
Published 17 Jan 2012