Use of administrative data or clinical databases as predictors of risk of death in hospital: comparison of models

BMJ 2007; 334 doi: 10.1136/bmj.39168.496366.55 (Published 17 May 2007)
Cite this as: BMJ 2007;334:1044
  1. Paul Aylin, clinical senior lecturer1,
  2. Alex Bottle, lecturer1,
  3. Azeem Majeed, professor of primary care and social medicine2
  1. 1Dr Foster Unit, Imperial College London, London EC1A 9LA
  2. 2Department of Primary Care and Social Medicine, Imperial College London
  1. Correspondence to: P Aylin p.aylin{at}imperial.ac.uk
  • Accepted 23 February 2007

Abstract

Objective To compare risk prediction models for death in hospital based on an administrative database with published results based on data derived from three national clinical databases: the national cardiac surgical database, the national vascular database and the colorectal cancer study.

Design Analysis of inpatient hospital episode statistics. Predictive model developed using multiple logistic regression.

Setting NHS hospital trusts in England.

Patients All patients admitted to an NHS hospital within England for isolated coronary artery bypass graft (CABG), repair of abdominal aortic aneurysm, and colorectal excision for cancer from 1996-7 to 2003-4.

Main outcome measures Deaths in hospital. Performance of models assessed with receiver operating characteristic (ROC) curve scores measuring discrimination (<0.7=poor, 0.7-0.8=reasonable, >0.8=good) and both Hosmer-Lemeshow statistics and standardised residuals measuring goodness of fit.

Results During the study period 152 523 cases of isolated CABG with 3247 deaths in hospital (2.1%), 12 781 repairs of ruptured abdominal aortic aneurysm (5987 deaths, 46.8%), 31 705 repairs of unruptured abdominal aortic aneurysm (3246 deaths, 10.2%), and 144 370 colorectal resections for cancer (10 424 deaths, 7.2%) were recorded. The power of the complex predictive model was comparable with that of models based on clinical datasets with ROC curve scores of 0.77 (v 0.78 from clinical database) for isolated CABG, 0.66 (v 0.65) and 0.74 (v 0.70) for repairs of ruptured and unruptured abdominal aortic aneurysm, respectively, and 0.80 (v 0.78) for colorectal excision for cancer. Calibration plots generally showed good agreement between observed and predicted mortality.

Conclusions Routinely collected administrative data can be used to predict risk with similar discrimination to clinical databases. The creative use of such data to adjust for case mix would be useful for monitoring healthcare performance and could usefully complement clinical databases. Further work on other procedures and diagnoses could result in a suite of models for performance adjusted for case mix for a range of specialties and procedures.

Footnotes

  • Contributors: PA and AB were involved in the original research question and extracted and analysed data. All authors drafted the paper and contributed comments on drafts. PA is the guarantor.

  • Funding: Dr Foster Intelligence.

  • Competing interests: The Dr Foster Unit at Imperial College is funded by a grant from Dr Foster Intelligence (an independent health service research organisation).

  • Ethical approval: We have approval under Section 60 granted by the Patient Information Advisory Group (PIAG) to hold patient identifiable data and analyse them for research purposes. We also have approval from St Mary's local research ethics committee.

  • Accepted 23 February 2007

THIS WEEK'S POLL