Jump to: Page Content, Site Navigation, Site Search,
You are seeing this message because your web browser does not support basic web standards. Find out more about why this message is appearing and what you can do to make your experience on this site better.
Babu Kunadian, research fellow, Joel Dunning, specialist registrar in cardiothoracic surgery, Anthony P Roberts, clinical effectiveness specialist adviser, Robert Morley, clinical audit lead, Darragh Twomey, clinical teaching fellow, James A Hall, consultant cardiologist, Andrew G C Sutton, consultant cardiologist, Robert A Wright, consultant cardiologist, Douglas F Muir, consultant cardiologist, Mark A de Belder, consultant cardiologist
1 Department of Cardiology, James Cook University Hospital, Middlesbrough TS4 3BW
Correspondence to: M A de Belder mark.debelder{at}stees.nhs.uk
Design Analysis of prospectively collected data.
Setting Tertiary centre NHS hospital in the north east of England.
Participants Five cardiologists carrying out percutaneous coronary interventions between January 2003 and December 2006.
Main outcome measures In-hospital major adverse cardiovascular and cerebrovascular events (in-hospital death, Q wave myocardial infarction, emergency coronary artery bypass graft surgery, and cerebrovascular accident) analysed against the logistic north west quality improvement programme predicted risk, for each operator. Results are displayed as funnel plots summarising overall performance for each operator and cumulative funnel plots for an individual operators performance on a case series basis.
Results The funnel plots for 5198 patients undergoing percutaneous coronary interventions showed an average observed rate for major adverse cardiovascular and cerebrovascular events of 1.96% overall. This was below the predicted risk of 2.06% by the logistic north west quality improvement programme risk score. Rates of in-hospital major adverse cardiovascular and cerebrovascular events for all operators were within the 3
upper control limit of 2.75% and 2
upper warning limit of 2.49%.
Conclusion The overall in-hospital major adverse cardiovascular and cerebrovascular events rates were under the predicted event rate. In-hospital rates after percutaneous coronary intervention procedure can be monitored successfully using funnel and cumulative funnel plots with 3
control limits to display and publish each operators outcomes. The upper warning limit (2
control limit) could be used for internal monitoring. The main advantage of these charts is their transparency, as they show observed and predicted events separately. By this approach individual operators can monitor their own performance, using the predicted risk for their patients but in a way that is compatible with benchmarking to colleagues, encapsulated by the funnel plot. This methodology is applicable regardless of variations in individual operator case volume and case mix.
Advancement in catheter technology and improved operator techniques have resulted in the continuous growth of percutaneous coronary interventions.5 6 Outcome analysis and quality control are important in interventional cardiology.7 8 9 10 11 Benchmarking raw outcome data is difficult and is complicated by variation in case mix, referral patterns, procedural techniques, and adjunctive therapy. Need is pressing for workable risk models for patients undergoing percutaneous coronary interventions. The north west quality improvement programme has provided a prediction model for major adverse cardiac events after percutaneous coronary intervention that has been subject to both internal and external validation.12 13
The New York State department of health collects and reports the number of interventions and patient mortality statistics for all cardiologists who carry out percutaneous coronary interventions.14 This allows the public to make better informed decisions when choosing a physician or hospital, and provides physicians and healthcare organisations with comparative data that will serve to improve the quality of health care. Although comparative performance of UK cardiac surgeons has been published in the public arena,15 operator specific data for percutaneous coronary intervention are not yet available.
The task force of the American College of Cardiology and American Heart Association has recently published recommendations for standards to assess operator proficiency and institutional programme quality.16 We address these recommendations and provide a method to implement them in a UK setting. We used the north west quality improvement programme risk model and then used cumulative funnels and funnel plots to display the observed major adverse cardiovascular and cerebrovascular events against the predicted rate of these events. Comparative performance of UK cardiac surgeons has been disseminated using these plots.17 In cardiology, funnel plots have been used to interpret the dataset of the myocardial infarction national audit project (a UK cardiology dataset that provides specific performance tables).18 We aimed to show that operator specific outcomes after percutaneous coronary intervention can be monitored successfully using funnel plots and cumulative funnel plots.
Study outcomes
Our outcome of interest was major adverse cardiovascular and cerebrovascular events, defined as one or more of: in-hospital death, Q wave myocardial infarction, emergency coronary artery bypass graft surgery, and cerebrovascular accident. We defined Q wave myocardial infarction as a new pathological Q wave, with creatine kinase levels more than twice the laboratory upper limit of normal with increased creatine kinase MB fraction or troponin T. We did not include non-Q wave myocardial infarction because of difficulties in knowing whether minor changes in enzyme levels after procedures in the setting of acute coronary syndromes reflect the tail end of a preprocedural event or the procedure itself—this differentiation requires knowledge of preprocedural changes to enzyme levels but is not required in the national dataset. We considered Q wave myocardial infarction occurring in the context of angioplasty therapy for acute ST elevation myocardial infarction to be an outcome of the original coronary event and not a complication of percutaneous coronary intervention.
Calculation of expected major adverse cardiovascular and cerebrovascular events
The north west quality improvement programme model included 9914 consecutive patients undergoing percutaneous coronary intervention between 1 August 2001 and 31 December 2003 in the north west of England. The model was internally validated using a dataset consisting of 1786 patients and has been externally validated on our dataset.12 13 The model calculates a patients probability of in-hospital major adverse cardiovascular and cerebrovascular events on the basis of several risk factors: age 70-79 years, age
80 years, female sex, cerebrovascular disease, cardiogenic shock, urgent percutaneous coronary intervention, emergency percutaneous coronary intervention, left main stem lesion treated, and graft lesion treated. The calculation of predicted risk of major adverse cardiovascular and cerebrovascular events as a percentage is P=odds/(1+odds)x100, where odds=exp(–5.4959+[0.7048xage70-79 years]+[1.0106xage
80 years]+[0.4586xfemale sex]+[0.8618xcerebrovascular disease]+[3.2636xcardiogenic shock]+[0.4788xurgent percutaneous coronary intervention]+[1.3625xemergency percutaneous coronary intervention]+[1.6502xleft main stem lesion treated]+[0.9101xgraft lesion treated].
Statistical analysis
Funnel plots are based on statistical process control,20 21 22 23 24 a set of methods for ongoing improvement of systems, processes, and outcomes. We present two methods of display: funnel plots summarising overall performance and cumulative funnel plots of an individual operators performance on a case series basis.
The mean predicted major adverse cardiovascular and cerebrovascular events for all cases is displayed as a percentage. We created a funnel plot using upper and lower control limits calculated at 3
(calculated similarly to 99.8% confidence intervals, although control limits are prediction limits not precision limits) around the mean predicted major adverse cardiovascular and cerebrovascular events, using an exact binomial method.21 The mean of the predicted event rate is recalculated as each additional case is added to the series. Similarly, the observed event rate is recalculated as each case is added to the series, creating a cumulative mean. We also calculated an upper warning limit (calculated similarly to 95% confidence intervals) at 2
above the predicted mean. Each operator is then displayed as a scatter point, showing the observed rate for major adverse cardiovascular and cerebrovascular events and the volume of cases for each operator, against the funnel. The control limits show how much variation to expect around the predicted event rate for a given volume of cases. If the observed event rate for any operator varies more than this, a special cause is implied. Special cause variation (also known as assignable cause variation) is the fluctuation that is caused by unpredictable factors resulting in a non-random distribution of the data. Unlike common cause variation (unassignable cause variation), special causes of variation can be eliminated by reacting to individual variations and have to be removed before a process can be improved by tackling sources of common cause variation. The more cases that are included, the more precisely the predicted event rate "constrains" the observed event rate. For this reason the control limits become narrower when the number of cases done by each operator increases.
Cumulative funnel plots are produced to display the performance of each individual operator and the whole unit on a case series basis. In cumulative funnel plots the funnel shape is produced from the cumulative mean predicted major adverse cardiovascular and cerebrovascular events—that is, it is calculated successively adding each new case. The observed events rate is also shown cumulatively, with each observed line for events beginning with 0%. The first event in the series produces a rise to the percentage of observed events and is followed by a falling line as the number of cases increases without an event, until the next event in the series. This produces a distinctive saw-tooth pattern, making the number of events and cases transparent while allowing comparison of rates. The predicted risk for patients in the series is also clearly displayed and makes it possible to calculate the number of events that would be needed to cross the control limits. The right hand end of the trace for an individual operators cumulative observed event rate indicates where they will be positioned on the funnel plot.
The risk adjusted major adverse cardiovascular and cerebrovascular event rate is a comparison of observed to expected numbers of events as a ratio, with confidence intervals, to reflect errors due to random sampling. A risk adjusted rate is the event rate that would be expected for the operators had their patients been identical to the unit wide mix.25
We also assessed whether the predicted risk of patients entering the unit is changing, by plotting the average logistic north west quality improvement programme score monthly and comparing this with the overall complication rates for these same periods.
|
|
(equivalent to the 99.8% confidence interval) upper control limit of 2.75% and the 2
upper warning limit of 2.49%. For example, in an individual operators series of 1178 percutaneous coronary interventions (operator E, fig 2
control limits would be crossed if there were fewer than 10 events (0.81%, the lower control limit) or greater than 39 in-hospital events (3.32%, the upper control limit). The upper warning limit would be crossed if more than 32 in-hospital events occurred.
|
|
|
|
|
Strengths and weaknesses
The dataset needed to calculate the predicted risk of major adverse cardiovascular and cerebrovascular events according to the north west quality improvement programme is small and feasible to collect. We have shown that it is a valid model to use for predicting events, even with modern percutaneous coronary intervention practice (for example, high rate of stenting, use of thienopyridines and glycoprotein IIb/IIIa inhibitors). It is proposed that the north west quality improvement programme model be used to calculate the risk adjusted outcome nationally using the UK national percutaneous coronary intervention audit dataset coordinated by the British Cardiovascular Intervention Society.19
The data collection exercise by the British Cardiovascular Intervention Society provides prospectively collected data on which different methods of analysis can be tested. Quality assessment can then be done both at the level of each unit and at the level of individual doctors across the country for all the hospitals participating in the societys registry. This will allow institutions to measure risk adjusted outcomes and to compare them with national benchmarks for improving quality of care. One advantage of the model is that it does not depend on the case volume of the individual operator. When a difference in case mix, referral patterns, procedural techniques, or adjunctive therapy is sufficiently dominant to create a special cause in the funnel plot for a unit, the individual operators cumulative funnel will show whether the operator is within the limits for the predicted risk in their personal case series.
Comparison with other studies
Since 1994 the New York State department of health has collected and periodically published observed and risk adjusted patient mortality rates for all the interventional cardiologists practising coronary angioplasty in the state.14 The New York data, possibly because of exhaustive validation, are about three years out of date by the time they are published. Public reporting of operator specific outcome data may influence doctors to withhold procedures from patients at higher risk, even when a doctor believes that the procedure might be beneficial. The form of analysis depicted in our study should mitigate against such behaviour in that it provides risk adjusted data. Using such a risk adjusted approach, one study has shown how the results of cardiac surgery have improved over time despite a higher risk patient profile, providing evidence that publication of individual results does not necessarily lead to surgeons avoiding high risk patients and, moreover, participation in the quality programme has been associated with better outcomes.26 We have presented our data in funnel plots and cumulative funnel plots summarising overall and individual performance on a case series basis. The routine publication of these funnel plots on performance attainments will encourage reflection, analysis of processes, and subsequent improvements in delivery of health care.
The expected low complication rate for percutaneous coronary interventions presents a statistical power problem when attempting to compare results of individual operators with different case volumes. For this reason we used a combined end point of clinically important outcomes. With a large enough database the methodology could be used for individual components of major adverse cardiovascular and cerebrovascular events. Once the end point to be used has been determined, cumulative funnel plots are completely transparent in showing the observed events and the number of cases and events included. They have acceptable sensitivity to deteriorating performance if the predicted events are not previously excessively above that observed. They can be applied to operators with both a high volume and a low volume of cases. The temporal display allows an analysis of when performance might deviate from an acceptable level, which itself allows for appropriate analysis and corrective action. A study using similar statistical monitoring techniques, has shown how an overall impression that there was a problem with excessive bleeding rates within a cardiothoracic unit was translated into a statistical demonstration that the problem was real and allowed for evaluation of possible reasons and corrective action.27
Sequential probability ratio tests28 have been suggested for monitoring mortality related to cardiac surgery. These plots (and variants of them, such as cumulative sum techniques) have the best sensitivity to changing performance; however, they have two main disadvantages. The first is their lack of a direct, transparent display of observed and expected mortality, combining the two as they do into a single trace. Secondly, they involve several arbitrary assumptions—specifically they are designed to detect a doubling (or other multiple) of the risk and require
and β (effectively the acceptable true and false positive rates) to be chosen in advance. The methodology described here is visually easy to understand, and its use (especially if fed back to the units by the national society) will encourage individual departments to continuously monitor their results and act accordingly.
Implications
This study will allow internal and external monitoring of performance and will help inform the public about outcome data for individual interventional cardiologists. The analysis for our unit shows good overall results, and predicted mean complication rates for individual operators in this study fell below the 3
limit (upper control) and 2
limit (upper warning). The results compare favourably with national and international data. We use the funnel plots on our hospital trust website and use cumulative funnels internally to monitor observed major adverse cardiovascular and cerebrovascular events against predicted events for all our individual operators carrying out percutaneous coronary interventions. Patients in north east England will now be able to scrutinise an individual operators outcomes, and this should provide reassurance about overall quality. Although such plots could theoretically be used to aid in patient choice of doctors, our own data suggest that a patient could expect equivalent short term outcomes from any of the operators. This type of analysis should be updated regularly, to give ongoing reassurance to patients. This also has a potential role in professional revalidation and could be fed into systems of doctor appraisal. This may require that the process should be made even more robust by developing methods for external validation of data quality.
If such a monitoring system showed that the results for an individual operator (or institution) fell outside the warning limits, this should trigger a response. Results outside the upper warning could initiate an internal review and results outside the upper control limit could trigger an external audit to determine opportunities to improve quality of care. Such reviews should first determine the completeness and accuracy of data collection and establish the case mix in comparison with the case mix overall for the unit and in the national dataset. Risk prediction models are not perfect, however, and for specific groups of patients can either overestimate or underestimate risk.13 Moreover, individual operators or institutions could become outliers just by chance. One check would be to reassess the data using a different, but also validated, risk model. Yearly monitoring of the previous three years data may also provide reassurances. The funnel plots provide summary data for a predetermined period (or for a predetermined number of cases) whereas the cumulative funnel plots can be used continuously, thus reflecting contemporary activity. Should there still be a concern about results after evaluating data quality and case mix then changes aimed at improving outcomes could be proposed. Such changes might include a review of case selection or techniques, "buddying-up" with colleagues in specific cases, or retraining. It has been shown that participation in such audit processes improves the quality of data collection, but, more importantly, it can improve patient care without resulting in risk averse behaviour on the part of individual operators or institutions.29
If this model is universally adopted by all centres carrying out percutaneous coronary interventions, this would allow a well structured benchmarking system in which the public could have full confidence. A system that could activate internal and external reviews would provide an appropriate response to deal with variations in practice. Equally, for most clinicians, attainment of results within these confidence limits would provide them with a mark of excellence and a track record of results recognised as being equivalent to the high standards seen nationally. To be accepted by the profession, such a system must persuade those undertaking high risk cases that their case mix is catered for by the risk model and that the system is primarily about quality improvement rather than seeking just to identify poor operators. We recommend that this methodology, which accommodates these concerns, be considered by the national society. Given that nearly all units in the UK collect data on their patients through the central cardiac audit database,19 this service could be provided to all units through its central administration, thus avoiding replication of effort.
We believe that cumulative funnel plots have acceptable sensitivity to poor performance using 3
(the widely accepted best practice in statistical process control) rather than 2
or lower (for example, 90% limits). Using a tighter upper control limit than 3
would increase the risk of a false positive and this is not justified if this method is to be applied to a large number of operators across the UK where an unjust examination of an individual operator would become too probable. These 2
warning limits and 3
upper control limits have been successfully implemented by the Society for Cardiothoracic Surgery, allowing widespread acceptance of the national publication of risk adjusted mortality in adult cardiac surgery.4
Limitations
As with all performance measurement and analysis methods, funnel charts produced using statistical process control may produce both false negative results and false positive results. Performing within limits does not guarantee that a unit may not be underperforming although this may be too slight to be detected or masked by other factors. Being outside a control limit is not always abnormal (special cause) either, even when set at 3
, equivalent to 99.8% confidence intervals. Despite such false negative results and false positive results, detecting a "special cause" encourages further investigation at the level of the individual operator. Similarly, funnel plots help prevent investigation of an outcome resulting from common cause variation as if it were a special cause phenomenon. Such plots are more meaningful and useful than league tables.
It is possible that outcomes after percutaneous coronary intervention improve and the risk model may therefore become less accurate over time. With a large central database that combines data from a large number of units, the risk model could be periodically re-evaluated so that the concept of benchmarking is not lost.
It would be important to interpret a new operators early experience appropriately. The concept of learning curves should be acknowledged. This is implicit in the methodology described because with low numbers of procedures the confidence intervals are wide. Nevertheless, should the results fall outside the control limits, this should trigger some form of response, such as buddying-up with a more senior operator for a predefined number of cases.
Conclusion
Our operators in-hospital major adverse cardiovascular and cerebrovascular events rates were lower than the predicted event rate. In-hospital rates after percutaneous coronary intervention can be monitored successfully using funnel plots and cumulative funnel plots with 3
control limits to display and publish each operators outcomes. The upper warning limit (2
control limit) could be used for internal monitoring. The main advantage of these charts is their transparency, as they show observed and predicted events separately. By this approach individual operators can monitor their own performance, using the predicted risk for their patients in a way that allows benchmarking against colleagues. The methodology allows scrutiny of outcomes regardless of variations in individual operator case volume.
|
Competing interests: BK, JAH, AGCS, RAW, DFM, and MdeB have received travel grants from manufacturers of coronary stents and percutaneous coronary intervention related pharmaceutical companies. MdeB has sat on advisory boards for stent manufacturers and percutaneous coronary intervention related pharmaceutical companies and has received research grants from a few stent manufacturers. As members or fellows of the Royal College of Physicians we have an interest in methods of revalidation and may be involved in standard setting. As, respectively, president of the British Cardiovascular Intervention Society and council member of the British Cardiovascular Society, MdeB and JAH will have a role in informing the debate about these issues. The authors do not believe that any of these declarations constitute a conflict of interest as regards this study.
Ethical approval: Not required.
Provenance and peer review: Not commissioned; externally peer reviewed.
What can you learn from this BMJ paper? Read Leanne Tite's Paper+