Derivation and validation of QStroke score for predicting risk of ischaemic stroke in primary care and comparison with other risk scores: a prospective open cohort studyBMJ 2013; 346 doi: http://dx.doi.org/10.1136/bmj.f2573 (Published 02 May 2013) Cite this as: BMJ 2013;346:f2573
- Julia Hippisley-Cox, professor of clinical epidemiology and general practice1,
- Carol Coupland, associate professor and reader in medical statistics1,
- Peter Brindle, research and development programme director2
- 1Division of Primary Care, University Park, Nottingham NG2 7RD, UK
- 2Avon Primary Care Research Collaborative, Bristol Clinical Commissioning Group, Bristol BS1 3NX, UK
- Correspondence to: J Hippisley-Cox
- Accepted 22 March 2013
Objective To develop and validate a risk algorithm (QStroke) to estimate risk of stroke or transient ischaemic attack in patients without prior stroke or transient ischaemic attack at baseline; to compare (a) QStroke with CHADS2 and CHA2DS2VASc scores in patients with atrial fibrillation and (b) the performance of QStroke with the Framingham stroke score in the full population free of stroke or transient ischaemic attack.
Design Prospective open cohort study using routinely collected data from general practice during the study period 1 January 1998 to 1 August 2012.
Setting 451 general practices in England and Wales contributing to the national QResearch database to develop the algorithm and 225 different QResearch practices to validate the algorithm.
Participants 3.5 million patients aged 25-84 years with 24.8 million person years in the derivation cohort who experienced 77 578 stroke events. For the validation cohort, we identified 1.9 million patients aged 25-84 years with 12.7 million person years who experienced 38 404 stroke events. We excluded patients with a prior diagnosis of stroke or transient ischaemic attack and those prescribed oral anticoagulants at study entry.
Main outcome measures Incident diagnosis of stroke or transient ischaemic attack recorded in general practice records or linked death certificates during follow-up.
Risk factors Self assigned ethnicity, age, sex, smoking status, systolic blood pressure, ratio of total serum cholesterol to high density lipoprotein cholesterol concentrations, body mass index, family history of coronary heart disease in first degree relative under 60 years, Townsend deprivation score, treated hypertension, type 1 diabetes, type 2 diabetes, renal disease, rheumatoid arthritis, coronary heart disease, congestive cardiac failure, valvular heart disease, and atrial fibrillation
Results The QStroke algorithm explained 57% of the variation in women and 55% in men without a prior stroke. The D statistic for QStroke was 2.4 in women and 2.3 in men. QStroke had improved performance on all measures of discrimination and calibration compared with the Framingham score in patients without a prior stroke. Among patients with atrial fibrillation, levels of discrimination were lower, but QStroke had some improved performance on all measures of discrimination compared with CHADS2 and CHA2DS2VASc.
Conclusion QStroke provides a valid measure of absolute stroke risk in the general population of patients free of stroke or transient ischaemic attack as shown by its performance in a separate validation cohort. QStroke also shows some improvement on current risk scoring methods, CHADS2 and CHA2DS2VASc, for the subset of patients with atrial fibrillation for whom anticoagulation may be required. Further research is needed to evaluate the cost effectiveness of using these algorithms in primary care.
Cardiovascular disease is the leading cause of premature death and a major cause of disability in the UK.1 In 2008 the UK government announced a major new initiative to reduce vascular risk,2 building on guidelines from the National Institute for Health and Clinical Excellence (NICE) for lipid modification.3 Risk factors for cardiovascular disease are now well established, and validated tools such as QRISK24 5 6 7 8 which predict risk of cardiovascular disease are included in clinical guidelines. QRISK2 is part of the Quality and Outcomes Framework to reward UK general practices for using it. It has been incorporated into all four major UK GP clinical computer systems, which cover more than 90% of UK general practice. This integration has allowed automated cardiovascular risk assessment within the consultation to aid clinical decision making as well as risk stratification of GP practice populations to identify those patients who need further recall, assessment, and treatments.
QRISK2, which is updated annually, currently predicts risk of cardiovascular disease, defined as either coronary heart disease or stroke/transient ischaemic attack. It includes major risk factors such as age, sex, deprivation, ethnicity, smoking, systolic blood pressure, ratio of total serum cholesterol to high density lipoprotein cholesterol concentrations, body mass index, diabetes, rheumatoid arthritis, chronic renal disease, and atrial fibrillation. Atrial fibrillation is important since is the most common cardiac rhythm disorder and it particularly predisposes to stroke. The risk of stroke in patients with atrial fibrillation can be reduced by anticoagulation, but the evidence for the effectiveness of aspirin is less clear.9 Many patients with atrial fibrillation are not currently prescribed anticoagulation even though it is encouraged in the Quality and Outcomes Framework.10 This is probably because of difficulties in case identification and concerns regarding the potential adverse effects of traditional anticoagulants such as warfarin.
New oral anticoagulants (factor Xa inhibitors and direct thrombin inhibitors) have a similar efficacy at reducing stroke in people with atrial fibrillation as warfarin, and they have a wider therapeutic range without the need for repeated monitoring of international normalised ratio (INR).11 The most common tool for helping clinicians decide whether to initiate anticoagulation in people with atrial fibrillation is the CHADS2 score,12 which is a simple counting system which does not include many established risk factors and does not give an absolute risk of stroke. More recently, additional risk factors have been included in the CHA2DS2VASc score,13 which is better at identifying low risk individuals for whom the risks of anticoagulation might outweigh the benefits14 but doesn’t given an absolute risk of stroke either.
While increasing the number of risk factors generally improves risk stratification, it also makes a risk factor counting tool such as CHA2DS2VASc increasingly cumbersome to use in everyday clinical practice. Our aim was therefore to develop and validate a new risk prediction algorithm to predict the risk of stroke and transient ischaemic attack (QStroke) that could be automatically populated by data held in the clinical record and calculated in the same way as QRISK2, thereby providing a simpler practical alternative to existing scores. We wanted to develop an algorithm that quantifies absolute risk of stroke in a way which can be communicated to patients to aid decision making. In particular, we wished to compare its performance with existing scores in a subset of patients with atrial fibrillation for whom anticoagulation should be considered.
Study design and data source
We conducted a prospective cohort study of a large UK primary care population using a similar method to our original analysis for predicting cardiovascular risk (QRISK2).15 Version 34 of the QResearch database was used for this study (www.qresearch.org). This is a large validated primary care electronic database containing the health records of 13 million patients registered from 676 general practices using the Egton Medical Information System (EMIS) computer system.15 Practices and patients contained on the database are nationally representative16 and similar to those on other primary care databases using other clinical software systems.4 We included all QResearch practices in England and Wales once they had been using their current EMIS system for at least a year (to ensure completeness of recording of morbidity and prescribing data), randomly allocating two thirds of practices to the derivation dataset and one third to the validation dataset.
We identified an open cohort of patients aged 25-84 years at the study entry date, drawn from patients registered with eligible practices between 1 January 1998 and 1 Aug 2012. We used an open cohort design, rather than a closed design, as this allows patients to enter the population throughout the whole study period rather than require registration on 1 January 1998, thus better reflecting the realities of routine general practice. We excluded registered patients with a prior recorded diagnosis of stroke or transient ischaemic attack at baseline because of the difficulty of distinguishing a new stroke from a review of an existing stroke in GP records. We excluded patients without a Townsend deprivation score related to a valid postcode.
We also excluded patients who were taking anticoagulants (as defined by chapter 2.8.2 of the British National Formulary) at baseline to reflect the clinical application of the tool for assessing patients with atrial fibrillation who might be suitable for anticoagulation and to permit better comparison with existing risk scores such as CHADS212 and CHA2DS2VASc. The anticoagulants included warfarin, acenocoumarol, phenindione, dabigatran, rivaroxaban, and apixaban, though not all drugs were licensed at the start of the study period.
We did not exclude patients prescribed aspirin at baseline as aspirin is generally not considered to be effective at preventing stroke in patients with atrial fibrillation.9 17 We did not exclude incident users of anticoagulants during follow-up in order to ensure the baseline population was representative of patients who might subsequently be prescribed anticoagulants.
For each patient we determined an entry date to the cohort, which was the latest of the following dates: 25th birthday, date of registration with the practice plus one year, date on which the practice computer system was installed plus one year, or the beginning of the study period (1 January 1998). Patients were censored at the earliest date of a stroke or transient ischaemic attack, death, deregistration with the practice, last upload of computerised data, or the study end date (1 August 2012).
Stroke or transient ischaemic attack disease outcomes
The primary outcome measure of interest was the first recorded diagnosis of either stroke or transient ischemic attacks, excluding haemorrhagic stroke. The Read codes used for case identification on the GP computer record were those agreed and used in the Quality and Outcomes Framework for General Practice. The ICD-10 codes used for case identification on the Office for National Statistics death certificate were cerebral infarction (I63) and stroke not specified as haemorrhage or infarction (I64).
Risk factors for stroke
We included the variables which are already included in the current version of QRISK2 (2013) or included in the CHADS212 or CHA2DS2VASc scoring systems13 as shown in table 1⇓. The following variables were examined:
Self assigned ethnicity (white/not recorded, Indian, Pakistani, Bangladeshi, other Asian, black African, black Caribbean, Chinese, other (including mixed))
Age at study entry (years)
Smoking status (non-smoker, former smoker, light smoker (<10 cigarettes/day), moderate smoker (10-19 cigarettes/day), heavy smoker (≥20 cigarettes/day))
Systolic blood pressure20 (continuous)
Ratio of total serum cholesterol to high density lipoprotein (HDL) cholesterol20 (continuous)
Body mass index15 (continuous)
Family history of coronary disease in first degree relative <60 years old15 (yes/no)
Townsend deprivation score15 (output area level 2001 census data evaluated as a continuous variable)
Treated hypertension15 (diagnosis of hypertension and at least one current prescription of at least one antihypertensive agent)
Rheumatoid arthritis21 (yes/no)
Chronic renal disease22 (yes/no)
Type 1 diabetes18 (yes/no)
Type 2 diabetes20 (yes/no)
Coronary heart disease (yes/no)
Congestive cardiac failure (yes/no)
Valvular heart disease (yes/no).
We restricted all values of these variables to those recorded in the person’s electronic healthcare record before baseline, except for ethnicity, smoking status, systolic blood pressure, total serum cholesterol:HDL cholesterol ratio, and body mass index, where we used the values recorded closest to the study entry date and recorded before the patient had the outcome or was censored. We imputed missing values where necessary as described below.
Model derivation and development
As in previous analyses,5 we used the Cox proportional hazards model in the derivation dataset to estimate the coefficients and hazard ratios associated with each potential risk factor for the first ever recorded diagnosis of stroke or transient ischaemic attack for males and females separately. We used fractional polynomials to model non-linear risk relationships with age and body mass index where appropriate.23 We tested for interactions between each variable and age and included significant interactions in the final model where they improved model fit. Continuous variables were centred for analysis. Our main analyses used multiple imputation to replace missing values for systolic blood pressure, total cholesterol:HDL cholesterol ratio, smoking status, and body mass index. Our final model was fitted based on five multiply imputed datasets using Rubin’s rules to combine effect estimates and standard errors to allow for the uncertainty due to imputing missing data.24 We took the log of the hazard ratio for each variable from the final model and used these as weights for the new stroke risk equations. We combined these weights with the baseline survivor function evaluated at 10 years centred on the means of continuous risk factors to derive a risk equation for 10 years’ follow-up.
We conducted a sensitivity analysis in which patients with atrial fibrillation who were prescribed anticoagulants during follow-up were censored on the date of first prescription of anticoagulation. This is similar to the approach reported elsewhere.14 25
We tested the performance of the final model (QStroke) in the validation dataset in patients aged 25-84 years. We also compared QStroke with the Framingham stroke equation20 for performance, restricting both samples to patents aged 35-74 years since this is the age range for which the Framingham equation was developed. We calculated the 10 year estimated risk of stroke or transient ischaemic attack for each patient in the validation dataset using multiple imputation to replace missing values as in the derivation dataset. We calculated the mean predicted and observed stroke risk at 10 years15 and compared these by tenth of predicted risk for each score. The observed risk at 10 years was obtained using the 10 year Kaplan-Meier estimate. We calculated the receiver operating characteristics (ROC) statistic, D statistic (a measure of discrimination where higher values indicate better discrimination),26 and an R2 statistic (which is a measure of explained variation for survival data where higher values indicate more variation is explained).27
Validation of QStroke, CHADS2, and CHA2DS2VASc in patients with atrial fibrillation
For patients with atrial fibrillation at baseline in the validation dataset, we calculated QStroke, CHADS2, and CHA2DS2VASc scores. We calculated Harrell’s C statistic as a measure of discrimination in the subset of atrial fibrillation patients in the validation cohort, as this takes account of the censored nature of the data, unlike the ROC statistic. The C statistic was not calculated in the full validation cohort as the sample was too large and the test would not run. We also calculated the D statistic, and R2 value using the numeric value of each score (QStroke, CHADS2, and CHA2DS2VASc).
We defined a high CHADS2 or CHA2DS2VASc score as being ≥1, since this is the cut-off value used to initiate anticoagulants. Since there is no currently accepted threshold for classifying high risk of stroke based on an absolute risk estimate, we examined the distribution of predicted risk values for QStroke and calculated a series of centile values which would identify similar numbers of patients to those identified using CHADS2 or CHA2DS2VASc. We calculated the numbers and percentages of patients who would be reclassified using CHADS2 or CHA2DS2VASc compared with QStroke (using the centile threshold values identified above). We calculated the observed risk of stroke or transient ischaemic attack at 10 years for each group of reclassified patients using Kaplan Meier estimates.
There were at least 100 events per variable considered in the prediction modelling for the outcome in the derivation cohort.28 Analyses were conducted using STATA (version 12).
Practices and patients
Overall, 676 practices in England and Wales met our inclusion criteria and had been using their current computer system for at least one year. Of these, 451 were randomly assigned to the derivation dataset and 225 to the validation dataset. We identified 3 746 065 patients aged 25-84 years in the derivation cohort. Of these, 126 620 (3.4%) had missing Townsend scores, 47 425 (1.3%) had prior stroke or transient ischaemic attack, and 22 542 (0.6%) were prescribed oral anticoagulation at baseline, leaving 3 549 478 eligible patients. We identified 2 031 993 patients aged 25-84 years in the validation cohort. Of these 98 045 (4.8%) had missing Townsend scores, 24 463 (1.2%) had a prior diagnosis of stroke or transient ischaemic attack, and 12 317 (0.6%) were prescribed oral anticoagulation at baseline, leaving 1 897 168 eligible patients.
Table 2⇓ compares the characteristics of eligible patients in the derivation and validation cohorts. Although this validation cohort was drawn from an independent group of practices, the baseline characteristics were similar to those for the derivation cohort. For example, 50.6% of patients in the derivation cohort had ethnicity recorded compared with 50.8% in the validation cohort.
Of the 3 549 478 patients in the derivation cohort, 1 175 805 (33.1%) had at least 10 years of follow-up. Of the 1 897 168 patients in the validation cohort, 592 973 (31.3%) had at least 10 years of follow-up. The median follow-up was 7.0 years for the derivation cohort and 6.7 years for the validation cohort.
Incidence of stroke
Table 3⇓ shows the numbers of cases and incidence rates of stroke by age and sex in both cohorts and in the subset of patients with atrial fibrillation at baseline. Overall in the derivation cohort, we identified 77 578 incident strokes or transient ischaemic attacks arising from 24.8 million person years of observation. In the validation cohort we identified 38 404 incident cases of stroke arising from 12.7 million person years of observation. The incidence of stroke was similar in both men and women and in both the derivation and validation cohorts. As expected, the incidence rates were higher in the group of patients with atrial fibrillation.
Table 4⇓ shows the results of the Cox regression analysis for the final QStroke model. Details of the fractional polynomial terms for age and body mass index are shown in footnote of the table. The final model included interactions between age and the following variables in men and women: body mass index, systolic blood pressure, Townsend score, family history of coronary heart disease, coronary heart disease, congestive cardiac failure, treated hypertension, atrial fibrillation, type 1 diabetes, type 2 diabetes, valvular heart disease, and smoking status. There was also an interaction between age and atrial fibrillation in women but not men. The interactions with age indicated higher hazard ratios for these risk factors among younger patients compared with older patients, as with QRISK2.5 Increasing material deprivation (as measured by the Townsend score) was associated with increasing stroke risk. There was a “dose-response” relationship for smoking, with heavy smokers having higher risks than moderate smokers, light smokers, or former smokers. Women in the Pakistani and Bangladeshi groups had significantly increased risks of stroke compared with women who were white or who didn’t have ethnicity recorded. Chinese men and men in the “other ethnic groups” has significantly lower risks of stroke compared with men who were white or who didn’t have ethnicity recorded. All the other factors in the table were significantly associated with increased stroke risk in men and women.
Of the 15 371 patients with atrial fibrillation at baseline in the derivation cohort, 3195 (20.8%) were subsequently prescribed anticoagulation during follow-up. Of the 7689 patients with atrial fibrillation in the validation cohort, 1640 (21.3%) were subsequently prescribed anticoagulation during follow-up. The results of the additional model in which patients with atrial fibrillation prescribed anticoagulation during follow-up were censored when they started treatment showed very similar hazard ratios to the main models presented here (results available from the authors).
Calibration and discrimination of QStroke and Framingham stroke equations in the validation cohort
In the full validation cohort of people aged 25-84 years the QStroke algorithm explained 57% of the variation in women and 55% in men (table 5⇓). The D statistic was 2.4 in women and 2.3 in men. Table 5 also shows the corresponding results for the Framingham stroke equation in patients aged 35-74 years, with the comparison figures for QStroke in the same age range: the ROC values, R2, and D statistic values for QStroke were higher than those for Framingham. All the measures of performance were higher for women than men for both QStroke and Framingham.
Figure 1⇓ compares the predicted and observed risks of stroke or transient ischaemic attack at 10 years using QStroke across each tenth of predicted risk (1 representing the lowest risk and 10 the highest risk) and demonstrates that the model is generally well calibrated for all patients free of stroke or transient ischaemic attack at baseline. The corresponding results for Framingham indicate a degree of under-prediction (fig 1⇓).
Performance of QStroke, CHADS2, and CHA2DS2VASc in atrial fibrillation
Figure 2⇓ compares the predicted and observed risks of stroke or transient ischaemic attack at 10 years in the subset of patients in the validation cohort with atrial fibrillation at baseline and shows that the model is well calibrated in men but there is a degree of over-prediction in women at higher levels of predicted risk.
Table 5⇑ shows the validation statistics for QStroke, CHADS2 and CHA2DS2VASc scores for men and women in the subset of 7689 patients in the validation cohort with atrial fibrillation at baseline. Of these, 890 had a stroke or transient ischaemic attack during follow-up. The point estimates in all measures of calibration and discrimination were higher in QStroke than CHA2DS2VASc and CHADS2, although the 95% confidence intervals were wide. For example, the C statistic in men was 0.71 for QStroke, 0.67 for CHA2DS2VASc, and 0.63 for CHADS2. The R2 statistic in men was 24.1% for QStroke, 18.3% for CHA2DS2VASc, and 13.5% for CHADS2. The D statistic in men was 1.15 for QStroke, 0.97 for CHA2DS2VASc, and 0.81 for CHADS2. The validation statistics for men tended to be higher than for women with atrial fibrillation for each of the three scores.
Table 6⇓ shows the performance statistics for QStroke, CHADS2, and CHA2DS2VASc for the patients with atrial fibrillation at baseline in the validation cohort. For CHADS2, 63% of patients had a score of ≥1 so were classified as high risk. The sensitivity was 77% and the observed 10 year risk was 22.8%. We identified the top 63% of men and women with the highest QStroke scores in order to assemble a group of comparable size to those classified at high risk using CHADS2. This was equivalent to a 10 year risk threshold of 15%. Using this definition, QStroke had a sensitivity of 83%, and the observed 10 year risk in this group was 24.4%.
For CHA2DS2VASc, 85% of men and women had a score of ≥1 so were classified as high risk. The sensitivity at this threshold was 97%, and the observed 10 year risk was 20.2%. Similarly, we identified the top 85% of patients with the highest QStroke scores in order to assemble a group of comparable size to those classified at high risk using CHA2DS2VASc. This was equivalent to a 10 year risk threshold of 5.1%. Using this definition, QStroke had a sensitivity of 98%, and a 10 year observed risk of 20.5%
Table 7⇓ shows the reclassification statistics for a high CHADS2 score compared with a high QStroke 10 year risk score based on the top 63% of patients at highest risk. Of the 7689 patients with atrial fibrillation, 2195 (29%) were classified as low risk on both CHADS2 and QStroke. The observed 10 year risk of stroke in this group was 8%. There were 4187 (55%) patients who were high risk on both QStroke and CHADS2. The observed 10 year risk of stroke in this group was 25%. There were 657 (9%) patients who were high risk on QStroke and low risk on CHADS2: these patients had an observed 10 year risk of stroke of 19%. There were 650 patients (9%) who were low risk on QStroke but high risk on CHADS2: these patients had a 10 year absolute risk of 8%.
Table 7⇑ also shows the reclassification statistics for a high CHA2DS2VASc score compared with a high QStroke 10 year risk score based on the top 85% of patients at highest risk. Overall 4% of patients would be reclassified from low to high risk using QStroke compared with CHA2DS2VASc—the observed risk in these patients was 8%. Similarly 4% of patients would be reclassified from high to low risk using QStroke compared with CHA2DS2VASc, and these had an observed risk of 3%.
Summary of key findings
We have developed and validated QStroke, which is a new algorithm to identify patients at high risk of ischaemic stroke based on contemporaneous primary care data from the UK. Although QStroke has been designed to be used in all patients without a history of stroke or transient ischaemic attack, we envisage that its primary use will be in the subset of patients with atrial fibrillation for whom anticoagulation is considered.
QStroke incorporates established risk factors for stroke or transient ischaemic attack, many of which are absent from existing stroke risk assessment tools. QStroke includes age, sex, deprivation, ethnicity, body mass index, systolic blood pressure, total cholesterol:HDL cholesterol ratio, smoking status (five levels), diabetes type, congestive cardiac failure, coronary heart disease, rheumatoid arthritis, chronic kidney disease, treated hypertension, valvular heart disease, and family history of premature coronary heart disease.
Comparison with existing risk prediction scores
We tested the performance of QStroke in a separate cohort of patients without stroke or transient ischaemic attack and demonstrated good levels of discrimination and calibration and improved performance compared with the Framingham stroke risk score.
We also tested QStroke in the subset of patients with atrial fibrillation, for whom anticoagulation might be indicated. We compared the performance of QStroke with both CHADS2 and CHA2DS2VASc in patients without a prior stroke to ensure a fair comparison between the scores. We found some indication of improved performance on all measures of discrimination, although confidence intervals were wide. The comparison between QStroke and CHADS2 is important since the use of CHADS2 is currently incentivised as an indicator in the primary care Quality and Outcomes Framework and is used to determine which patients require anticoagulation. We also demonstrated some evidence of improved performance of QStroke compared with the newer CHA2DS2VASc, although this was less marked. None the less, we think the difference between performance of QStroke and CHA2DS2VASc could be important for those patients who are reclassified with QStroke and for whom advice on treatment with anticoagulation might change. For example, patients at high predicted risk on QStroke but classified as low risk on CHA2DS2VASc might require anticoagulation. Conversely, the patients classified as high risk with CHA2DS2VASc but low predicted risk with QStroke might be able to avoid unnecessary anticoagulation.
We have not provided definite comment on what threshold of absolute risk should be used for intervention, as that would include cost effectiveness analyses, which are outside the scope of this study. Ideally the review of a risk score is best judged around a risk threshold. This can be appropriate for other scores such as QRISK2 determining whether to intervene with primary prevention of cardiovascular disease. When determining whether to intervene with primary prevention of cardiovascular disease, the current 10 year risk threshold is 20%, making comparisons of different risk scores such as QRISK2 around this threshold appropriate. Stroke prevention in the population with atrial fibrillation has not yet reached this level of sophistication. The currently accepted risk scores, such as those of CHADS2, have not described their outputs in terms of absolute risk of stroke, and, as such, there is no consensus regarding a risk threshold. In contrast, QStroke calculates the absolute risk of stroke and so, unlike CHADS2, is able to inform future debate around what threshold is appropriate to intervene with oral anticoagulation. Choice of threshold is a complex area dependent on many variables relating to clinical outcomes and service costs, and to do justice to the complexity, we consider it should be the subject of a separate paper. We have, however, provided analyses using a range of thresholds of risk which can be used to help inform future analyses and guidelines
The results of our validation statistics for CHA2DS2VASc and CHADS2 in patients with atrial fibrillation are broadly similar to those reported using another UK GP database25 and a Danish registry cohort.14 Both studies showed improved performance of CHA2DS2VASc compared with CHADS2. The Danish study additionally showed that CHA2DS2VASc was better for identifying those at low and intermediate risk.14 Our results, however, are not directly comparable with those of the Danish study as that study included venous thromboembolism in the definition of the outcome, whereas our study included only stroke and transient ischaemic attack.14 QStroke, CHA2DS2VASc, and CHADS2 tended to perform better in men with atrial fibrillation compared with women with atrial fibrillation, which deserves further study.
Implications for clinical practice
Whilst the new QStroke algorithm is more complex than CHA2DS2VASc or CHADS2, it has several advantages. It includes weighting for ethnicity and deprivation, which should help avoid widening health inequalities. The algorithm uses routinely collected data, which means it can be easily and regularly updated to reflect changes in populations, improvements in data quality, advances in knowledge, and evolving guidelines. The algorithms can also be implemented in primary care since the data are already present in the clinical computer systems. QStroke will work both in populations with atrial fibrillation and those without atrial fibrillation—though the immediate clinical use might be for risk stratification among patients with atrial fibrillation, QStroke can still inform other patients of their specific risk of stroke or transient ischaemic attack as part of their general cardiovascular risk assessment.
QStroke has also been designed to be integrated into UK general practice clinical computer systems, where the risk factors are already recorded and used to calculate closely related scores such as QRISK2. Much of the apparent complexity relating to additional variables and interactions can be incorporated into the software using data already entered into each patient’s electronic health record. There are only three variables in QStroke (congestive cardiac failure, coronary heart disease, and valvular heart disease) that are not in QRISK2. Where possible we used the definitions from the Quality and Outcomes Framework, which should simplify its implementation. QRISK2 is integrated into all four UK GP clinical computer systems, and QStroke can be implemented in a similar way. For example, clinicians can use structured templates within the consultation to calculate a patient’s risk and use the information to inform treatment decisions. It can also be used in “batch processing” mode to calculate an estimated risk for all eligible patients registered with a practice so that patients with the highest risk can be recalled. Additionally, QStroke could easily be integrated in the GRASP-AF tool (Guidance on Risk Assessment and Stroke Prevention in Atrial Fibrillation), which is a primary care database interrogation tool designed to help identify possible candidates for anticoagulation from practice lists.29
Another advantage of QStroke compared with either CHA2DS2VASc or CHADS2 is that it gives an absolute measure of stroke risk which can more easily be explained to a patients (for example, “Of 100 people like you, X are likely to have a stroke or transient ischaemic attack within the next 10 years”), rather than a simple integer that has no direct interpretation of absolute stroke risk. Should a tool be developed that quantifies absolute risk of bleeding with anticoagulation, it will be possible to do a more direct assessment of risk of stroke in patients compared with potential risk and benefits of anticoagulation, thus providing better information for patients to make an informed choice. This is important since anticoagulation treatment is usually life long, and the risk of bleeding increases with increasing age.
QStroke has not been designed be used in patients with atrial fibrillation who have had a previous stroke, since all such patients should be prescribed anticoagulation and an estimation of risk will not affect the clinical decision. To ensure a fair comparison, we compared the performance of QStroke against CHADS2 and CHA2DS2VASc only in patients with atrial fibrillation who were free from stroke. However, removing the patients already receiving treatment may result in the higher risk patients being removed from the cohort, which might then result in an underestimation of risk in patients with atrial fibrillation overall.
The methods to derive and validate this model are the same as those used for the original development of QRISK2 and a range of other risk prediction tools. The strengths and limitations of the approach have already been discussed in detail,4 30 31 32 33 34 including information on multiple imputation of missing data. In summary, key strengths include size, duration of follow-up, representativeness, and lack of selection, recall, and respondent bias. UK general practices have good levels of accuracy and completeness in recording clinical diagnoses and prescribed drugs.35 36 We think our study has good face validity since it has been conducted in the setting where most patients in the UK are assessed, treated, and followed up. Limitations include lack of formally adjudicated outcomes, information bias, and potential for bias due to missing data. Our database has linked cause of death from the UK Office of National Statistics, and our study is therefore likely to have picked up most cases of stroke or transient ischaemic attack, thereby minimising ascertainment bias. Patients who die of stroke in hospital will have stroke or transient ischaemic attack recorded on their death certificate and therefore will be included on the linked cause of death data. Other patients who have stroke or transient ischaemic attack diagnosed in hospital who do not die will have the information recorded in hospital discharge letters which are sent to the patients’ general practice and then entered into each patient’s electronic record. We excluded people without a valid deprivation score since this group may represent a more transient population where follow-up for stroke could be unreliable or unrepresentative. Their deprivation scores are unlikely to be missing at random so we did not think it would be appropriate to impute them.
The present validation has been done on a completely separate set of practices and individuals to those which were used to develop the score, although the practices all use the same clinical computer system (EMIS, the computer system used by 55% of UK general practices). An independent validation study would be a more stringent test and should be done, but when such independent studies have examined other risk algorithms,6 8 31 33 they have demonstrated similar performance compared with the validation in the QResearch database.5 30 32
This QStroke model has been developed using data from England and Wales and includes UK derived ethnicities and a postcode-based deprivation score. It is therefore not immediately applicable for clinical use in international settings without some modification of the UK-specific risk factors and validation in the setting in which it is intended to be used.
We have developed and validated a new algorithm to predict risk of stroke. QStroke shows some improvement over current risk scoring methods, CHADS2 and CHA2DS2VASc, for patients with atrial fibrillation for whom anticoagulation may be required. QStroke also provides an accurate measure of absolute stroke risk in the general population of patients free of stroke or transient ischaemic attack, as shown by its performance in a separate validation cohort. Further research is needed to evaluate the clinical outcomes and cost effectiveness of using these algorithms in primary care.
What is already known on this topic
Methods to identify patients at high or low risk of stroke are needed to identify patients for whom interventions may be required, especially those with atrial fibrillation for whom anticoagulation might be needed
Current methods for risk scoring, such as CHADS2 and CHA2DS2VASc, are not based on a statistical model, do not include many established risk factors, nor provide absolute risk estimates of stroke
What this study adds
We have developed a new algorithm to quantify absolute risk of primary stroke which includes established risk factors and which is designed to work with the QRISK2 cardiovascular disease algorithm
QStroke provides a valid measure of absolute stroke risk in the general population of patients free of stroke or transient ischaemic attack as shown by its performance in a separate validation cohort
QStroke shows some improvement on current risk scoring methods, CHADS2 and CHA2DS2VASc, for the subset of patients with atrial fibrillation for whom anticoagulation may be required
Further research is needed to evaluate the clinical outcomes and cost effectiveness of using these algorithms in primary care
Cite this as: BMJ 2013;346:f2573
We acknowledge the contribution of EMIS practices that contribute to QResearch and the University of Nottingham and EMIS for expertise in establishing, developing, and supporting the database.
Contributors: JHC initiated the study; undertook the literature review, data extraction, data manipulation, and primary data analysis; and wrote the first draft of the paper. CC contributed to the design, analysis, interpretation, and drafting of the paper. PB contributed to the development of core ideas, the analysis plan, the interpretation of the results, and the drafting of the paper.
Competing interests: All authors have completed the ICMJE uniform disclosure form at www.icmje.org/coi_disclosure.pdf (available on request from the corresponding author) and declare: JHC is professor of clinical epidemiology at the University of Nottingham and co-director of QResearch, a not-for-profit organisation which is a joint partnership between the University of Nottingham and EMIS (commercial supplier of IT for 60% of general practices in the UK). JHC is also director of ClinRisk, which produces open and closed source software to ensure reliable and updatable implementation of clinical risk algorithms within clinical computer systems. CC is associate professor of Medical Statistics at the University of Nottingham and a consultant statistician for ClinRisk. This work and any views expressed within it are solely those of the authors and not of any affiliated bodies or organisations. There are no other relationships or activities that could appear to have influenced the submitted work.
Approvals: The project was approved in accordance with the QResearch agreement with Trent Multi-Centre Research Ethics Committee.
Data sharing: The patient level data from the QResearch are specifically licensed according to its governance framework. See www.qresearch.org for further details. The QStroke algorithm will be published as open source software under the GNU Lesser Public License.
This is an Open Access article distributed in accordance with the Creative Commons Attribution Non Commercial (CC BY-NC 3.0) license, which permits others to distribute, remix, adapt, build upon this work non-commercially, and license their derivative works on different terms, provided the original work is properly cited and the use is non-commercial. See: http://creativecommons.org/licenses/by-nc/3.0/.