- Lasse T Krogsbøll, doctor,
- Karsten Juhl Jørgensen, doctor,
- Christian Grønhøj Larsen, doctor,
- Peter C Gøtzsche, professor, director
- Correspondence to: L T Krogsbøll
- Accepted 15 October 2012
Objectives To quantify the benefits and harms of general health checks in adults with an emphasis on patient-relevant outcomes such as morbidity and mortality rather than on surrogate outcomes.
Design Cochrane systematic review and meta-analysis of randomised trials. For mortality, we analysed the results with random effects meta-analysis, and for other outcomes we did a qualitative synthesis as meta-analysis was not feasible.
Data sources Medline, EMBASE, Healthstar, Cochrane Library, Cochrane Central Register of Controlled Trials, CINAHL, EPOC register, ClinicalTrials.gov, and WHO ICTRP, supplemented by manual searches of reference lists of included studies, citation tracking (Web of Knowledge), and contacts with trialists.
Selection criteria Randomised trials comparing health checks with no health checks in adult populations unselected for disease or risk factors. Health checks defined as screening general populations for more than one disease or risk factor in more than one organ system. We did not include geriatric trials.
Data extraction Two observers independently assessed eligibility, extracted data, and assessed the risk of bias. We contacted authors for additional outcomes or trial details when necessary.
Results We identified 16 trials, 14 of which had available outcome data (182 880 participants). Nine trials provided data on total mortality (11 940 deaths), and they gave a risk ratio of 0.99 (95% confidence interval 0.95 to 1.03). Eight trials provided data on cardiovascular mortality (4567 deaths), risk ratio 1.03 (0.91 to 1.17), and eight on cancer mortality (3663 deaths), risk ratio 1.01 (0.92 to 1.12). Subgroup and sensitivity analyses did not alter these findings. We did not find beneficial effects of general health checks on morbidity, hospitalisation, disability, worry, additional physician visits, or absence from work, but not all trials reported on these outcomes. One trial found that health checks led to a 20% increase in the total number of new diagnoses per participant over six years compared with the control group and an increased number of people with self reported chronic conditions, and one trial found an increased prevalence of hypertension and hypercholesterolaemia. Two out of four trials found an increased use of antihypertensives. Two out of four trials found small beneficial effects on self reported health, which could be due to bias.
Conclusions General health checks did not reduce morbidity or mortality, neither overall nor for cardiovascular or cancer causes, although they increased the number of new diagnoses. Important harmful outcomes were often not studied or reported.
Systematic review registration Cochrane Library, doi:10.1002/14651858.CD009009.
General health checks have long been common elements of healthcare in some countries such as the United States.1 2 In the UK, the publicly funded NHS Health Check programme was introduced in 2009, and in Denmark an organised health check programme for the general public has been suggested, but now seems abandoned. Health checks are also performed by some primary care physicians outside organised programmes and by commercial clinics.3 However, evidence for their effectiveness has been lacking.
General health checks involve a contact between a person and a healthcare professional to identify signs, symptoms, or risk factors for disease that were previously unrecognised. They are combinations of screening tests, few of which have been adequately studied in randomised trials. For example, although the benefits and harms of treatments for conditions such as hypertension and diabetes have been extensively studied in randomised trials, screening asymptomatic people for these conditions has not.4 5
Health checks are intended to reduce morbidity and prolong life. Theoretically, there are many possible benefits of general health checks, through apparently intuitive mechanisms. The detection of elevated risk factors such as hypertension or hypercholesterolaemia may lead to reductions in morbidity and mortality through preventive treatment. Some tests may detect precursors to disease, such as cervical dysplasia, the treatment of which may prevent cancer from developing. Also, it may be beneficial to detect signs or symptoms of manifest disease that the person had not deemed important. Some people might improve their lifestyle because of the test results and counselling, and healthy people may feel reassured.
While we cannot be certain that general health checks lead to benefit, we know that all medical interventions can lead to harm. Possible harms from health checks are overdiagnosis, overtreatment, distress or injury from invasive follow-up tests, distress due to false positive test results, false reassurance due to false negative test results, possible continuation of adverse health behaviours due to negative test results, adverse psychosocial effects due to labelling, and difficulties with getting insurance. Last but not least, organised programmes of general health checks are likely to be expensive and may result in lost opportunities to improve other areas of healthcare.
We aimed to investigate the balance between benefits and harms of general health checks in adult populations, unselected for diseases or risk factors, and performed by any type of healthcare provider. We did not focus on surrogate outcomes because they may be seriously misleading9 and do not capture harmful effects.10 There is also a risk of biased loss to follow-up in non-blinded trials, whereas mortality status can usually be obtained for all randomised people.
The review was done according to a detailed, peer reviewed protocol, which is available in the Cochrane Library.
We included randomised trials of general health checks compared with no health checks. The participants had to be 18 years or older and unselected for specific known risk factors or diseases, such as hypertension or heart disease. The setting had to be primary care or the community (that is, we did not include trials in patients recruited from hospital clinics). We accepted trials regardless of the type of provider of the health check and regardless of where the health check was performed (such as general practice or a special clinic).
We defined general health checks as screening for more than one disease or risk factor in more than one organ system, whether performed only once or repeatedly. This deﬁnition excludes trials of screening for single diseases in isolation, such as prostate cancer, and trials of single screening tests that may detect more than one disease, such as spirometry. We accepted trials which included a lifestyle intervention (such as advice on diet, smoking, and exercise) in addition to screening, since this is a fairly well defined intervention often incorporated into health checks.
Although we originally planned to include trials of geriatric screening, we found that they included many interventions in addition to screening, such as falls prevention and specialist medication review. Thus, we excluded trials described as specifically targeting older people only, or which only enrolled people aged >65.
Search methods for identification of studies
Studies were identified using the Cochrane Central Register of Controlled Trials (CENTRAL) 2010, issue 11; Medline (via OVID) (1948 to “In-Process”); EMBASE (via OVID) (1947 onwards); Cumulative Index to Nursing and Allied Health Literature (CINAHL); EbscoHost (1980 onwards); Healthstar (via OVID) (1966 to 2010); and the EPOC Specialised Register. Related systematic reviews were identified by searching the Database of Abstracts of Reviews of Effectiveness (DARE), and ongoing trials were identified by searching ClinicalTrials.gov and WHO ICTRP. The searches were conducted in November and December 2010 and updated in July 2012. An example of a search strategy is available in appendix 1 on bmj.com.
Two observers searched the reference lists of included articles, and one author used citation tracking (Web of Knowledge) on all articles describing eligible trials. We asked authors of the included studies if they were aware of any other published, unpublished, or ongoing studies that could meet our inclusion criteria.
Selection of studies
Two observers (LTK and CGL or KJJ) independently assessed the potential relevance of all titles and abstracts identified through the searches. Full text copies of potentially relevant articles were assessed for eligibility independently by two authors (LTK and CGL or KJJ). Disagreements were resolved through discussion, involving the other authors (KJJ and PCG) when necessary.
Two authors (LTK and KJJ) independently extracted pre-specified data items from the included articles in a non-blinded fashion and entered them into a pilot tested data extraction form. When our preferred data formats were not available, we extracted what was possible, including narrative accounts if numbers were missing. We preferentially extracted data allowing an intention to treat analysis. We attempted to contact authors when necessary and succeeded in 10 cases.
Two authors (LTK and KJJ) independently assessed risk of bias in the included trials using the Cochrane Risk of Bias tool. The domains formally assessed were sequence generation, allocation concealment, blinding of participants and personnel, blinding of outcome assessment, incomplete outcome data, selective reporting, and other biases. Baseline balance and risk of contamination was also assessed.
Our primary outcomes were total mortality and disease-specific mortality. Our secondary outcomes were morbidity (such as myocardial infarction), number of new diagnoses (total and condition-specific), admission to hospital, disability, patient worry, self reported health, number of referrals to specialists, number of non-scheduled visits to general practitioners, number of additional diagnostic procedures due to positive screening tests, new medications prescribed, frequency and type of surgery, and absence from work.
When cardiovascular and cancer mortality were reported as such, we used those numbers. When they were reported in several disease categories or organ systems, two of us independently combined them into an overall measure of cardiovascular or cancer mortality. For example, in one trial we added fatal coronary heart disease and fatal stroke to give a measure of cardiovascular mortality.
Meta-analysis was feasible only for our primary outcomes. We calculated risk ratios with 95% confidence intervals using the random effects model. To allow incorporation of adjusted effect estimates we used the generic inverse variance approach. Heterogeneity was investigated with the I2 statistic.
We conducted the following pre-specified subgroup analyses: one versus multiple health checks, lifestyle intervention versus no lifestyle intervention, length of follow-up (≤5 years versus >5 years), trial age (started before 1980 versus after 1980), geographical location (Europe versus US), examination by a physician, and risk of bias (selection bias, performance bias, detection bias, attrition bias, contamination). We did one pre-specified sensitivity analysis, excluding cluster randomised trials, and one post hoc sensitivity analysis excluding trials judged to be biased towards no effect. The results of these are presented in the corresponding Cochrane review.11 For other outcomes, we summarised the results in tables and did a qualitative synthesis.
Results of the search
The 14 trials analysed included a total of 182 880 participants, with 76 403 allocated to health checks and 106 477 to control groups. The length of follow-up varied from 1 to 22 years (table 1⇓). The participants were recruited from general practice in five trials,14 15 16 17 18 the general population in seven trials,19 20 21 22 23 24 25 health plan members in one trial,26 and the workplace in one trial.27 The health checks took place in general practice in four trials, a screening clinic in five trials, at the workplace in one trial, in a hospital in one trial, and in three trials it was not clear. Table 2⇓ provides a summary of the trials’ methods, and table 3⇓ provides an overview of the screening tests used.
Risk of bias in included studies
Risk of bias varied between trials, and within trials for different outcomes (fig 2⇓). Most trials randomised participants before any contact was made, effectively leading to concealed allocation. When the randomisation sequence was predictable but likely to provide balanced groups given allocation before contact (such as date of birth), we judged the risk of selection bias to be low.15 19 20 26 Of the nine trials that reported mortality,14 16 18 19 20 21 22 26 27 seven had a low risk of selection bias, and eight had a low risk of attrition bias for that particular outcome. All nine trials reporting mortality could be analysed by intention to treat. By design, three trials were biased towards no effect.14 18 26 In two of these, the control group was offered health checks before follow-up for mortality ended. In one, the control group had free access to the same health check as the intervention group and, though not actively encouraged, used this option to a considerable extent. In four trials, the follow-up and treatment of detected abnormalities were possibly better in the intervention group than in the control group (for example, follow-up by specialists who used treatment algorithms).19 20 22 27 This might have caused bias in favour of screening.
Effects of interventions
Nine trials reported on total mortality, and our meta-analysis included 155 899 people and 11 940 deaths. The median length of follow-up was nine years (range 4–22 years), and the median event rate in the control groups was 7% (range 2%–16%). We did not find an effect of general health checks on total mortality, risk ratio 0.99 (95% confidence interval 0.95 to 1.03) (fig 6⇓). There was no heterogeneity (I2=0%). Subgroup and sensitivity analyses did not alter this result.
For cardiovascular mortality (8 trials, 152 435 people, 4567 deaths), the median length of follow-up was 10.4 years and the median event rate in the control groups was 3.7%. The pooled estimate was risk ratio 1.03 (0.91 to 1.17), but with large heterogeneity (I2=64%) (fig 7⇓). Subgroup and sensitivity analyses did not alter the results, nor explain the heterogeneity. One possible explanation for the heterogeneity is the varying definitions of the outcome among trials. One trial found a large beneficial effect,20 and one found a large harmful effect.14
For cancer mortality (8 trials, 139 290people, 3663 deaths), the median length of follow-up was 10.4 years, and the median event rate in the control groups was 2.4%. The pooled estimate was risk ratio 1.01 (0.92 to 1.12) with moderate heterogeneity (I2=33%) (fig 8⇓). A high quality trial found a reduction in cancer mortality (risk ratio 0.87 (0.76 to 0.99)).22 That trial did not use cancer screening tests, and was not successful in reducing smoking.
Subgroup and sensitivity analyses
The pre-specified subgroup analyses resulted in groups with few trials, and the results should be viewed with caution. We did not find any convincing patterns or explanations for the heterogeneity observed.
For cancer mortality, three trials that used only one health check showed a trend towards harm (relative risk 1.10 (1.00 to 1.21)), and five trials that used more than one health check showed a trend towards benefit (relative risk 0.92 (0.83 to 1.02)). The test for subgroup differences was significant (P = 0.01).
For cardiovascular mortality, the reverse pattern was present. The three trials using only one health check showed a trend towards benefit (relative risk 0.89 (0.69 to 1.14)), and the five trials using more than one health check showed a trend towards harm (relative risk 1.11 (0.95 to 1.30)). The test for subgroup differences was not significant (P=0.13).
In a post hoc sensitivity analysis, we removed the three trials that were biased towards no effect14 18 26 and one trial in which we had prioritised power over contrast in the merging of three intervention groups.16 This did not change the results for total mortality (relative risk 0.98 (0.94 to 1.02), cardiovascular mortality (0.97 (0.86 to 1.09)), or cancer mortality (1.01 (0.88 to 1.17)).
We refer the reader to appendix 2 on bmj.com for detailed results for our secondary outcomes. In summary, we did not find an effect on clinical events, such as coronary heart disease, or other measures of morbidity, but they were infrequently reported. One trial found an increased occurrence of hypertension and hypercholesterolaemia with screening. One trial found a 20% increase in the total number of new diagnoses per participant over six years compared with the control group and an increased occurrence of self reported chronic disease. Other trials reported large numbers of abnormalities detected at the health checks. No trials compared the total number of prescriptions, but two out of four trials found an increased number of people using antihypertensive drugs. Two out of four trials found small beneficial effects on self reported health, but this could be due to reporting bias as the trials were not blinded. We did not find an effect on admission to hospital, disability, worry, additional visits to the physician, or absence from work, but most of these outcomes were poorly studied. We did not find useful results on the number of referrals to specialists, the number of follow-up tests after positive screening results, or the amount of surgery used.
Summary of main results
We did not find an effect on total or cause-specific mortality from general health checks in adult populations unselected for risk factors or disease. For total mortality, our confidence interval includes a 5% reduction and a 3% increase, both of which would be clinically relevant. However, for the causes of death most likely to be influenced by health checks, cardiovascular mortality and cancer mortality, there were no reductions either. A substantial latency of effects on mortality would be expected, but we included several trials with very long follow-up, and they did not show a benefit. Neither did we find a difference in effects in our subgroup analysis comparing trials with up to five years of follow-up with trials with more than five years of follow-up. The results suggest that the lack of effect on total mortality is not a chance finding or due to low power, but that there is no, or only a minimal, effect of the intervention on mortality in general adult populations. We did not include geriatric trials, and our results therefore do not apply to this population.
We also looked at several other outcomes that might be influenced by health checks, but most of these were either infrequently reported or the results were at high risk of bias because of the inevitable lack of blinding and consequent risk of reporting bias and biased loss to follow-up. We did find that health checks led to more diagnoses and more medical treatment for hypertension, as expected, but, as these did not improve mortality or morbidity, they may be considered harms rather than benefits. Two trials found improved self reported health, but the effects were small and could be due to bias.
Strengths and weaknesses of the review
The main strength of this review is our attempt to reduce bias in the review process by conducting it according to a published and peer reviewed Cochrane protocol and by following empirically founded review guidelines. We identified more relevant trials than previous reviews and did a thorough data collection and appraisal of included studies.
The main limitations are the risk of bias in some of the included trials, their age, and infrequent and poor reporting of some of our specified outcomes, in particular the harms. Another possible limitation is the clinical and methodological heterogeneity among the included trials, although the results were generally consistent for the frequently reported outcomes.
Strengths and weaknesses in relation to other studies
A systematic review of “the periodic health evaluation” included both trials and observational studies, and also geriatric studies, but it used a different definition of the intervention.6 The trials reviewed by us are mostly different ones, but the results are broadly similar with regard to the outcomes that were assessed in both reviews: total mortality, hospitalisation, disability, and the number of new diagnoses (disease detection). In terms of the effects of health checks on participants’ health worries, the previous review found one geriatric trial with a beneficial effect, whereas we found two trials with no effect on this outcome. Other reviews studied the effect of calculating and communicating coronary risk, but had a more narrow definition of the intervention, and did not find results on morbidity and mortality.7 8
In order to get the most reliable answers to our questions, we did not include observational studies because the influence of self selection bias is too great compared with the expected small effect of an intervention in a predominantly healthy population. We also chose not to focus on surrogate outcomes such as changes in risk factors or delivery of preventive services, as these may be misleading because an improvement does not necessarily benefit the participant and because they do not measure harms. Nevertheless, we succeeded in identifying several trials that addressed our research questions.
We did not include geriatric trials because they included additional interventions likely to affect the outcomes. A systematic review found that geriatric assessments for general elderly populations reduced the risk of not living at home and of being admitted to a nursing home, but did not find an effect on mortality.28
Meaning of the study
The lack of beneficial effects indicates that the interventions did not work as intended in the included trials. There are several possible explanations for this. Most of the trials were old and consequently used treatments different from what would be used today—such as clofibrate or nicotinic acid for hypercholesterolaemia, instead of statins. Also, thresholds for treating cardiovascular risk factors were higher than they are today. However, it is not a given that the results would be better today, as medical innovations sometimes prove harmful29 and as reducing risk factor thresholds means treating people at lower risk who have a smaller potential for benefit but the same risk of harm.30 Another possibility is that preventive drugs could have a less favourable balance between benefits and harms when used in general populations compared with in pharmacological trials, which often use many exclusion criteria.31 In our meta-analyses, arranged by year of trial start, there are no visible time trends and the idea of increasing benefits over time remains hypothetical. The results on mortality from the Inter99 trial 25 will be published soon and will inform about the effect of health checks in a modern setting.
Finally, some of the trials used only one health check instead of repeated health checks. For cancer mortality, subgroup analysis showed a trend towards benefit from more than one health check and towards harm from one health check only. For cardiovascular mortality, the opposite trends were observed. We regard these results as chance findings. Also, it is not a given that several health checks would be better than one, as some of the harms would increase.
Two other factors are probably important for explaining our results. First, people who accept an invitation to a health check are often different from those who do not. They tend to have higher socioeconomic status,32 lower cardiovascular risk,33 less cardiovascular morbidity,25 and lower mortality.22 Thus, systematic health checks may not reach those who need prevention the most, and they have been described as another example of inverse care.33 Second, many physicians already carry out testing for cardiovascular risk factors or diseases in patients whom they judge to be at risk when they see them for other reasons. This is often considered an integral part of primary care. Such clinically motivated testing may already have identified many people with disease or elevated risk factors, thus eroding the potential for a benefit from systematic screening.
Our results do not support the use of general health checks aimed at a general adult population outside the context of randomised trials. However, they do not imply that physicians should stop clinically motivated testing and preventive activities, as these may be an important reason why systematic health checks showed no effect. Also, our results do not imply that all individual components of the health checks are ineffective, since effects of harmful components may have balanced out effects of beneficial ones.
We suggest that future research is directed at the individual components of health checks, such as screening for cardiovascular risk factors, chronic obstructive pulmonary disease, diabetes, or kidney disease. We also suggest that surrogate outcomes such as changes in risk factors are not used for assessing the benefits of health checks. The large randomised trials with long follow-up that are required are expensive, but not nearly as expensive as the implementation of ineffective or harmful general health check programmes.
What is already known on this subject
General health checks are widely assumed to be effective in reducing morbidity and mortality from disease based on common sense and on observations of reductions in risk factors and increased delivery of preventive services
However, a demonstration of benefits in terms of morbidity and mortality has been lacking
What this study adds
This systematic review of randomised trials suggest that general health checks in adults may not reduce morbidity or mortality from disease
Harms were sparsely studied in individual trials. Since health checks probably increase the number of diagnoses, the absence of benefits suggests overdiagnosis and overtreatment
Current use of general health checks is not supported by the best available evidence
Cite this as: BMJ 2012;345:e7191
We thank Guy De Backer, Walter W Holland, Sven-Olof Isacsson, Torben Jørgensen, Olof Lannerstad, Torsten Lauritzen, David Murray, Charlotta Pisinger, Lennart Welin, and Lars Wilhelmsen for additional information on their trials, and David Mant, Alice Fuller, Holger Theobald, and Janus L Thomsen for providing unpublished outcome data. We also thank the EPOC trials search coordinator, Michelle Fiander, for designing, conducting, and updating the searches, the EPOC Cochrane review group for editorial assistance in producing the corresponding Cochrane review, and the peer reviewers for their valuable comments.
This paper is based on a Cochrane review by the same authors.11 Cochrane reviews are regularly updated as new evidence emerges and in response to comments and criticisms. The Cochrane Library should be consulted for the most recent version of the review.
Contributors: PCG initiated the project. LTK drafted the protocol, and KJJ and PCG provided comments. LTK, CGL, and KJJ screened titles and abstracts and made decisions about inclusion of trials. LTK and KJJ extracted data. LTK analysed data and drafted the review, and KJJ, PCG, and CGL contributed to the revisions. LTK is guarantor.
Funding: LTK was partly supported by a grant from Trygfonden (non-profit foundation). The funder had no role in study design or data collection, analysis, or interpretation.
Competing interests: All authors have completed the ICMJE uniform disclosure form at www.icmje.org/coi_disclosure.pdf (available on request from the corresponding author) and declare: no support from any organisation for the submitted work; no financial relationships with any organisations that might have an interest in the submitted work in the previous three years; no other relationships or activities that could appear to have influenced the submitted work.
Ethical approval: Not required
Data sharing: An excel sheet detailing the inverse variance analyses and the exact numbers used are available from the authors.
This is an open-access article distributed under the terms of the Creative Commons Attribution Non-commercial License, which permits use, distribution, and reproduction in any medium, provided the original work is properly cited, the use is non commercial and is otherwise in compliance with the license. See: http://creativecommons.org/licenses/by-nc/2.0/ and http://creativecommons.org/licenses/by-nc/2.0/legalcode.