The unpredictability paradox: review of empirical comparisons of randomised and non-randomised clinical trials

Regina Kunz; Andrew D Oxman

doi:10.1136/bmj.317.7167.1185

Papers

The unpredictability paradox: review of empirical comparisons of randomised and non-randomised clinical trials

BMJ 1998; 317 doi: https://doi.org/10.1136/bmj.317.7167.1185 (Published 31 October 1998) Cite this as: BMJ 1998;317:1185

Regina Kunz, registrara,
Andrew D Oxman, director (andrew.oxman{at}labmed.uio.no)b

Department of Nephrology, Charité, Berlin, Germany
Health Services Research Unit, National Institute of Public Health, Oslo, Norway

Correspondence to: Dr Oxman

Abstract

Objective To summarise comparisons of randomised clinical trials and non-randomised clinical trials, trials with adequately concealed random allocation versus inadequately concealed random allocation, and high quality trials versus low quality trials where the effect of randomisation could not be separated from the effects of other methodological manoeuvres.

Design Systematic review.

Selection criteria Cohorts or meta-analyses of clinical trials that included an empirical assessment of the relation between randomisation and estimates of effect.

Data sources Cochrane Review Methodology Database, Medline,SciSearch, bibliographies, hand searching of journals, personal communication with methodologists, and the reference lists of relevant articles.

Main outcome measures Relation between randomisation and estimates of effect.

Results Eleven studies that compared randomised controlled trials with non-randomised controlled trials (eight for evaluations of the same intervention and three across different interventions), two studies that compared trials with adequately concealed random allocation and inadequately concealed random allocation, and five studies that assessed the relation between qualityscores and estimates of treatment effects, were identified. Failure to use random allocation and concealment of allocation were associated with relative increases in estimates of effects of 150% or more, relative decreases of up to 90%, inversion of the estimated effect and, in some cases, no difference. On average, failure to use randomisation or adequate concealment of allocation resulted in larger estimates of effect due to a poorer prognosis in non-randomly selected control groups compared with randomly selected control groups.

Conclusions Failure to use adequately concealed random allocation can distort the apparent effects of care in either direction, causing the effects to seem either larger or smaller than they really are. The size of these distortions can be as large as or larger than the size of the effects that are to be detected.

Key messages

Empirical studies support using random allocation in clinical trials and ensuring that the allocation process is concealed—that is, that assignment is impervious to any influence by the people making the allocation
The effect of not using concealed random allocation can be as large or larger than the effects of worthwhile interventions
On average, failure to use concealed random allocation results in overestimates of effect due to a poorer prognosis in non-randomly selected control groups compared with randomly selected control groups, but it can result in underestimates of effect, reverse the direction of effect, mask an effect, or give similar estimates of effect
The adequacy of allocation concealment may be a more sensitive measure of bias in clinical trials than scales used to assess the quality of clinical trials
It is a paradox that the unpredictability of randomisation is the best protection against the unpredictability of the extent and direction of bias in clinical trials that are not properly randomised

Introduction

Observational evidence is clearly better than opinion, but it is thoroughly unsatisfactory. Allresearch on the effectiveness of therapy was in this unfortunate state until the early 1950s. The only exceptions were the drugs whose effect on immediate mortality were so obvious that no trials were necessary, such as insulin, sulphonamide, and penicillin.1

“The basic idea, like most good things, is very simple.”1 Randomisation is the only means of controlling for unknown and unmeasured differences between comparison groups as well as those that are known and measured. Random assignment removes the potential of bias in the assignment of patients to one intervention or another by introducing unpredictability. When alternation or any other preset plan (such as time of admission) is used, it is possible to arrange to enter a patient into a study at an opportune moment. With randomisation, however, each patient's treatment is assigned according to the play of chance. It is a paradox that unpredictability is introduced into the design of clinical trials by using random allocation to protect against the unpredictability of the extent of bias in the results of non-randomised clinical trials.

Despite this simple logic, and many examples of harm being done because of delays in conducting randomised trials, there are limitations to the use of randomised trials, both real and imagined, and scepticism about the value of randomisation.2^–5 We believe this scepticism is healthy. It is important to question assumptions about research methods, and to test these assumptions empirically, just as it is important to test assumptions about the effects of health care. In this paper we have attempted systematically to summarise empirical studies of the relation between randomisation and estimates of effect.

Methods

We included four types of comparisons in our review: randomised clinical trials versus non-randomised clinical trials of the same intervention, randomised clinical trials versus non-randomised clinical trials across different interventions, adequately concealed random allocation versus inadequately concealed random allocation in trials, and high quality trials versus low quality trials in which the specific effect of randomisation or allocation concealment could not be separated from the effect of other methodological manoeuvres such as double blinding. Both descriptive and analytical assessments of the relation between the use of random allocation and estimates of effect are included, based on cohorts or meta-analyses of clinical trials.

We identified studies from the Cochrane Review Methodology Database,6other methodological bibliographies, Medline, and SciSearch, and by hand searching journals, personal communication with methodologists, and checking the reference lists of relevant articles. These searches were conducted up to July 1998. Potentially relevant citations were retrieved and assessed for inclusion independently by both authors. Disagreements were resolved by discussion.

We used the following criteria to appraise the methodological quality of included studies: Were explicit criteria used to select the trials? Did two or more investigators agree regarding the selection of trials? Was there a consecutive or complete sample of clinical trials? Did the study control for other methodological differences such as double blinding and complete follow up? Did the study control for clinical differences in theparticipants and interventions in the included trials? Were similar outcome measures used in the included trials? The overall quality of each study was summarised as: no important flaws, possibly important flaws, or major flaws.

Table 1

Randomised controlled trials (RCTs) compared with non-randomised controlled trials (non-RCTs) of the same intervention

View this table:

For each study one of us (RK) extracted information about the sample of clinical trials, the comparison that was made, the type of analysis undertaken, and the results, and the other checked the extracted data against the published article. The reported relation between randomisation and estimates of effect was recorded and, if possible, converted to the relative overestimation or underestimation of the relative risk reduction. We prepared tables for each type of comparison to facilitate a qualitative analysis of the extent to which the included studies yielded similar results, and heterogeneity in the included studies was explored both within and across comparisons.

In summarising the results we have assumed that evidence from randomised trials is the reference standard to which estimates from non-randomised trials are compared. However, as with other gold standards, randomised trials are not without flaws, and this assumption is not intended to imply that the true effect is known, or that estimates derived from randomised trials are always closer to the truth than estimates from non-randomised trials.

Results

We have identified 18 cohorts or meta-analyses that met our inclusion criteria, totalling 1211 clinical trials.7^–24Efforts to develop an efficient electronic search strategy using Medline have thus far not been successful due to poor indexing. Searches for studies that cited Colditz and colleagues,15Miller and colleagues,16Chalmers and colleagues,18 or Schulz and colleagues19 using SciSearch yielded seven additional studies. Searches using SciSearch for studies that cited the other studies meeting our inclusion criteria did not yield any other additional studies. Exploratory hand searching of three methodological journals (Controlled Clinical Trials, Statistics in Medicine, and the Journal of Clinical Epidemiology) for four years (1970, 1980, 1990, and 1995) yielded a single relevant study published in 1990. The 18 included studies were published in 14 different journals. The majority of studies were identified through personal communication with methodologists and through bibliographies and reference lists.

Randomised trials versus non-randomised trials of the same intervention

Table 1 summarises the eight studies comparing randomised clinical trials and non-randomised clinical trials of the same intervention. In five of the eight studies, estimates of effect were larger in non-randomised trials. Outcomes in the randomised treatment groups and non-randomised treatment groups were frequently similar, but worse outcomes among historical controls spuriously increased the estimated treatment effects. One study found comparable results for both allocation procedures, and two studies reported smaller treatment effects in non-randomised studies. In one study the smaller estimate of effect was due to a poorer prognosis for patients in the non-randomised treatment groups. The deviation of the estimates of effect for non-randomised trials compared with randomised trials ranged from an underestimation of effect of 76% to an overestimation of effect of 160%.

Table 2

Randomised controlled trials (RCTs) compared with non-randomised controlled trials (non-RCTs) across different interventions

View this table:

Table 3

Trials with adequately concealed allocation compared with inadequately concealed allocation

View this table:

Randomised trials versus non-randomised trials across different interventions

The evidence from comparisons across different interventions and various study designs (randomised controlled trials and non-randomised controlled trials, crossover designs, and observational studies) is less clear (table 2). In all three studies several study designs and clinical conditions were combined and their diverse outcomes converted to a standardised effect size. There was substantial clinical heterogeneity, and there were many other factors that could distort or mask a possible association between randomisation and estimates of effect. No consistent relation between study design or quality and the magnitude of the estimates of effect was detected.

Adequately concealed allocation versus inadequately concealed allocation

Concealed random allocation to treatment—that is, blinding of the randomisation schedule to prevent subversion by the investigators or trial participants—should ensure protection against biased allocation. Chalmers and colleagues found that within randomised controlled trials failure adequately to conceal allocation was associated with larger imbalances in prognostic factors and larger treatment effects (table 3).18They reported a more than sevenfold overestimation of the treatment effect in trials with inadequately concealed allocation. They did not, however, control for other methodological factors in their descriptive analysis.18Schulz and colleagues conducted a multivariate analysis that controlled for blinding and completeness of follow up, which yielded similar results.19They found that inadequately concealed random allocation (for example, alternation) compared with adequately concealed random allocation (for example, assignment by a central office) resulted in estimates of effect (odds ratios) that were on average 40% larger.

High quality trials versus low quality trials

Considerable differences in the observed treatment effect were detected when the results of high quality studies were compared with those of low quality studies in the context of systematic reviews of specific health care (table 4). In these studies the estimates of effect were distorted in both directions and even caused the alarming situation of a harmful intervention associated with a reduction in pregnancies (odds ratio 0.5, on the basis of high quality studies) seeming beneficial in low quality studies (odds ratio 2.6, on the basis of low quality studies). In two meta-analyses, low quality studies consistently underestimated the beneficial effect of the intervention being evaluated by 27% to 100%, and an effective treatment could have been discarded based on the results of low quality studies.

Table 4

Studies of high quality trials compared with low quality trials

View this table:

Methodological quality

The methodological quality of the studies included in this review varied. Four studies met all of our criteria. 19 21^–23 Three of these assessed the impact of bias on the effect of a specific healthcare intervention as part of a systematic review, and the analysis was performed as part of a subgroup analysis to test the robustness of the overall finding.21^–23 The other 14 studies had one or more methodological flaws including not controlling for other methodological manoeuvres 16 18 22 27 or clinical differences. 7 13^–17 20 24

Discussion

It has proved difficult to develop efficient search strategies for locating empirical methodological studies such as the ones included in this review. Although we believe it is unlikely that there are many published methodological studies such as the ones by Sacks and colleagues,8Schulz and colleagues,19Chalmers and colleagues,18and Emerson and colleagues20that we have not identified, there may be unpublished or ongoing studies like these that we have not identified, and it is likely that there are many meta-analyses that meet the inclusion criteria for this review that we have not identified. The Cochrane Library contains 428 completed reviews and 397 protocols, and there are over 1700 entries in the database of abstracts of reviews of effectiveness.26 We have not systematically gone through all of these meta-analyses. An expanded version of this review will be published in the Cochrane Library and kept up to date through the Cochrane Empirical Methodological Studies Methods Group.27Additional studies will be added to the review, and any errors that are identified will be corrected.

We have not included comparisons between randomised controlled trials and cohort studies,28case-control studies, 29 30or evaluations of effectiveness using large healthcare ministrative databases,3although some of the studies in this review included observational studies. Observational studies often provide valuable information that is complementary to the results of clinical trials. For example, case-control studies may be the best available study design for evaluating rare adverse effects, and large database studies may provide important information about the extent to which effects that are expected based on randomised clinical trials are achieved in routine practice. However, it is important to remember that it is only possible to control for confounders that are known and measured in observational studies, and we should be wary of hubris and its consequences in assuming that we know all there is to know about any disease.

As with any review the quality of the data is limited by the quality of the studies that we have reviewed. Most of the studies included in the review had one or more methodological flaws. In many of the included comparisons, particularly those between randomised controlled trials and historically controlled trials, methodological differences other than randomisation may account for some of the observed differences in estimates of effect. 7^–9 13 18

Four of the studies met all of our criteria for assessing methodological quality, 19^–21^–23and one study in particular provided strong support for the conclusion that clinical trials that lack adequately concealed random allocation produce estimates of effect that are on average 40% larger than clinicaltrials with adequately concealed random allocation, but that the degree and the direction of this bias varies widely.19This study also shows the potential contribution that systematic reviews, and notably the Cochran Database of Systematic Reviews, can make towards developing an empirical basis for methodological decisions in evaluations of health care. Currently this empirical basis is lacking, and many methodological debates rely more on logic or rhetoric than evidence. Analyses such as the one undertaken by Schulz and colleagues, in which methodological comparisons are made among trials of the same intervention, are likely to yield more reliable results than comparisons that are made across different interventions which, not surprisingly, tend to be inconclusive.15^–17

We have assumed that, in general, differences between randomised trials and non-randomised trials or between trials with adequately concealed random allocation and inadequately concealed random allocation are best explained by bias in the non-randomised controlled trials and inadequately concealed trials. This assumption is supported by findings of large imbalances in prognostic factors as well. However, it is possible that randomised controlled trials can sometimes underestimate the effectiveness of an intervention in routine practice by forcing healthcare professionals and patients to acknowledge their uncertainty and thereby reduce the strength of placebo effects. 4 25 31It is also possible that publication bias can partly explain some of the differences in results observed in studies such as the one by Sacks and colleagues.8This would be the case if randomised trials are more likely to be published regardless of the effect size, than historically controlled trials. However, we are not aware of any evidence that supports this hypothesis, and the available evidence shows consistently that randomised trials, like other research, are also more likely to be published if they have results that are considered significant.32^–35

Several explanations for discrepancies between estimates of effect derived from randomised trials and non-randomised trials are possible. For example, it can be argued that estimates of effect might be larger in randomised trials if the care provided in the context of trials is better than that in routine practice, assuming this is the case for the treatment group and not the control group. Similarly, strict eligibility criteria might select people with a higher capacity to benefit from a treatment, resulting in larger estimates of effect in randomised trials than non-randomised trials with less strict eligibility criteria. If, for some reason, patients with a poor prognosis were more likely to be allocated to the treatment group in non-randomised trials then this would also result in larger estimates of effect in randomised trials. Conversely, if patients with a poor prognosis were more likely to be allocated to the control group in non-randomised trials, as often seems to be the case based on the results of this review, this would result in larger estimates of effect in the non-randomised trials.

Conclusion

Overall, this review supports using random allocation in clinical trials and ensuring that the randomisation schedule is adequately concealed. The effect of not using random allocation with adequate concealment can be as large or larger than the effects of worthwhile interventions. On average, non-randomised trials and randomised trials with inadequately concealed allocation result in overestimates of effect. This bias, however, can go in either direction, can reverse the direction of effect, or can mask an effect.

For those undertaking clinical trials this review provides support for using randomisation to assemble comparison groups.25 For those undertaking systematic reviews of clinical trials, this review provides support for considering sensitivity analyses based on the adequacy of allocation concealment in addition to or instead of on the basis of overall quality scores, which may be less sensitive measures of bias.

As Cochrane stated: “The [randomised controlled trial] is a very beautiful technique, of wide applicability, but as with everything else there are snags.”1 Those making decisions on the basis of clinical trials need to be cautious of small trials (even when they are properly randomised) and systematic reviews of small trials both because of chance effects and the risk of biased reporting. 36 37It is also possible to introduce bias into a trial despite allocation concealment. 19 38 Finally, even when the risk of error due to either bias or chance is small, judgments must be made about the applicability of the results to individual patients 39 40and about the relative value of the probable benefits, harms, and costs. 41 42

Acknowledgments

We thank Alex Jadad, Steve Halpern, and David Cowan for help in locating studies, Dave Sackett and Iain Chalmers for encouragement and advice, Mike Clarke for reviewing the manuscript, Annie Britton and other colleagues for provision of their bibliographies on research methodology, and the investigators who conducted the studies we reviewed.

Contributors: RK and ADO contributed to the preparation of the protocol and the final manuscript and assessed the relevance and methodological quality of retrieved reports. RK prepared the first drafts of the protocol and the paper, undertook the majority of the searches with help from David Cowan, Steve Halpern, Alex Jadad, and collected data from the included studies. ADO checked the collected data against the original reports. Both authors will act as guarantors for the paper.

Footnotes

Funding Norwegian Ministry of Health and Social Affairs
Competing interests None declared.

References

↵
1. Cochrane AL
. Effectiveness and efficiency: random reflections on health services. London: Nuffield Provincial Hospitals Trust, 1972:20–25.
1. Committee for Evaluating Medical Technologies in Clinical Use
. Assessing medical technologies. Washington DC: National Academy Press, 1985:76–78.
↵
1. US Congress, Office of Technology Assessment
. Identifying health technologies that work: searching for evidence, OTA-H-608. Washington DC: US Government Printing Office, 1994:41–51.
↵
1. Black N
. Why we need observational studies to evaluate the effectiveness of health care. BMJ 1996; 312: 1215–1218.
OpenUrl FREE Full Text
↵
1. Weiss CH
. Evaluation. Methods for studying programs and policies. 2nd ed. Upper Saddle River: Prentice Hall, 1998:229–233.
↵
1. Clarke M,
2. Carling C,
3. Oxman AD
1. Cochrane Review Methodology Database
. In: Clarke M, Carling C, Oxman AD, eds. The Cochrane Library. Oxford: Update Software, 1998Issue 3.
↵
1. Chalmers TC,
2. Matta RJ,
3. Smith H Jr.,
4. Kunzler AM
. Evidence favoring the use of anticoagulants in the hospital phase of acute myocardial infarction. N Engl J Med 1977; 297: 1091–1096.
OpenUrl PubMed Web of Science
↵
1. Sacks H,
2. Chalmers TC,
3. Smith H Jr.
. Randomized versus historical controls for clinical trials. Am J Med 1982; 72: 233–240.
OpenUrl CrossRef PubMed Web of Science
↵
1. Diehl LF,
2. Perry DJ
. A comparison of randomized concurrent control groups with matched historical control groups: are historical controls valid?J Clin Oncol 1986; 4: 1114–1120.
OpenUrl Abstract/FREE Full Text
1. Reimold SC,
2. Chalmers TC,
3. Berlin JA,
4. Antman EM
. Assessment of the efficacy and safety of antiarrhythmic therapy for chronic atrial fibrillation: observations on the role of trial design and implications of drug related mortality. Am Heart J 1992; 124: 924–932.
OpenUrl CrossRef PubMed Web of Science
1. Recurrent Miscarriage Immunotherapy Trialists Group
. Worldwide collaborative observational study and meta analysis on allogenic leukocyte immunotherapy for recurrent spontaneous abortion. Am J Reprod Immunol 1994; 32: 55–72.
1. Watson A,
2. Vandekerckhove P,
3. Lilford R,
4. Vail A,
5. Brosens I,
6. Hughes E
. A meta-analysis of the therapeutic role of oil soluble contrast media at hysterosalpingography: a surprising result?Fertil Steril 1994; 61: 470–477.
OpenUrl PubMed Web of Science
↵
1. Pyorala S,
2. Huttunen NP,
3. Uhari M
. A review and meta-analysis of hormonal treatment of cryptorchidism. J Clin Endocrinol Metab 1995; 80: 2795–2799.
OpenUrl CrossRef PubMed Web of Science
1. Carroll D,
2. Tramer M,
3. McQuay H,
4. Nye B,
5. Moore A
. Randomization is important in studies with pain outcomes: systematic review of transcutaneous electrical nerve stimulation in acute postoperative pain. Br J Anaesth 1996; 77: 798–803.
OpenUrl Abstract/FREE Full Text
↵
1. Colditz GA,
2. Miller JN,
3. Mosteller F
. How study design affects outcomes in comparisons of therapy. I: medical. Stat Med 1989; 8: 441–454.
OpenUrl PubMed Web of Science
↵
1. Miller JN,
2. Colditz GA,
3. Mosteller F
. How study design affects outcomes in comparisons of therapy. II: surgical. Stat Med 1989; 8: 455–466.
OpenUrl CrossRef PubMed Web of Science
↵
1. Ottenbacher K
. Impact of random assignment on study outcome: an empirical examination. Control Clin Trials 1992; 13: 50–61.
OpenUrl CrossRef PubMed Web of Science
↵
1. Chalmers TC,
2. Celano P,
3. Sacks HS,
4. Smith H Jr.
. Bias in treatment assignment in controlled clinical trials. N Engl J Med 1983; 309: 1358–1361.
OpenUrl PubMed Web of Science
↵
1. Schulz KF,
2. Chalmers I,
3. Hayes RJ,
4. Altman DG
. Empirical evidence of bias. Dimensions of methodological quality associated with estimates of treatment effects in controlled trials. JAMA 1995; 273: 408–412.
OpenUrl CrossRef PubMed Web of Science
↵
1. Emerson JD,
2. Burdick E,
3. Hoaglin DC,
4. Mosteller F,
5. Chalmers TC
. An empirical study of the possible relation of treatment differences to quality scores in controlled randomized clinical trials. Control Clin Trials 1990; 11: 339–352.
OpenUrl CrossRef PubMed Web of Science
↵
1. Imperiale TF,
2. McCullough AJ
. Do corticosteroids reduce mortality from alcoholic hepatitis? A meta analysis of the randomized trials. Ann Intern Med 1990; 113: 299–307.
↵
1. Nurmohamed MT,
2. Rosendaal FR,
3. Buller HR,
4. Dekker E,
5. Hommes DW,
6. Vandenbroucke JP,
7. et al
. Low molecular weight heparin versus standard heparin in general and orthopaedic surgery: a meta-analysis. Lancet 1992; 340: 152–156.
OpenUrl CrossRef PubMed Web of Science
↵
1. Khan KS,
2. Daya S,
3. Jadad A
. The importance of quality of primary studies in producing unbiased systematic reviews. Arch Intern Med 1996; 156: 661–666.
OpenUrl CrossRef PubMed Web of Science
↵
1. Ortiz Z,
2. Shea B,
3. Suarez Almazor ME,
4. Moher D,
5. Wells GA,
6. Tugwell P
. The efficacy of folic acid and folinic acid in reducing methotrexate gastrointestinal toxicity in rheumatoid arthritis. A meta-analysis of randomized controlled trials. J Rheumatol 1998; 25: 36–43.
OpenUrl PubMed Web of Science
↵
1. Chalmers I
. Assembling comparison groups to assess the effects of health care. J R Soc Med 1997; 90: 379–386.
OpenUrl FREE Full Text
↵
1. NHS Centre for Reviews and Dissemination
. Database of abstracts of reviews of effectiveness. The Cochrane Library. Oxford: Update Software, 1998 Issue 3.
↵
1. Cochrane Empirical Methodological Studies Methods Group
. The Cochrane Library. Oxford: Update Software, 1998Issue 3.
↵
1. Forgie MA,
2. Wells PS,
3. Laupacis A,
4. Fergusson D
. Preoperative autologous donation decreases allogeneic transfusion but increases exposure to all red blood cell transfusion: results of a meta-analysis. Arch Intern Med 1998; 158: 610–616.
OpenUrl CrossRef PubMed Web of Science
↵
1. Colditz GA,
2. Brewer TF,
3. Berkey CS,
4. Wilson ME,
5. Burdick E,
6. Fineberg HV,
7. et al
. Efficacy of BCG vaccine in the prevention of tuberculosis. Meta analysis of the published literature. JAMA 1994; 271: 698–702.
OpenUrl CrossRef PubMed Web of Science
↵
1. Stieb D,
2. Frayha HH,
3. Oxman AD,
4. Shannon HS,
5. Hutchison BG,
6. Crombie F
. The effectiveness and usefulness of Haemophilus influenzae type b vaccines: a systematic overview (meta-analysis). Can Med Assoc J 1990; 142: 719–732.
OpenUrl Abstract
↵
1. Maynard A,
2. Chalmers I
1. Kleijnen J,
2. G⊘tzsche P,
3. Kunz RH,
4. Oxman AD,
5. Chalmers I
. So what's so special about randomisation?In: Maynard A, Chalmers I, eds. Non-random reflections on health services research: on the 25th anniversary of Archie Cochrane's effectiveness and efficiency. London: BMJ Publishing Group, 1997:93–106.
↵
1. Dicksersin K,
2. Min YI
. NIH clinical trials and publication bias. Online J Curr Clin Trials [serial online] 1993; document No 50.
1. Dickersin K
. How important is publication bias? A synthesis of available data. AIDS Education and Prevention 1997; 9(suppl A): 15–21.
OpenUrl PubMed Web of Science
1. Stern JM,
2. Simes RJ
. Publication bias: evidence of delayed publication of clinical research projects. BMJ 1997; 315: 640–645.
OpenUrl Abstract/FREE Full Text
↵
1. Ioannidis JPA
. Effect of the statistical significance of results on the time to completion and publication of randomized efficacy trials. JAMA 1998; 279: 281–286.
OpenUrl CrossRef PubMed Web of Science
↵
1. Counsell CE,
2. Clarke MJ,
3. Slattery J,
4. Sandercock PAG
. The miracle of DICE therapy for acute stroke: fact or fictional product of subgroup analysis?BMJ 1994; 309: 1677–1681.
OpenUrl Abstract/FREE Full Text
↵
1. Egger M,
2. Davey SG,
3. Schneider M,
4. Minder C
. Bias in meta-analysis detected by a simple, graphical test. BMJ 1997; 315: 629–634.
OpenUrl Abstract/FREE Full Text
↵
1. Guyatt GH, Sackett DL, Cook DJ, for the Evidence-Based Working Group
. Users' guides to the medical literature, II: how to use an article about therapy or prevention, A: are the results of the study valid?. JAMA 1993; 270: 2598–2601.
OpenUrl CrossRef PubMed Web of Science
↵
1. Dans AL,
2. Dans LF,
3. Guyatt GH,
4. Richardson S
. Users' guides to the medical literature:ow to decide on the applicability of clinical trial results to your patient. JAMA 1998; 279: 545–549.
OpenUrl CrossRef PubMed Web of Science
↵
1. Cochrane Methods Working Group on Applicability and Recommendations
. The Cochrane Library. Oxford: Update Software, 1998Issue 3.
↵
1. Guyatt GH, Sackett DL, Cook DJ, for the Evidence-Based Working Group
. Users' guides to the medical literature, II: how to use an article about therapy or prevention, B: what were the results and will they help me in caring for my patients?. JAMA 1994; 270: 59–63.
OpenUrl
↵
1. Silagy C,
2. Haines A
1. Oxman AD,
2. Flottorp S
. An overview of strategies to promote implementation of evidence based health care. In: Silagy C, Haines A, eds. Evidence based practice. London: BMJ Books, 1998:91–109.

View Abstract

[1] ↵
Cochrane AL
. Effectiveness and efficiency: random reflections on health services. London: Nuffield Provincial Hospitals Trust, 1972:20–25.

[2] Cochrane AL

[3] Committee for Evaluating Medical Technologies in Clinical Use
. Assessing medical technologies. Washington DC: National Academy Press, 1985:76–78.

[4] Committee for Evaluating Medical Technologies in Clinical Use

[5] ↵
US Congress, Office of Technology Assessment
. Identifying health technologies that work: searching for evidence, OTA-H-608. Washington DC: US Government Printing Office, 1994:41–51.

[6] US Congress, Office of Technology Assessment

[7] ↵
Black N
. Why we need observational studies to evaluate the effectiveness of health care. BMJ 1996; 312: 1215–1218.
OpenUrl FREE Full Text

[8] Black N

[9] ↵
Weiss CH
. Evaluation. Methods for studying programs and policies. 2nd ed. Upper Saddle River: Prentice Hall, 1998:229–233.

[10] Weiss CH

[11] ↵
Clarke M,
Carling C,
Oxman AD
Cochrane Review Methodology Database
. In: Clarke M, Carling C, Oxman AD, eds. The Cochrane Library. Oxford: Update Software, 1998Issue 3.

[12] Clarke M,

[13] Carling C,

[14] Oxman AD

[15] Cochrane Review Methodology Database

[16] ↵
Chalmers TC,
Matta RJ,
Smith H Jr.,
Kunzler AM
. Evidence favoring the use of anticoagulants in the hospital phase of acute myocardial infarction. N Engl J Med 1977; 297: 1091–1096.
OpenUrl PubMed Web of Science

[17] Chalmers TC,

[18] Matta RJ,

[19] Smith H Jr.,

[20] Kunzler AM

[21] ↵
Sacks H,
Chalmers TC,
Smith H Jr.
. Randomized versus historical controls for clinical trials. Am J Med 1982; 72: 233–240.
OpenUrl CrossRef PubMed Web of Science

[22] Sacks H,

[23] Chalmers TC,

[24] Smith H Jr.

[25] ↵
Diehl LF,
Perry DJ
. A comparison of randomized concurrent control groups with matched historical control groups: are historical controls valid?J Clin Oncol 1986; 4: 1114–1120.
OpenUrl Abstract/FREE Full Text

[26] Diehl LF,

[27] Perry DJ

[28] Reimold SC,
Chalmers TC,
Berlin JA,
Antman EM
. Assessment of the efficacy and safety of antiarrhythmic therapy for chronic atrial fibrillation: observations on the role of trial design and implications of drug related mortality. Am Heart J 1992; 124: 924–932.
OpenUrl CrossRef PubMed Web of Science

[29] Reimold SC,

[30] Chalmers TC,

[31] Berlin JA,

[32] Antman EM

[33] Recurrent Miscarriage Immunotherapy Trialists Group
. Worldwide collaborative observational study and meta analysis on allogenic leukocyte immunotherapy for recurrent spontaneous abortion. Am J Reprod Immunol 1994; 32: 55–72.

[34] Recurrent Miscarriage Immunotherapy Trialists Group

[35] Watson A,
Vandekerckhove P,
Lilford R,
Vail A,
Brosens I,
Hughes E
. A meta-analysis of the therapeutic role of oil soluble contrast media at hysterosalpingography: a surprising result?Fertil Steril 1994; 61: 470–477.
OpenUrl PubMed Web of Science

[36] Watson A,

[37] Vandekerckhove P,

[38] Lilford R,

[39] Vail A,

[40] Brosens I,

[41] Hughes E

[42] ↵
Pyorala S,
Huttunen NP,
Uhari M
. A review and meta-analysis of hormonal treatment of cryptorchidism. J Clin Endocrinol Metab 1995; 80: 2795–2799.
OpenUrl CrossRef PubMed Web of Science

[43] Pyorala S,

[44] Huttunen NP,

[45] Uhari M

[46] Carroll D,
Tramer M,
McQuay H,
Nye B,
Moore A
. Randomization is important in studies with pain outcomes: systematic review of transcutaneous electrical nerve stimulation in acute postoperative pain. Br J Anaesth 1996; 77: 798–803.
OpenUrl Abstract/FREE Full Text

[47] Carroll D,

[48] Tramer M,

[49] McQuay H,

[50] Nye B,

[51] Moore A

[52] ↵
Colditz GA,
Miller JN,
Mosteller F
. How study design affects outcomes in comparisons of therapy. I: medical. Stat Med 1989; 8: 441–454.
OpenUrl PubMed Web of Science

[53] Colditz GA,

[54] Miller JN,

[55] Mosteller F

[56] ↵
Miller JN,
Colditz GA,
Mosteller F
. How study design affects outcomes in comparisons of therapy. II: surgical. Stat Med 1989; 8: 455–466.
OpenUrl CrossRef PubMed Web of Science

[57] Miller JN,

[58] Colditz GA,

[59] Mosteller F

[60] ↵
Ottenbacher K
. Impact of random assignment on study outcome: an empirical examination. Control Clin Trials 1992; 13: 50–61.
OpenUrl CrossRef PubMed Web of Science

[61] Ottenbacher K

[62] ↵
Chalmers TC,
Celano P,
Sacks HS,
Smith H Jr.
. Bias in treatment assignment in controlled clinical trials. N Engl J Med 1983; 309: 1358–1361.
OpenUrl PubMed Web of Science

[63] Chalmers TC,

[64] Celano P,

[65] Sacks HS,

[66] Smith H Jr.

[67] ↵
Schulz KF,
Chalmers I,
Hayes RJ,
Altman DG
. Empirical evidence of bias. Dimensions of methodological quality associated with estimates of treatment effects in controlled trials. JAMA 1995; 273: 408–412.
OpenUrl CrossRef PubMed Web of Science

[68] Schulz KF,

[69] Chalmers I,

[70] Hayes RJ,

[71] Altman DG

[72] ↵
Emerson JD,
Burdick E,
Hoaglin DC,
Mosteller F,
Chalmers TC
. An empirical study of the possible relation of treatment differences to quality scores in controlled randomized clinical trials. Control Clin Trials 1990; 11: 339–352.
OpenUrl CrossRef PubMed Web of Science

[73] Emerson JD,

[74] Burdick E,

[75] Hoaglin DC,

[76] Mosteller F,

[77] Chalmers TC

[78] ↵
Imperiale TF,
McCullough AJ
. Do corticosteroids reduce mortality from alcoholic hepatitis? A meta analysis of the randomized trials. Ann Intern Med 1990; 113: 299–307.

[79] Imperiale TF,

[80] McCullough AJ

[81] ↵
Nurmohamed MT,
Rosendaal FR,
Buller HR,
Dekker E,
Hommes DW,
Vandenbroucke JP,
et al
. Low molecular weight heparin versus standard heparin in general and orthopaedic surgery: a meta-analysis. Lancet 1992; 340: 152–156.
OpenUrl CrossRef PubMed Web of Science

[82] Nurmohamed MT,

[83] Rosendaal FR,

[84] Buller HR,

[85] Dekker E,

[86] Hommes DW,

[87] Vandenbroucke JP,

[88] et al

[89] ↵
Khan KS,
Daya S,
Jadad A
. The importance of quality of primary studies in producing unbiased systematic reviews. Arch Intern Med 1996; 156: 661–666.
OpenUrl CrossRef PubMed Web of Science

[90] Khan KS,

[91] Daya S,

[92] Jadad A

[93] ↵
Ortiz Z,
Shea B,
Suarez Almazor ME,
Moher D,
Wells GA,
Tugwell P
. The efficacy of folic acid and folinic acid in reducing methotrexate gastrointestinal toxicity in rheumatoid arthritis. A meta-analysis of randomized controlled trials. J Rheumatol 1998; 25: 36–43.
OpenUrl PubMed Web of Science

[94] Ortiz Z,

[95] Shea B,

[96] Suarez Almazor ME,

[97] Moher D,

[98] Wells GA,

[99] Tugwell P

[100] ↵
Chalmers I
. Assembling comparison groups to assess the effects of health care. J R Soc Med 1997; 90: 379–386.
OpenUrl FREE Full Text

[101] Chalmers I

[102] ↵
NHS Centre for Reviews and Dissemination
. Database of abstracts of reviews of effectiveness. The Cochrane Library. Oxford: Update Software, 1998 Issue 3.

[103] NHS Centre for Reviews and Dissemination

[104] ↵
Cochrane Empirical Methodological Studies Methods Group
. The Cochrane Library. Oxford: Update Software, 1998Issue 3.

[105] Cochrane Empirical Methodological Studies Methods Group

[106] ↵
Forgie MA,
Wells PS,
Laupacis A,
Fergusson D
. Preoperative autologous donation decreases allogeneic transfusion but increases exposure to all red blood cell transfusion: results of a meta-analysis. Arch Intern Med 1998; 158: 610–616.
OpenUrl CrossRef PubMed Web of Science

[107] Forgie MA,

[108] Wells PS,

[109] Laupacis A,

[110] Fergusson D

[111] ↵
Colditz GA,
Brewer TF,
Berkey CS,
Wilson ME,
Burdick E,
Fineberg HV,
et al
. Efficacy of BCG vaccine in the prevention of tuberculosis. Meta analysis of the published literature. JAMA 1994; 271: 698–702.
OpenUrl CrossRef PubMed Web of Science

[112] Colditz GA,

[113] Brewer TF,

[114] Berkey CS,

[115] Wilson ME,

[116] Burdick E,

[117] Fineberg HV,

[118] et al

[119] ↵
Stieb D,
Frayha HH,
Oxman AD,
Shannon HS,
Hutchison BG,
Crombie F
. The effectiveness and usefulness of Haemophilus influenzae type b vaccines: a systematic overview (meta-analysis). Can Med Assoc J 1990; 142: 719–732.
OpenUrl Abstract

[120] Stieb D,

[121] Frayha HH,

[122] Oxman AD,

[123] Shannon HS,

[124] Hutchison BG,

[125] Crombie F

[126] ↵
Maynard A,
Chalmers I
Kleijnen J,
G⊘tzsche P,
Kunz RH,
Oxman AD,
Chalmers I
. So what's so special about randomisation?In: Maynard A, Chalmers I, eds. Non-random reflections on health services research: on the 25th anniversary of Archie Cochrane's effectiveness and efficiency. London: BMJ Publishing Group, 1997:93–106.

[127] Maynard A,

[128] Chalmers I

[129] Kleijnen J,

[130] G⊘tzsche P,

[131] Kunz RH,

[132] Oxman AD,

[133] Chalmers I

[134] ↵
Dicksersin K,
Min YI
. NIH clinical trials and publication bias. Online J Curr Clin Trials [serial online] 1993; document No 50.

[135] Dicksersin K,

[136] Min YI

[137] Dickersin K
. How important is publication bias? A synthesis of available data. AIDS Education and Prevention 1997; 9(suppl A): 15–21.
OpenUrl PubMed Web of Science

[138] Dickersin K

[139] Stern JM,
Simes RJ
. Publication bias: evidence of delayed publication of clinical research projects. BMJ 1997; 315: 640–645.
OpenUrl Abstract/FREE Full Text

[140] Stern JM,

[141] Simes RJ

[142] ↵
Ioannidis JPA
. Effect of the statistical significance of results on the time to completion and publication of randomized efficacy trials. JAMA 1998; 279: 281–286.
OpenUrl CrossRef PubMed Web of Science

[143] Ioannidis JPA

[144] ↵
Counsell CE,
Clarke MJ,
Slattery J,
Sandercock PAG
. The miracle of DICE therapy for acute stroke: fact or fictional product of subgroup analysis?BMJ 1994; 309: 1677–1681.
OpenUrl Abstract/FREE Full Text

[145] Counsell CE,

[146] Clarke MJ,

[147] Slattery J,

[148] Sandercock PAG

[149] ↵
Egger M,
Davey SG,
Schneider M,
Minder C
. Bias in meta-analysis detected by a simple, graphical test. BMJ 1997; 315: 629–634.
OpenUrl Abstract/FREE Full Text

[150] Egger M,

[151] Davey SG,

[152] Schneider M,

[153] Minder C

[154] ↵
Guyatt GH, Sackett DL, Cook DJ, for the Evidence-Based Working Group
. Users' guides to the medical literature, II: how to use an article about therapy or prevention, A: are the results of the study valid?. JAMA 1993; 270: 2598–2601.
OpenUrl CrossRef PubMed Web of Science

[155] Guyatt GH, Sackett DL, Cook DJ, for the Evidence-Based Working Group

[156] ↵
Dans AL,
Dans LF,
Guyatt GH,
Richardson S
. Users' guides to the medical literature:ow to decide on the applicability of clinical trial results to your patient. JAMA 1998; 279: 545–549.
OpenUrl CrossRef PubMed Web of Science

[157] Dans AL,

[158] Dans LF,

[159] Guyatt GH,

[160] Richardson S

[161] ↵
Cochrane Methods Working Group on Applicability and Recommendations
. The Cochrane Library. Oxford: Update Software, 1998Issue 3.

[162] Cochrane Methods Working Group on Applicability and Recommendations

[163] ↵
Guyatt GH, Sackett DL, Cook DJ, for the Evidence-Based Working Group
. Users' guides to the medical literature, II: how to use an article about therapy or prevention, B: what were the results and will they help me in caring for my patients?. JAMA 1994; 270: 59–63.
OpenUrl

[164] Guyatt GH, Sackett DL, Cook DJ, for the Evidence-Based Working Group

[165] ↵
Silagy C,
Haines A
Oxman AD,
Flottorp S
. An overview of strategies to promote implementation of evidence based health care. In: Silagy C, Haines A, eds. Evidence based practice. London: BMJ Books, 1998:91–109.

[166] Silagy C,

[167] Haines A

[168] Oxman AD,

[169] Flottorp S

The unpredictability paradox: review of empirical comparisons of randomised and non-randomised clinical trials

Abstract

Key messages

Introduction

Methods

Results

Randomised trials versus non-randomised trials of the same intervention

Randomised trials versus non-randomised trials across different interventions

Adequately concealed allocation versus inadequately concealed allocation

High quality trials versus low quality trials

Methodological quality

Discussion

Conclusion

Acknowledgments

Footnotes

References

Article alerts

Log in or register:

Download this article to citation manager

Help

Forward this page

Content links

About us

Resources

Explore BMJ

My account

Information

Search form

The unpredictability paradox: review of empirical comparisons of randomised and non-randomised clinical trials

Abstract

Key messages

Introduction

Methods

Results

Randomised trials versus non-randomised trials of the same intervention

Randomised trials versus non-randomised trials across different interventions

Adequately concealed allocation versus inadequately concealed allocation

High quality trials versus low quality trials

Methodological quality

Discussion

Conclusion

Acknowledgments

Footnotes

References

Article alerts

Log in or register:

Download this article to citation manager

Help

Forward this page

Content links

About us

Resources

Explore BMJ

My account

Information