BMJ 2000;321:255-256 ( 29 July )

Editorials

Which clinical studies provide the best evidence?

The best RCT still trumps the best observational study

A common question in clinical consultations is: "For this person, what are the likely effects of one treatment compared with another?" The central tenet of evidence based medicine is that this task is achieved by using the best evidence combined with consideration of that person's individual needs.1 A further question then arises: "What is the best evidence?" Two recent studies in the New England Journal of Medicine have caused uproar in the research community by finding no difference in estimates of treatment effects between randomised controlled trials and non-randomised trials.

The randomised controlled trial and, especially, systematic reviews of several of these trials are traditionally the gold standards for judging the benefits of treatments, mainly because it is conceptually easier to attribute any observed effect to the treatments being compared. The role of non-randomised (observational) studies in evaluating treatments is contentious: deliberate choice of the treatment for each person implies that observed outcomes may be caused by differences among people being given the two treatments, rather than the treatments alone. Unrecognised confounding factors can always interfere with attempts to correct for identified differences between groups.

These considerations have supported a hierarchy of evidence, with randomised controlled trials and derivatives at the top, controlled observational studies in the middle, and uncontrolled studies and opinion at the bottom. The best evidence to use in decisions is then the evidence highest in the hierarchy. Evidence from a lower level should be used only if there is no good randomised controlled trial to answer a particular clinical question. This view was supported by two studies that found larger effects in observational studies than in randomised controlled trials of the same treatment comparisons. 2 3

However, these findings were not confirmed by the two latest studies in the New England Journal of Medicine, which compared individual randomised controlled trials with observational studies in 19 therapeutic areas4 and meta-analyses of randomised controlled trials with meta-analyses of cohort and case-control studies in five therapeutic areas.5 No major differences were found between the estimates of treatment effects in the observational studies and randomised controlled trials.

Do these newer results overturn the idea of best evidence and mean that we should abandon the use of a hierarchy of evidence? The authors speculate that their latest comparisons of study designs failed to confirm older studies for two main reasons. Firstly, observational studies have improved (people who are given different treatments may be more comparable or researchers may be better at allowing for residual differences), and secondly, earlier comparisons used particularly poor observational designs (such as historical controls that use control data from a different set of people and from an earlier period than the one used for the treatment being studied).

However, an accompanying editorial6 found three additional problems with the latest comparisons of observational studies and randomised controlled trials. 4 5 Firstly, the search for corresponding randomised controlled trials and observational studies in well known journals selected a small, potentially atypical subgroup of available randomised controlled trials. Conclusions based on the selected therapies might not extend to other areas. Secondly, one observational study did not involve any treatment but explored risk factors in the general population. Thirdly, meta-analyses and randomised controlled trials published after the studies in the New England Journal of Medicine did not follow the same pattern and disagreed with results of corresponding observational studies. For example, a new meta-analysis of breast cancer screening that included weighting by quality of randomised controlled trial found no evidence of benefit, in contrast to results from observational studies,7 and a randomised controlled trial of hormone replacement therapy in menopausal women found no secondary prevention of coronary risk or reduced fracture risk, in contrast to numerous observational studies.8

Even before the papers in the New England Journal of Medicine an earlier systematic review also found no consistent difference between randomised controlled trials and observational studies in estimates of the effects of treatment in 22 areas.9 Differential quality of care, selection of people with a larger capacity to benefit, and publication bias against negative results from observational studies could explain larger treatment effects in either study design.

The issue is further confused by another systematic review published in JAMA that compared eight randomised controlled trials with non-randomised trials of the same intervention and found larger effects in five of the non-randomised trials.10

It is not surprising that high quality randomised controlled trials and high quality observational studies can sometimes produce similar answers. Not all observational studies are misleading. The hierarchy of evidence is merely a convenient rule of thumb that, all other things being equal, randomised controlled trials are more able to attribute effects to causes. Randomised controlled trials that are well conducted remain the gold standard for evidence of efficacy. However, small inadequate ones do not automatically trump any conflicting observational study. Identifying the best evidence for any question requires detailed appraisal---for example, relevance, allocation concealment (ensuring that the assignment of interventions are unpredictable by all involved in the trial until the point of allocation), intention to treat analysis, and relevant outcomes. If high quality randomised controlled trials exist for a clinical question then they trump any number of observational studies. Limited randomised controlled trials need other forms of evidence to be appraised and considered.

A similar debate took place centuries ago in English law. The legal "best evidence rule" initially created a rigid hierarchy of evidence (that original written documents took precedence over oral evidence). It was replaced by the flexible principle that the weight given to each bit of evidence should be determined by a detailed appraisal of the characteristics of that evidence.11

The new studies do not justify a major revision of the hierarchy of evidence, but they do support a flexible approach in which randomised controlled trials and observational studies have complementary roles. High quality observational studies may extend evidence over a wider population and are likely to be dominant in the identification of harms and when randomised controlled trials would be unethical or impractical.

Stuart Barton, executive editor, Clinical Evidence

BMA House, Tavistock Square, London WC1H 9JR (sbarton{at}bmjgroup.com)

Acknowledgments

Competing interests: SB edits a journal that uses a flexible hierarchy of evidence.



1. Sackett DI, Rosenberg WMC, Gray JAM, Haynes RB, Richardson WS. Evidence based medicine: what it is and what it isn't. BMJ 1996; 312: 71-72[Free Full Text].
2. Chalmers TC, Matta RJ, Smith Jr H, Kunzler A-M. Evidence favouring the use of anticoagulants in the hospital phase of acute myocardial infarction. N Engl J Med 1977; 297: 1091-1096[Abstract].
3. Sacks H, Chalmers TC, Smith Jr H. Randomized versus historical controls for clinical trials. Am J Med 1982; 72: 233-240[CrossRef][Medline].
4. Benson K, Hartz AJ. A comparison of observational and randomized controlled trials. N Engl J Med 2000; 342: 1878-1886[Abstract/Free Full Text].
5. Concato J, Shah N, Horwitz RI. Randomized, controlled trials, observational studies and the hierarchy of research designs. N Engl J Med 2000; 342: 1887-1892[Abstract/Free Full Text].
6. Pocock SJ, Elbourne DR. Randomized trials or observational tribulations? N Engl J Med 2000; 342: 1907-1909[Free Full Text].
7. Gotzsche PC, Olsen O. Is screening for breast cancer with mammography justifiable? Lancet 2000; 355: 129-134[CrossRef][Medline].
8. Hulley S, Grady D, Bush T, Furberg C, Herrington D, Riggs B, et al. Randomized trial of estrogen plus progestin for secondary prevention of coronary heart disease in postmenopausal women. JAMA 1998; 280: 605-613[Abstract/Free Full Text].
9. Britton A, McKee M, Black N, McPherson K, Sanderson C, Bain C. Choosing between randomised and non-randomised studies: a systematic review. Health Technol Assess 1998;2(13). www.hta.nhsweb.nhs.uk (accessed July 25 2000).
10. Kunz R, Oxman AD. The unpredictability paradox: review of empirical comparisons of randomised and non-randomised clinical trials. BMJ 1998; 317: 1185-1190[Abstract/Free Full Text].
11. Twining W. Rethinking evidence. Evanston, IL: Northwestern University Press, 1994.


© BMJ 2000

Add to CiteULike CiteULike   Add to Complore Complore   Add to Connotea Connotea   Add to Del.icio.us Del.icio.us   Add to Digg Digg   Add to Reddit Reddit   Add to Technorati Technorati    What's this?

This article has been cited by other articles:

  • Sonmezer, M., Bang, H., Oktay, K. (2008). In Reply: Can GnRH Analogues Preserve Gonadal Function Without Preserving Fertility?. The Oncologist 13: 615-617 [Full text]  
  • Hodgson, R., Bushe, C., Hunter, R. (2007). Measurement of long-term outcomes in observational and randomised controlled trials. Br. J. Psychiatry 191: s78-s84 [Abstract] [Full text]  
  • Grossberg, G., Irwin, P., Satlin, A., Mesenbrink, P., Spiegel, R. (2004). Rivastigmine in Alzheimer Disease: Efficacy Over Two Years. AJGP 12: 420-431 [Abstract] [Full text]  
  • Wright, J. M. (2004). The best type of trial. CMAJ 170: 1773-1774 [Full text]  
  • Pijak, M. R., Gazdik, F., Hrusovsky, S. (2004). The best type of trial. CMAJ 170: 1772-1773 [Full text]  
  • James, D., Young, A., Kulinskaya, E., Knight, E., Thompson, W., Ollier, W., Dixey, J. (2004). Orthopaedic intervention in early rheumatoid arthritis. Occurrence and predictive factors in an inception cohort of 1064 patients followed for 5 years. Rheumatology (Oxford) 43: 369-376 [Abstract] [Full text]  
  • Shrier, I (2003). Cochrane Reviews: new blocks on the kids. Br. J. Sports. Med. 37: 473-474 [Full text]  
  • Aldrich, R., Kemp, L., Williams, J. S., Harris, E., Simpson, S., Wilson, A., McGill, K., Byles, J., Lowe, J., Jackson, T. (2003). Using socioeconomic evidence in clinical practice guidelines. BMJ 327: 1283-1285 [Full text]  
  • Silliman, R. A. (2003). What Constitutes Optimal Care for Older Women With Breast Cancer?. JCO 21: 3554-3556 [Full text]  
  • Walker, J., Sofaer, B. (2003). Randomised controlled trials in the evaluation of non-biomedical therapeutic interventions for pain: The gold standard?. Journal of Research in Nursing 8: 317-329 [Abstract]  
  • Westcombe, A M, Gambles, M A, Wilkinson, S M, Barnes, K, Fellowes, D, Maher, E J, Young, T, Love, S B, Lucey, R A, Cubbin, S, Ramirez, A J (2003). Learning the hard way! Setting up an RCT of aromatherapy massage for patients with advanced cancer. Palliat Med 17: 300-307 [Abstract]  
  • Looker, A. C. (2003). Interaction of Science, Consumer Practices and Policy: Calcium and Bone Health as a Case Study. J. Nutr. 133: 1987S-1991 [Abstract] [Full text]  
  • Pirmohamed, M (2002). Best evidence and the clinical decision making process. Postgrad. Med. J. 78: 316-316 [Full text]  
  • Bleakley, C, MacAuley, D (2002). The quality of research in sports journals. Br. J. Sports. Med. 36: 124-125 [Abstract] [Full text]  
  • Weaver, N, Williams, J L, Weightman, A L, Kitcher, H N, Temple, J M F, Jones, P, Palmer, S (2002). Taking STOX: developing a cross disciplinary methodology for systematic reviews of research on the built environment and the health of the public. J. Epidemiol. Community Health 56: 48-55 [Abstract] [Full text]  
  • McMahon, A. D. (2001). Observation and Experiment with the Efficacy of Drugs: A Warning Example from a Cohort of Nonsteroidal Anti-inflammatory and Ulcer-healing Drug Users. Am J Epidemiol 154: 557-562 [Abstract] [Full text]  
  • Donnelly, C A, Cox, D R (2001). Mathematical biology and medical statistics: contributions to the understanding of AIDS epidemiology. Stat Methods Med Res 10: 141-154 [Abstract]  
  • Anderson, I. M (2001). Meta-analytical studies on new antidepressants. Br Med Bull 57: 161-178 [Abstract] [Full text]  
  • Bigby, M. (2001). Challenges to the Hierarchy of Evidence: Does the Emperor Have No Clothes?. Arch Dermatol 137: 345-346 [Full text]  

Rapid Responses:

Read all Rapid Responses

Restoring the balance
Somdutt Prasad
bmj.com, 2 Aug 2000 [Full text]
Observational studies yield valuable information
Peter Schuck
bmj.com, 9 Aug 2000 [Full text]
Best Evidence Rule
Nicholas Kadar
bmj.com, 21 Aug 2000 [Full text]
Enrolment of cancer patients in clinical trials
P Chung, et al.
bmj.com, 22 Aug 2000 [Full text]



Student BMJ

Intimate examinations

Israeli students are refusing to perform intimate examinations on anaesthetised women without their informed consent.

www.student.bmj.com

Listen to the latest BMJ Interview