Jump to: Page Content, Site Navigation, Site Search,
You are seeing this message because your web browser does not support basic web standards. Find out more about why this message is appearing and what you can do to make your experience on this site better.
Kevork Hopayian Seahills, Leiston Road,
Aldeburgh IP15 5PL k.hopayian{at}btinternet.com
The emergence of systematic reviews raised hopes of a new
era for the objective appraisal of evidence available on a given topic.
Such reviews promised a synthesis of trial results, which could be
conflicting, and an escape from the personal bias inherent in
traditional reviews and expert opinion.1 As the discipline of systematic reviews has evolved, however, two new problems have arisen: the quality of reviews is variable
2 3
; and two or more systematic reviews on the same topic may arrive at different conclusions, raising questions on the validity4-7 or the
relevance8 of the conclusions. Moreover, adherence to a
"checklist" system when appraising trials may overlook important
clinical details in the original trials and so reduce the validity of
the review. I uncovered this last shortcoming when I recently conducted
a study of three systematic reviews; the study is reported here.
Guidelines have been drawn up to improve the quality of
reviews.9 Differences in the quality of reviews,
however, do not always explain discordance. Jadad and
McQuay4 identified six sets of reviews covering six
topics in pain research; despite similar quality scores for reviews in
each set, four of the sets contained discordant reviews. Jadad et
al8 identified six generic differences between
reviews that might lead to discordance: the clinical question asked;
the selection and inclusion of studies; data extraction; assessment of
study quality; assessment of the ability to combine studies; and
statistical methods for data analysis.
The case of epidural steroid injection therapy for sciatica is a good
illustration of the evolution of reviews. The results of randomised
controlled trials of this treatment were inconsistent. Two traditional
reviews of these trials appeared
Table 1.
Background and method
Box 1
: Three part focused question
Box 2
: Quality of systematic reviews
Summary points
The discipline of systematic reviews has given clinicians a
valuable tool with which to synthesise evidence
As the methodology of systematic reviews has evolved, the quality of
reviews has improved
Nevertheless, high quality systematic reviews may overlook
important clinical details in the papers reviewed, thereby diminishing
their validity
This shortcoming might be avoided if trials were assessed from a
clinician's viewpoint as well as from a reviewer's viewpoint
![]()
Background
in 198510 and 1986.11 They reached discordant conclusions. A decade
later, two systematic reviews
by Watts and Silagy12 and
Koes et al13
also reached discordant conclusions. A
comparison of these reviews concluded that the difference in their
methods
namely, vote counting versus pooling
explained the
discordance.14 A further systematic review (of all types
of injection therapies, including epidural) was published by Nelemans
et al for the Cochrane Collaboration in 1999.15 The three
systematic reviews overlap in their nature (qualitative versus
quantitative), method for assessing the quality of randomised
controlled trials (following that of ter Riet et al16 or
Chalmers et al17), and conclusions (table 1). I therefore used them to conduct a general study of the validity of systematic reviews.
![]()
Assessing the validity of the three reviews
My interest in the epidural steroid injection treatment for
sciatica stems from a question arising in general practice and a
general practice commissioning board. It was framed as a three part,
focused question (box 1).18 I retrieved the relevant
trials that were included in all three reviews and critically appraised
each individual paper for validity and relevance to this
question.
19 20
Patients with sciatica
Injection of corticosteroid
into the epidural space compared with placebo or injection of local
anaesthetic
Which intervention leads to quicker
pain relief?
Criteria
Nelemans et al15
Koes et
al13
Watts and Silagy12
Were the search
methods used to find evidence (original research) on the primary
questions stated?
Yes
Yes
Yes
Was
the search for evidence reasonably comprehensive?
The most
comprehensive (Medline and Embase, no language
restriction)
Reasonably but the least comprehensive (Medline,
restricted to English language only)
Medline, no language
restriction.
Were the criteria used for deciding which studies to
include in the overview reported?
Yes
Yes
Yes
Was bias in
the selection of studies avoided?
Yes
Yes
Yes
Were the
criteria used for assessing the validity of the included studies
reported?
Yes (scale of 0-100, following ter Riet et
al16)
Yes (scale of 0-100 following ter Riet et
al16)
Yes (scale of 3-9 following Chalmers et
al17)
Was the validity of all the studies referred to
in the text assessed using appropriate criteria (either in selecting
studies for inclusion or in analysing the studies that are
cited)?
Not applicable (issue explored in this article)
Not
applicable (issue explored in this article)
Not applicable (issue
explored in this article)
Were the methods used to combine the
findings of the relevant studies (to reach a conclusion)
reported?
Yes
Yes (but see answer to next question)
Yes
Were the findings of the relevant studies combined appropriately,
relative to the primary question that the overview
addresses?
Partly, but one of the issues explored in this study was
whether combination was reasonable
Difficult to say, as combination
with pooling was not attempted; results were used for "vote
counting"
Partly, but one of the issues explored in this study was
whether combination was reasonable
Were the conclusions drawn by
the author(s) supported by the data and/or analysis reported in the
overview?
Yes (within the review's own terms)
Yes (within the
review's own terms)
Yes (within the review's own terms)
These questions on criteria have been taken from Oxman and
Guyatt.21 A further question ("How would you rate the
scientific quality of this overview?") asks the rater to give the
review a numerical score.
Findings
All three reviews were of high quality according to the Oxman and
Guyatt index (box 2). Three problems, however, compromised their
validity: the relevance of the study population (inclusion of atypical
populations); the appropriateness of the intervention (inclusion of one
study with a serious problem in its design); and the adequacy of the
outcome measures (inclusion of studies with inappropriate outcome assessments).
Atypical populations
Both the Koes and the Nelemans reviews included atypical
populations
notably patients with pain despite or because of spinal
surgery.
22 23
One trial had a high proportion of patients
with arachnoiditis,24 which can be a complication of surgery and of epidural injections when the steroid used is
methylprednisolone. These populations are clinically and pathologically
distinct from patients with back pain or sciatica who are treated by
most clinicians and included in all the other trials.
that is, the pooling of results from
studies with heterogeneous populations
has been cogently defended,25 guidelines warn against combining studies that
are too heterogeneous.9 The fundamental differences
between most of the randomised controlled trials and the atypical ones
means that lumping in this case make no clinical sense.
Flawed design
Koes contended that a design could be "fatally" flawed through
the use of a checklist system to score randomised controlled trials:
"One of the drawbacks of using this list of methodological criteria
might be that trials showing a fatal mistake . . . might end up with a
high score because of other criteria."13
| ||||||||||||||||
Inadequate outcome measures
Several validated tools for assessing outcome for musculoskeletal
and back pain research are available, measuring pain, disability, or
both.27 Some of the early primary studies used
idiosyncratic tools that fell short of the standards we now expect of
modern research. There are two consequences for modern reviews: the
results of the older trials are less reliable, and their format means
they are not comparable with modern studies. The trials by Beliveau et
al (1971)28 and by Snoek et al (1977)29 (box
3) used idiosyncratic outcome assessments but were included in the
reviews by Watts and by Koes. Both Nelemans and Watts included Beliveau
(and Cuckler26) in their pooling, which casts doubt on
their results. As Messerli said in another context: "A meta-analysis is like a Mediterranean bouillabaisse
in concert, all ingredients will
enhance its delightful flavour but, no matter how much fresh fish is
added, one rotten fish will make it
stink."30
|
| |
Conclusion |
|---|
Does this mean that no conclusions can be drawn from the original randomised controlled trials? Certainly not. Analysis shows that most trials in this field were conducted at a time when trial methodology was less rigorous than it is now. The poor quality of some trials means that we must disregard their findings, or at least resist the temptation to pool them in a meta-analysis. One trial stands out: the trial by Carette et al31 was, at the time of the Nelemans review, the most recent, largest, and most rigorous. Nelemans awarded it a quality score of 76%. This trial was the best evidence available at the time, and therefore we should use its results to inform our decisions. To pool it with others of inferior quality is to accept uncritically that a meta-analysis must be better than a single trial. A large, rigorous trial provides better evidence than a non-credible meta-analysis.
Smith et al32 drew a distinction between the quality and the validity of randomised controlled trials. Quality relates to the conduct of the trial; the scoring systems mentioned above are among several that aim to measure quality. Validity relates to the ability of the trial to answer the question. We can draw a similar distinction in systematic reviews. The quality of the three systematic reviews is high, but their validity is compromised by overlooking important details in the trials themselves. The fact that these oversights occurred in not just one but all three reviews of the same topic suggests that it may be a general rather than an isolated problem. Clinicians were involved in all three reviews, so the oversights did not arise from a lack of involvement by clinicians. Perhaps it was the type of involvement.
This analysis suggests that reading a paper from a clinician's
viewpoint is different from reading a paper from the viewpoint of a
reviewer, who has a duty to apply a set of criteria from a checklist.
Clinicians, whose usefulness up to now has been seen as "content
experts" in systematic review teams, may be able to contribute to the
future evolution of systematic reviews by exploring these different viewpoints.
| |
Footnotes |
|---|
Funding: KH holds a primary care enterprise award from the research and development division of the Eastern regional office of the NHS Executive and has been awarded a grant from the Claire Wand Fund.
Competing interests: None declared.
| |
References |
|---|
| 1. | Mulrow C. The medical review article; state of the science. Ann Intern Med 1987; 106: 485-488. |
| 2. |
Jadad A, Moher M, Browman G, Booker L, Sigouin C, Fuentes M, et al.
Systematic reviews and meta-analyses on treatment of asthma: critical evaluation.
BMJ
2000;
320:
537-540 |
| 3. | Furlan A, Clarke J, Esmail R, Sinclair S, Irvin E, Bombardier C. A critical review of reviews on the treatment of chronic low back pain. Spine 2001; 26: E155-E162[CrossRef][Medline]. |
| 4. | Jadad A, McQuay HJ. Meta-analyses to evaluate analgesic interventions: a systematic qualitative review of their methodology. J Clin Epidemiol 1996; 49: 235-243[CrossRef][Medline]. |
| 5. | Prins J, Buller H. Meta-analysis: the final answer, or even more confusion? Lancet 1996; 348: 199[Medline]. |
| 6. |
Petticrew M, Kennedy S.
Detecting the effects of thromboprophylaxis: the case of the rogue reviews.
BMJ
1997;
315:
665-668 |
| 7. |
Lindback M, Hjortdahl P.
How do two meta-analyses of similar data reach opposite conclusions?
BMJ
1999;
318:
873-874 |
| 8. | Jadad AR, Cook DJ, Browman GP. A guide to interpreting discordant systematic reviews. Can Med Assoc J 1997; 156: 1411-1416[Abstract]. |
| 9. | NHS Centre for Reviews and Dissemination. Undertaking systematic reviews of research on effectiveness. Guidelines for those carrying out or commissioning reviews. York: NHS Centre for Reviews and Dissemination, University of York, 2001. |
| 10. | Kepes E, Duncalf D. Treatment of backache with spinal injections of local anesthetics, spinal and systemic steroids. A review. Pain 1985; 22: 33-47[CrossRef][Medline]. |
| 11. | Benzon H. Epidural steroid injections for low back pain and lumbosacral radiculopathy. Pain 1986; 224: 277-295. |
| 12. | Watts R, Silagy C. A meta-analysis on the efficacy of epidural corticosteroids in the treatment of sciatica. Anaesth Intens Care 1995; 23: 564-569[Medline]. |
| 13. | Koes B, Scholten R, Mens J, Bouter L. Efficacy of epidural injections for low-back pain and sciatica: a systematic review of randomized clinical trials. Pain 1995; 63: 279-288[CrossRef][Medline]. |
| 14. | Hopayian K, Mugford M. Conflicting conclusions from two systematic reviews of epidural steroid injections for sciatica: which evidence should general practitioners heed? Br J Gen Pract 1999; 49(Jan): 57-61[Medline]. |
| 15. | Nelemans P, Bie RA de, Vet HCW de, Sturmans F. Injection therapy for subacute and chronic benign low back pain. In: Cochrane Database of Syst Rev , 2001;(3):CD001824. |
| 16. | Ter Riet G, Kleijnen J, Knipschild P. Acupuncture and chronic pain: a criteria based meta-analysis. J Clin Epidemiol 1990; 43: 1191-1199[CrossRef][Medline]. |
| 17. | Chalmers TC, Smith Jr H, Blackburn B, Silverman B, Schroede B, Reitman D, et al. A method for assessing the quality of a randomized control trial. Controlled Clinical Trials 1981; 2(1): 31-49[CrossRef][Medline]. |
| 18. | Richardson W, Wilson M, Nishikawa J, Hayward R. The well-built clinical question: a key to evidence based decisions. ACP Journal Club 1995; 123: A12-A13[Medline]. |
| 19. |
Guyatt GH, Sackett DL, Cook DJ.
Users' guides to the medical literature. II. How to use an article about therapy or prevention. A. Are the results of the study valid?
JAMA
1993;
270:
2598-2601 |
| 20. |
Guyatt GH, Sackett DL, Cook DJ.
Users' guides to the medical literature. II. How to use an article about therapy or prevention. B. What were the results and will they help me in caring for my patients?
JAMA
1994;
271:
59-56 |
| 21. | Oxman A, Guyatt G. Validation of an index of the quality of review articles. J Clin Epidemiol 1991; 44: 91-98[Medline]. |
| 22. | Dallas T, Lin R, Wu W, Wolskee P. Epidural morphine and methylprednisolone for low-back pain. Anesthesiology 1987; 67: 408-411[CrossRef][Medline]. |
| 23. | Rocco A, Frank E, Kaul A, Lipson S, Gallo J. Epidural steroids, epidural morphine and epidural steroids combined with morphine in the treatment of post-laminectomy syndrome. Pain 1989; 36: 297-303[CrossRef][Medline]. |
| 24. | Glynn C, Dawson D, Sanders R. A double-blind comparison between epidural morphine and epidural clonidine in patients with chronic non-cancer pain. Pain 1988; 34: 123-128[CrossRef][Medline]. |
| 25. |
Gøtzsche P.
Why we need a broad perspective on meta-analysis.
BMJ
2000;
321:
585-586 |
| 26. |
Cuckler JM, Bernini PA, Wiesel SW, Booth Jr RE, Rothman RH, Pickens GT.
The use of epidural steroids in the treatment of lumbar radicular pain. A prospecitive, randomized, double-blind study.
J Bone Joint Surg Am
1985;
67(1):
63-66 |
| 27. | Ruta D, Garratt A, Wardlaw D, Russell I. Developing a valid and reliable measure for health outcome for patients with low back pain. Pain 1994; 19: 1187-1196. |
| 28. | Beliveau P. A comparison between epidural anaesthesia with and without corticosteroid in the treatment of sciatica. Rheum Phys Med 1971; 11: 40-43. |
| 29. | Snoek W, Weber H, Jørgensen B. Double blind evaluation of extradural methyl prednisolone for herniated lumbar discs. Acta Orthop Scand 1977; 48: 635-641[Medline]. |
| 30. | Messerli F. Meta-analysis. Are calcium antagonists safe? Lancet 1985:767-8. |
| 31. |
Carette S, Leclaire R, Marcoux S, Morin F, Blaise G, St Pierre A, et al.
Epidural corticosteroid injections for sciatica due to herniated nucleus pulposus.
N Engl J Med
1997;
336:
1634-1640 |
| 32. | Smith AS, Oldman A, McQuay H, Moore R. Teasing apart quality and validity in systematic reviews: an example from acupuncture trials in chronic neck and back pain. Pain 2000; 86: 119-132[CrossRef][Medline]. |
(Accepted 25 June 2001)
Read all Rapid Responses
What can you learn from this BMJ paper? Read Leanne Tite's Paper+