BMJ  2004;329:1093-1096 (6 November), doi:10.1136/bmj.329.7474.1093

Education and debate

Users' guide to detecting misleading claims in clinical research reports

Victor M Montori, Assistant Professor1, Roman Jaeschke, clinical professor of medicine2, Holger J Schünemann, associate professor of medicine3, Mohit Bhandari, assistant professor of medicine3, Jan L Brozek, lecturer in medicine4, P J Devereaux, assistant professor of medicine3, Gordon H Guyatt, professor of medicine3

1 Department of Medicine, Mayo Clinic College of Medicine, Rochester MN, USA, 2 Department of Medicine, McMaster University, Hamilton ON, Canada, 3 Department of Clinical Epidemiology and Biostatistics, McMaster University, 4 Polish Institute for Evidence Based Medicine, Jagiellonian University Medical School, Krakow, Poland

Correspondence to: G H Guyatt guyatt{at}mcmaster.ca

Plenty of advice is available to help readers identify studies with weak methods, but would you be able to identify misleading claims in a report of a well conducted study?

Introduction

Science is often not objective.1 Emotional investment in particular ideas and personal interest in academic success may lead investigators to overemphasise the importance of their findings and the quality of their work. Even more serious conflicts arise when for-profit organisations, including pharmaceutical companies, provide funds for research and consulting, conduct data management and analyses, and write reports on behalf of the investigators.

Although guides to help recognise methodological weaknesses that may introduce bias are now widely available,2 3 these criteria do not protect readers against misleading interpretations of methodologically sound studies. In this article, we present a guide that provides clinicians with tools to defend against biased inferences from research studies (box).


Guide to avoid being misled by biased presentation and interpretation of data

  1. Read only the Methods and Results sections; bypass the Discussion section
  2. Read the abstract reported in evidence based secondary publications
  3. Beware faulty comparators
  4. Beware composite endpoints
  5. Beware small treatment effects
  6. Beware subgroup analyses


Read methods and results only

The discussion section of research reports often offers inferences that differ from those a dispassionate reader would draw from the methods and results.4 The table gives details of two systematic reviews summarising a similar set of randomised trials assessing the effect of albumin for fluid resuscitation. The trials included in both reviews were small and methodologically weak, and their results are heterogeneous. Both the reviews provide point estimates suggesting that albumin may increase mortality and confidence intervals that include the possibility of a considerable increase in mortality. Nevertheless, one set of authors took a strong position that albumin is dangerous, the other that it is not. Their positions were consistent with the interests of funders of their reviews.5


View this table:
[in this window]
[in a new window]
 
Comparison of two systematic reviews of albumin for fluid resuscitation

 

This is not an idiosyncratic example. Systematic examinations of the relation between funding and conclusions have found that the odds of recommending an experimental drug as treatment of choice increases fivefold with for-profit organisation funding (odds ratio 5.3, 95% confidence interval 2.0 to 14.4) compared with not-for-profit funding).6

If editors insisted that discussion sections of original articles included a systematic review of the relevant literature, this first pointer would no longer be relevant. However, few original trial reports include systematic reviews,7 and this may not change in the foreseeable future.

To follow this advice, readers must be able to make sense of the methods and results. Fortunately, clinicians can access many educational materials to acquire skills in interpreting studies' designs and their findings.2 3

Read the abstract reported in pre-appraised resources

Secondary journals, such as the ACP Journal Club, Evidence-Based Medicine, and Evidence-Based Mental Health, publish structured abstracts produced by a team of clinicians and methodologists. These abstracts often include critical information about the research methods (allocation concealment, blinding, completeness of follow up) omitted from the original papers.8 The conclusions of this "secondary" abstract are the product of critical appraisal by people without competing financial or personal interests.

The objectivity and methodological sophistication of those preparing the independent structured abstracts will often provide additional value for clinicians. For example, substantial discrepancies occur between the full report of the PROGRESS trial and the abstract in ACP Journal Club.w3 w4 The title of the original publication describes the study as testing "A perindopril-based blood pressure lowering regimen" and reports that the perindopril regimen resulted in a 28% relative risk reduction in the risk of recurrent stroke (95% confidence interval 17-38%).w3 The ACP Journal Club abstract and its commentary identified the publication as describing two parallel but separate randomised placebo controlled trials including about 6100 patients with a history of stroke or transient ischaemic attack:

  • In one trial, patients were randomised to receive perindopril or placebo, and active treatment had no appreciable effect on stroke (relative risk reduction = 5%, 95% confidence interval - 19% to 23%).
  • In the second trial, patients were allocated to receive perindopril plus indapamide or double placebo. Combined treatment resulted in a 43% relative risk reduction (30% to 54%) in recurrent stroke.

The ACP Journal Club commentary notes that the authors disagree with the interpretation of the publication as reporting two separate trials (which explains why it is difficult for even the knowledgeable reader to get a clear picture of the design from the original publication).

Beware faulty comparators

Several systematic reviews have shown that industry funded studies typically yield larger treatment effects than not-for-profit funded studies.9 10 One likely explanation is choice of comparators.11 Researchers with an interest in a positive result may choose a placebo comparator rather than an alternative drug with proved effectiveness. For instance, in a study of 136 trials of new treatments for multiple myeloma, 60% of studies funded by for-profit organisations, but only 21% of trials funded by not-for-profit organisations, compared their new interventions against placebo or no treatment.12 Box A on bmj.com gives other examples.

When reading reports of randomised trials, clinicians should ask themselves: "Should the comparator have been another active agent rather than placebo; if investigators chose an active comparator, was the dose, formulation, and administration regimen optimal?"

Beware composite end points

Investigators often use composite end points to enhance the statistical efficiency of clinical trials. Problems in the interpretation of these trials arise when composite end points include component outcomes to which patients attribute very different importance—for example, the primary outcome in a trial of irbesartan v placebo for diabetic nephropathy was a composite end point including death, end stage renal disease, and doubling of serum creatinine concentration.13 Problems may also arise when the most important end point occurs infrequently or when the apparent effect on component end points differs. Thus, the reported effect on "important cardiovascular end points" may reflect mostly the effect of treatment on angina rather than on mortality and non-fatal myocardial infarction (see bmj.com for another example).

When the more important outcomes occur infrequently, clinicians should focus on individual outcomes rather than on composite end points. Under these circumstances, inferences about the effect of treatment on the more important end points (which, because they occur infrequently will have very wide confidence intervals) will be weak. In focusing on the more reliable estimates of effects on the less important outcomes, readers will see that, for instance, a putative reduction in death, myocardial infarction, and revascularisation is really just an effect on the frequency of revascularisation.14

Beware small treatment effects

Increasingly, investigators are conducting very large trials to detect small treatment effects. Results suggest small treatment effects when either the point estimate is close to no effect (a relative or absolute risk reduction close to 0; a relative risk or odds ratio close to 1) or the confidence interval includes values close to no effect. In one large trial, investigators randomly allocated just over 6000 participants to receive angiotensin converting enzyme inhibitors or diuretics for hypertension and concluded "initiation of antihypertensive treatment involving ACE inhibitors in older subjects... seems to lead to better outcomes than treatment with diuretic agents."15 In absolute terms, the difference between the regimens was very small: there were 4.2 events per 100 patient years in the angiotensin converting enzyme group and 4.6 events per 100 patient years in the diuretic group. The relative risk reduction corresponding to this absolute difference (11%) had an associated 95% confidence interval of - 1% to 21%.

Here, we have two reasons to doubt the importance of the apparent difference between the two types of antihypertensive drug. Firstly, the point estimate suggests a very small absolute difference (0.4 events per 100 patient years) and, secondly, the confidence interval suggests it may have been even smaller—indeed, there may have been no true difference at all.

When the absolute risk of adverse events in untreated patients is low, the presentation may focus on relative risk reduction and de-emphasise or ignore absolute risk reduction. Other techniques for making treatment effects seem large include misleading graphical representations,16 and using different time frames to present harms and benefits (see box C on bmj.com).

Beware subgroup analyses

Clinicians often wonder if a subgroup of patients will achieve larger benefits when exposed to a particular treatment. Because the play of chance can lead to apparent but spurious differences in effects between subgroups, clinicians should be cautious about reports of subgroup analyses.


Summary points

Many interests may influence researchers towards favourable interpretation and presentation of their findings

Guides helping clinicians to identify weak studies that are open to bias are available

Clinicians also need to be able to detect when authors of methodologically strong studies make misleading claims

Six pointers to help clinicians avoid being misled are presented


In the trial of angiotensin converting enzyme inhibitor versus diuretic based antihypertensive treatment above, the complete conclusion reads "initiation of antihypertensive treatment involving ACE inhibitors in older subjects, particularly men, appears to lead to better outcomes than treatment with diuretic agents."15 The possibility that effect differed by sex was not one of a small number of prior hypotheses, and the size of the difference in effect was small (relative risk reductions of 17% (95% confidence interval 3 to 29%) in men and 0% (- 20% to 17%) in women).

The difference between the angiotensin converting enzyme inhibitors and diuretics was significant in men but not women. Investigators might use this fact to argue for differences in the effect between men and women. The real issue, however, is whether chance is a sufficient explanation for the difference in effect between women and men, and it is in this case. The P value associated with the null hypothesis that the underlying relative risk is identical in men and women is 0.15. Thus, if there were no true difference in effect between men and women, we would see apparent differences of 17%, or greater, 15% of the time. Overall, the inference that the effect differs in men satisfies only one of seven criteria for a valid subgroup analysis.17

Clinicians may sometimes encounter parallel examples when investigators discount effects of beneficial treatments because of apparent lack of effects in subgroups. Here, clinicians must ask themselves whether there is strong evidence that patients in a subgroup should be denied the beneficial treatment. The answer will usually be "no." In general, we advise clinicians to be sceptical concerning claims of differential treatment effects in subgroups of patients.

Conclusion

We have presented six pointers to help clinicians protect themselves and their patients from potentially misleading presentations and interpretations of research findings. These strategies are unlikely to be foolproof. Decreasing the dependence of the research endeavour on for-profit funding, implementing a requirement for mandatory registration of clinical trials, and instituting more structured approaches to reviewing and reporting research18 19 may reduce biased reporting. At the same time, it is likely that potentially misleading reporting will always be with us, and the guide we have presented will help clinicians to stay armed.


{webplus.f1}Illustrative examples and references w1-w15 are on bmj.com

Contributors and sources: The authors are clinical epidemiologists who are involved in critical review of medical and surgical studies as part of their educational and editorial activities. In that context, they have collected some of the examples presented here. Other examples were collected by searching titles of records in PubMed using the textwords "misleading," "biased," and "spin" using the Related Articles feature for every pertinent hit, and by noting relevant references from retrieved papers. VMM and GHG wrote the first draft of the article. All authors contributed to the ideas represented in the article, made critical contributions and revisions to the first draft, and approved the final version. GHG is guarantor.

Funding: VMM is a Mayo Foundation scholar. PJD is supported by a Canadian Institutes of Health Research senior research fellowship award. MB was supported by a Detweiler fellowship, Royal College of Physicians and Surgeons of Canada and is currently supported by a Canada Research Chair.

Competing interests: VMM, RJ, HJS, PJD, and GHG are associate editors of the ACP Journal Club and Evidence-Based Medicine. JLB and RJ edit an evidence based medicine journal in Poland. MB is editor of the evidence based orthopaedic trauma section in the Journal of Orthopaedic Trauma.

References

  1. Horton R. The rhetoric of research. BMJ 1995;310: 985-7.[Free Full Text]
  2. Greenhalgh T. How to read a paper. London: BMJ Books, 2001.
  3. Guyatt G, Rennie D, eds. Users' guides to the medical literature. A manual for evidence-based clinical practice. Chicago, IL: AMA Press, 2002.
  4. Bero LA, Rennie D. Influences on the quality of published drug studies. Int J Technol Assess Health Care 1996;12: 209-37.[ISI][Medline]
  5. Cook D, Guyatt G. Colloid use for fluid resuscitation: evidence and spin. Ann Intern Med 2001;135: 205-8.[Free Full Text]
  6. Als-Nielsen B, Chen W, Gluud C, Kjaergard LL. Association of funding and conclusions in randomized drug trials: a reflection of treatment effect or adverse events? JAMA 2003;290: 921-8.[Abstract/Free Full Text]
  7. Clarke M, Alderson P, Chalmers I. Discussion sections in reports of controlled trials published in general medical journals. JAMA 2002;287: 2799-801.[Abstract/Free Full Text]
  8. Devereaux PJ, Manns BJ, Ghali WA, Quan H, Guyatt GH. Reviewing the reviewers: the quality of reporting in three secondary journals. CMAJ 2001;164: 1573-6.[Abstract/Free Full Text]
  9. Bekelman JE, Li Y, Gross CP. Scope and impact of financial conflicts of interest in biomedical research: a systematic review. JAMA 2003;289: 454-65.[Abstract/Free Full Text]
  10. Lexchin J, Bero LA, Djulbegovic B, Clark O. Pharmaceutical industry sponsorship and research outcome and quality: systematic review. BMJ 2003;326: 1167-70.[Abstract/Free Full Text]
  11. Mann H, Djulbegovic B. Biases due to differences in the treatments selected for comparison (comparator bias). James Lind Library. www.jameslindlibrary.org/essays/bias/comparator-_bias.html (accessed 2 Oct 2004).
  12. Djulbegovic B, Lacevic M, Cantor A, Fields KK, Bennett CL, Adams JR, et al. The uncertainty principle and industry-sponsored research. Lancet 2000;356: 635-8.[CrossRef][ISI][Medline]
  13. Lewis EJ, Hunsicker LG, Clarke WR, Berl T, Pohl MA, Lewis JB, et al. Renoprotective effect of the angiotensin-receptor antagonist irbesartan in patients with nephropathy due to type 2 diabetes. N Engl J Med 2001;345: 851-60.[Abstract/Free Full Text]
  14. Waksman R, Ajani AE, White RL, Chan RC, Satler LF, Kent KM, et al. Intravascular gamma radiation for in-stent restenosis in saphenous-vein bypass grafts. N Engl J Med 2002;346: 1194-9.[Abstract/Free Full Text]
  15. Wing LM, Reid CM, Ryan P, Beilin LJ, Brown MA, Jennings GL, et al. A comparison of outcomes with angiotensin-converting-enzyme inhibitors and diuretics for hypertension in the elderly. N Engl J Med 2003;348: 583-92.[Abstract/Free Full Text]
  16. Tufte E. The visual display of quantitative information. Cheshire, CT: Graphics Press, 1983.
  17. Oxman A, Guyatt G, Green L, Craig J, Walter S, Cook D. When to believe a subgroup analysis. In: Guyatt G, Rennie D, eds. Users' guides to the medical literature. A manual for evidence-based clinical practice. Chicago, IL: AMA Press, 2002: 553-65.
  18. Docherty M, Smith R. The case for structuring the discussion of scientific papers. BMJ 1999;318: 1224-5.[Free Full Text]
  19. Moher D, Schulz KF, Altman DG. The CONSORT statement: revised recommendations for improving the quality of reports of parallel-group randomised trials. Lancet 2001;357: 1191-4.[CrossRef][ISI][Medline]
(Accepted 15 September 2004)

Related Articles

Waking up from the DREAM of preventing diabetes with drugs
Victor M Montori, William L Isley, and Gordon H Guyatt
BMJ 2007 334: 882-884. [Extract] [Full Text] [PDF]

Trial protocols: time for more than tinkering
James Penston
BMJ 2005 330: 421. [Extract] [Full Text]

Users' guide to detecting misleading claims in research: Misleading claims may be symptom of even more serious flaws
James Penston
BMJ 2005 330: 145. [Extract] [Full Text] [PDF]

Users' guide to detecting misleading claims in research: Discussion section helps redress the balance
S Anuradha
BMJ 2005 330: 145. [Extract] [Full Text]

Users' guide to detecting misleading claims in research: Meta-analyses have led to misleading claims in the past
Peter J Horsey
BMJ 2005 330: 145. [Extract] [Full Text]

Avoid being misled by claims in robust studies
BMJ 2004 329: 0. [Full Text]

This article has been cited by other articles:

  • Keeley, P. W., Waterhouse, E. T., Noble, S. I.R. (2007). The evidence base of palliative medicine: is inpatient palliative medicine evidence-based?. Palliat Med 21: 623-627 [Abstract]  
  • Abedin, P., Daniels, J., Khan, K. S (2007). Health technology assessment in obstetrics and gynaecology. Part 2: application in practice. The Obstetrician and Gynaecologist 9: 181-187 [Abstract] [Full text]  
  • Montori, V. M, Isley, W. L, Guyatt, G. H (2007). Waking up from the DREAM of preventing diabetes with drugs. BMJ 334: 882-884 [Full text]  
  • (2005). Hit parade. BMJ 330: 485-485 [Full text]  
  • Penston, J. (2005). Trial protocols: time for more than tinkering. BMJ 330: 421-421 [Full text]  
  • Penston, J. (2005). Users' guide to detecting misleading claims in research: Misleading claims may be symptom of even more serious flaws. BMJ 330: 145-145 [Full text]  
  • Anuradha, S (2005). Users' guide to detecting misleading claims in research: Discussion section helps redress the balance. BMJ 330: 145-145 [Full text]  
  • Horsey, P. J (2005). Users' guide to detecting misleading claims in research: Meta-analyses have led to misleading claims in the past. BMJ 330: 145-145 [Full text]  

Rapid Responses:

Read all Rapid Responses

Ethical implications of study design
T.L. James
bmj.com, 5 Nov 2004 [Full text]
Misleading claims - a symptom of even more serious flaws?
James Penston
bmj.com, 8 Nov 2004 [Full text]
Beyond a joke ?
Anna E Livingstone
bmj.com, 10 Nov 2004 [Full text]
Misleading claims about albumin
Mahlon M. Wilkes, et al.
bmj.com, 10 Nov 2004 [Full text]
Re: misleading claims about albumin.
Richard G Fiddian-Green
bmj.com, 11 Nov 2004 [Full text]
Meta analyses and misleading claims
Peter J Horsey
bmj.com, 13 Nov 2004 [Full text]
Is it always misleading?
S Anuradha
bmj.com, 16 Nov 2004 [Full text]
Re: Meta analyses and misleading claims about HRT mortality
Ellen C G Grant
bmj.com, 17 Nov 2004 [Full text]
Albumin, science and the money trough
Ian Roberts
bmj.com, 28 Nov 2004 [Full text]
In response to Rapid Responses about Users' Guides
Victor M. Montori, et al.
bmj.com, 1 Dec 2004 [Full text]
No economic interest?
Mahlon M. Wilkes, et al.
bmj.com, 8 Dec 2004 [Full text]



Student BMJ

Risk of surgery for inflammatory bowel disease: record linkage studies

What can you learn from this BMJ paper? Read Leanne Tite's Paper+

www.student.bmj.com

Listen to the latest BMJ Interview