- Jonathan A C Sterne (jonathan.sterne@bristol.ac.uk), senior lecturer in medical statistics,
- George Davey Smith, professor of clinical epidemiology
- Department of Social Medicine, University of Bristol, Bristol BS8 2PR
- Nuffield College, Oxford OX1 1NF
- Correspondence to: J Sterne
- Accepted 9 November 2000
The findings of medical research are often met with considerable scepticism, even when they have apparently come from studies with sound methodologies that have been subjected to appropriate statistical analysis. This is perhaps particularly the case with respect to epidemiological findings that suggest that some aspect of everyday life is bad for people. Indeed, one recent popular history, the medical journalist James Le Fanu's The Rise and Fall of Modern Medicine, went so far as to suggest that the solution to medicine's ills would be the closure of all departments of epidemiology.1
One contributory factor is that the medical literature shows a strong tendency to accentuate the positive; positive outcomes are more likely to be reported than null results.2–4 By this means alone a host of purely chance findings will be published, as by conventional reasoning examining 20 associations in which no true effect exists will produce, on average, one result that is “significant at P=0.05” by chance alone. If only positive findings are published then they may be mistakenly considered to be of importance rather than being the necessary chance results produced by the application of criteria for meaningfulness based on statistical significance. As many studies contain long questionnaires collecting information on hundreds of variables, and measure a wide range of potential outcomes, several false positive findings are virtually guaranteed. The high volume and often contradictory nature5 of medical research findings, however, are not only the result of publication bias. A more fundamental problem is the widespread misunderstanding of the nature of statistical significance.
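The arithmetic behind the "20 associations" remark can be made concrete with a minimal sketch. It assumes, for illustration only, that every null hypothesis is true and that the tests are independent; the function names are hypothetical and are not taken from the article.

```python
def expected_false_positives(n_tests: int, alpha: float = 0.05) -> float:
    """Expected number of 'significant' results when every null hypothesis is true."""
    return n_tests * alpha


def prob_at_least_one_false_positive(n_tests: int, alpha: float = 0.05) -> float:
    """Chance of at least one 'significant' result among independent tests
    of true null hypotheses (independence is an assumption of this sketch)."""
    return 1.0 - (1.0 - alpha) ** n_tests


if __name__ == "__main__":
    # Compare a single test with 20 and with 100 associations examined.
    for n in (1, 20, 100):
        print(
            f"{n:>3} tests: expected false positives = "
            f"{expected_false_positives(n):.2f}, "
            f"P(at least one) = {prob_at_least_one_false_positive(n):.3f}"
        )
```

Under these assumptions, 20 tests give one expected false positive and roughly a 64% chance of at least one. With hundreds of variables the chance approaches certainty, which is the sense in which several false positive findings are "virtually guaranteed".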
Summary points
P values, or significance levels, measure the strength of the evidence against the null hypothesis; the smaller the P value, the stronger the evidence against the null hypothesis
An arbitrary division of results, into “significant” or “non-significant” according to the P value, was not the intention of the …