- Trisha Greenhalgh, senior lecturera (p.greenhalgh@ucl.ac.uk)
- a Unit for Evidence-Based Practice and Policy, Department of Primary Care and Population Sciences, University College London Medical School/Royal Free Hospital School of Medicine, Whittington Hospital, London N19 5NF
Introduction
This article continues the checklist of questions that will help you to appraise the statistical validity of a paper. The first of this pair of articles was published last week.1
Correlation, regression, and causation
Has correlation been distinguished from regression, and has the correlation coefficient (r value) been calculated and interpreted correctly?
For many non-statisticians, the terms “correlation” and “regression” are synonymous, and refer vaguely to a mental image of a scatter graph with dots sprinkled messily along a diagonal line sprouting from the intercept of the axes. You would be right in assuming that if two things are not correlated, it will be meaningless to attempt a regression. But regression and correlation are both precise statistical terms which serve quite different functions.1
The r value (Pearson's product-moment correlation coefficient) is among the most overused statistical instrument. Strictly speaking, the r value is not valid unless the following criteria are fulfilled:
Summary points
An association between two variables is likely to be causal if it is strong, consistent, specific, plausible, follows a logical time sequence, and shows a dose-response gradient
A P value of <0.05 means that this result would have arisen by chance on less than one occasion in 20
The confidence interval around a result in a clinical trial indicates the limits within which the “real” difference between the treatments is likely to lie, and hence the strength of the inference that can be drawn from the result
A statistically significant result may not be clinically significant. The results of intervention trials should be expressed in terms of the likely benefit an individual could expect (for example, the absolute risk reduction)
The data (or, more accurately, the population from which the data are drawn) should be normally distributed. If they are not, non-itemmetric tests of correlation should be used instead.1
The …
Sign in
Article access
Article access for 1 day
Purchase this article for £20 $30 €32*
The PDF version can be downloaded as your personal record







CiteULike
Connotea
Del.icio.us
Digg
Facebook
Mendeley
Reddit
Technorati
Twitter
Stumbleupon
Rapid responses
Latest Responses
Re: Bringing Nightingale down to size
Published 29 May 2012
Re: Avoid antimuscarinic drugs in people with dementia
Published 29 May 2012
Re: Strengthening primary health care: Related to the integration of medical training, community service need and health administration
Published 29 May 2012
Re: Strengthening primary health care: Related to the integration of medical training, community service need and health administration
Published 29 May 2012
Health Literacy: Patient involvement and engagement with healthcare
Published 29 May 2012
Most responses
Venous thrombosis in users of non-oral hormonal contraception: follow-up study, Denmark 2001-10 (12 responses)
Published 10 May 2012 - 23:32
The psychiatric oligarchs who medicalise normality (9 responses)
Published 2 May 2012 - 15:42
Are doctors justified in taking industrial action in defence of their pensions? No (8 responses)
Published 8 May 2012 - 12:21
Are doctors justified in taking industrial action in defence of their pensions? Yes (8 responses)
Published 8 May 2012 - 12:21
The hardest thing: admitting error (7 responses)
Published 2 May 2012 - 12:27