This article has a correction
Please see: How to read a paper: Papers that report diagnostic or screening tests
- Trisha Greenhalgh, senior lecturer (p.greenhalgh@ucl.ac.uk)a
- a Unit for Evidence-Based Practice and Policy Department of Primary Care and Population Sciences University College London Medical School/Royal Free Hospital School of Medicine Whittington Hospital London N19 5NF
Ten_men_in_the_dock
If you are new to the concept of validating diagnostic tests, the following example may help you. Ten men are awaiting trial for murder. Only three of them actually committed a murder; the seven others are innocent of any crime. A jury hears each case and finds six of the men guilty of murder. Two of the convicted are true murderers. Four men are wrongly imprisoned. One murderer walks free.
PETER BROWN
This information can be expressed in what is known as a two by two table (table 1). Note that the “truth” (whether or not the men really committed a murder) is expressed along the horizontal title row, whereas the jury's verdict (which may or may not reflect the truth) is expressed down the vertical row.
- In this window
- In a new window
Two by two table showing outcome of trial for 10 men accused of murder
These figures, if they are typical, reflect several features of this particular jury:
the jury correctly identifies two in every three true murderers;
it correctly acquits three out of every seven innocent people;
if this jury has found a person guilty, there is still only a one in three chance that they are actually a murderer;
if this jury found a person innocent, he or she has a three in four chance of actually being innocent; and
in five cases out of every 10 the jury gets it right.
These five features constitute, respectively, the sensitivity, specificity, positive predictive value, negative predictive value, and accuracy of this jury's performance. The rest of this article considers these five features applied to diagnostic (or screening) tests when compared with a “true” diagnosis or gold standard. A sixth feature—the likelihood ratio—is introduced at the end of the article.
Validating tests against a gold standard
Our window cleaner told me that he had been feeling thirsty recently and had …
Sign in
Personal subscribers, sign in here:
Article access
Article access for 1 day
Purchase this article for £20 $30 €32*
The PDF version can be downloaded as your personal record
CiteULike
Connotea
Del.icio.us
Digg
Facebook
Reddit
Technorati
Twitter
Stumbleupon
Rapid responses
Latest Responses
The decline in the breast cancer incidence is 1.2% and it is not significant.
Published 10 February 2012
'twas ever thus
Published 10 February 2012
The value of historic human remains
Published 10 February 2012
In Praise of British Literature
Published 10 February 2012
Is real shared decision making possible?
Published 10 February 2012
Most responses
Does anyone understand the government’s plan for the NHS? (17 responses)
Published 17 Jan 2012
Bad medicine: medical nutrition (15 responses)
Published 18 Jan 2012
Shared decision making: really putting patients at the centre of healthcare (7 responses)
Published 27 Jan 2012
Why legislation is necessary for my health reforms (7 responses)
Published 1 Feb 2012
Search for evidence goes on (5 responses)
Published 17 Jan 2012