Are these data real? Statistical methods for the detection of data fabrication in clinical trials
BMJ 2005; 331 doi: https://doi.org/10.1136/bmj.331.7511.267 (Published 28 July 2005) Cite this as: BMJ 2005;331:267All rapid responses
Rapid responses are electronic comments to the editor. They enable our users to debate issues raised in articles published on bmj.com. A rapid response is first posted online. If you need the URL (web address) of an individual response, simply click on the response headline and copy the URL from the browser window. A proportion of responses will, after editing, be published online and in the print journal as letters, which are indexed in PubMed. Rapid responses are not indexed in PubMed and they are not journal articles. The BMJ reserves the right to remove responses which are being wilfully misrepresented as published articles or when it is brought to our attention that a response spreads misinformation.
From March 2022, the word limit for rapid responses will be 600 words not including references and author details. We will no longer post responses that exceed this limit.
The word limit for letters selected from posted responses remains 300 words.
What will happen if, from now on, fraudulent authors use computers to
generate their fraudulent data?
Competing interests:
None declared
Competing interests: No competing interests
The paper states that, "In a randomised trial, the data at baseline
should be similar in the randomised groups. ... This is the reason why, in
general, tests for statistical significance are not conducted at baseline
in genuine trials." This is a common, but in my opinion misguided,
attitude. The key word is "should", and a test is always appropriate to
test that assumption.
The question arises whether tests on baselines should be reported or
published. It seems useful to report that the tests were carried out and
either were not significant, confirming the randomisation as successful,
or revealed features in the baseline measures that needed explanation or
adjustment in the analysis. An obvious example would be if outliers
(anomalous observations) were found. Except for explaining retrospective
adjustment to the analysis, it should not be necessary to report baseline
tests in detail. Tables such as Table 1 in the paper are rarely useful to
the reader, in that they do not report "Results" of the study. Baseline
values may appear within the results, since these will presumably show
changes and need qualifying as absolute or relative.
Something implicit in the paper is the pressure to find "significant"
results to justify publication. Since most studies are justified to
funders or ethics committees on the expectation of finding "significant"
results, it is often more significant (in the English sense) when
expectations are not met. It would be a service to science if researchers
were encouraged to report, very briefly, all studies on a basis of (1)
what we expected (2) what we found (3) why [not].
Finally, it is regretable that Al-Marzouki's analysis is reported
with no indication that right of reply has been offered to the impugned
researcher.
Competing interests:
None declared
Competing interests: No competing interests
A simple method for assessing raw medical data for fraudulent or
concocted data is by using Benford's Law.
Benford's Law is a phenomenological law stating that in series of figures
or statistical tables the frequency of the number 1 as the first
significant digit occurs more often than the 10% that might be predicted.
This law is also known as the first digit phenomenon and applies
principally though not exclusively to dimensionless or scale invariant
data in which the numerical value of data depends on the units.
The law largely applies to data obtained in a semi-random manner and is
sensitive to deliberate tampering or biasing of data after which the
profile of the frequency of the first digits of numbers obtained will
differ significantly from that predicted. For this reason its value in
financial audit and other areas is being evaluated.
To date its application in assessing numerical data from medical imaging
has not been reported.
Benford's Law (1) applies to numbers drawn from a wide range of sources
and is not restricted to scale invariate data. Demonstrating this requires
sophisticated and thorough investigations of central limit like theorems
and the mantissa of random variables. With an increase in the number of
variables the density function approaches the logarithmic distribution.
If the law applies to data that are scale invariate or that are not
dimensionless then if a probability distribution exists for these data it
must be invariable over any change of scale.
Thus P ( kx ) = f ( k )* P ( x )
If
( P ( x ) dx = 1 , then ( P ( kx )dx = 1/k and normalisation
implies
f (k ) = 1/k. Differentiating with respect to k and setting k=1 gives
x P' (x ) = -P ( x )
Although this is not a proper probability distribution since it diverges,
the laws of physics and convention impose cut-offs.
If many powers of 10 lie between the cut-offs then the probability that
the first significant digit in base 10 is D is given by the logarithmic
distribution;
ln (D+1/D) / ln 10
= ln (D+1)- ln (D)/ ln 10
= Log (1+1/D)
.
The principle of logarithmically distributed significant digits in various
scientific calculations is well known and widely exploited. An extensive
range of diverse data sets does obey this logarithmic distribution for
significant digits and a considerable amount of empirical evidence
supports the use of Benford's law.
While many data sets do not follow the distribution, combination of data
tables tend to conform more closely to the logarithmic distribution. Hill
(2,3) demonstrated in his theorem that random samples taken from random
selection of distributions will conform to the logarithmic law even though
the initial distributions themselves do not.
This theorem also implies that numerous sampling methods from random
distributions will show a trend towards the same logarithmic distribution.
However as the method is also base invariable, the same distribution
of first digits should be obtained following conversion of the data to
bases other than base 10. Similarly, probability distributions can be
calculated for the frequency of the second or other digits. These tend to
occur with more uniform frequency, e.g. approaching 10% for sixth order
significant digits.
The value of this method is that tampering or external interference with
data acquisition is readily detectable and leads to abnormal skewing or
unexpected increased frequency of first digits other than 1. Biased
sampling techniques or concocted data tend to show a predominance of the
first significant digit 6 with far fewer numbers starting with the digit
1. It may provide a convenient means for medical statisticians to assess
raw data.
Discussions on the desirability of publishing raw data used in medical
research studies have largely centred on the difficulties involved.
Publishing data on a journal website is feasible however, as is the
inclusion of data supplements in material sent to reviewers. In systemic
reviews in particular, data from the primary studies should be available
to readers of the review for analyses to be checked and where required,
investigated. Benford's law may have an application in these circumstances
in assessment of numerical data at a basic level. (5,6,7)
Further potential uses of the significant digit law might be in the
testing of computer generated mathematical models. This could have
applications in analysis of predicted data from models describing tracer
kinetics or compartmental distribution of tracers or contrast medium.
"Goodness of Fit" tests to predicted frequency of digits have been devised
and applied in audit of accounting data. (4)
Application of Benford's law therefore, may be helpful to clinical audit
departments and medical statisticians in assessment of bias in data
acquisition in medical audit or research.
References;
1. Benford F.
The law of anomalous numbers
Proceedings of the American Philosophical Society 1938: 78; 551-572.
2. Hill T.
A statisitical derivation of the significant digit law.
Statistical Science 1996: 10; 354-313.
3. Hill T. P.
The first digit phenomenon.
American Scientist 1998: 86; 358-363
4. Nigrini M,
A Taxpayer compliance application of Benford's law.
Journal of the American Taxation Association 1996: 18; 72-91.
5. Hutchon D.J.R.
Infopoints: Publishing raw data and real time statistical analysis on e-
journals.
British Medical Journal 2001; 322:530.
6. Eysenbach G., Sa E-R.,
Code of conduct is needed for publishing raw data.
British Medical Journal 2001; 323:166.
7. Information for authors. Clinical Chemistry; www.aacc.org/ccj/infoauth/stm
Competing interests:
None declared
Competing interests: No competing interests
Trial Data
We read with great interest your articles regarding possible data
fabrication. The paper by Al-Marzouki, Evans, Marshall and Roberts (1)
illustrates the difficulties faced trying to “prove” data is falsified. A
small point regarding the paper is the random selection of patients from
five centres as the reference trial, compared to patients from a single
centre. It may be argued that the variance of baseline data from the
single centre trial would be expected to be smaller than that from a multi
-centre trial.
Of more overall concern is the number of large multi-centre trials
now performed where even the primary authors do not see the original data.
Data is supplied centrally already in electronic form, possibly via
intermediaries with variable financial interest. Should authors state a
(minimum) percentage of raw data and patient files they specifically
assessed in their quality assurance of the data?
1. Al-Marzouki S, Evans S, Marshall T, Roberts I. Are these data
real? Statistical methods for the detection of data fabrication in
clinical trials BMJ 2005;331:267-270.
Competing interests:
None declared
Competing interests: No competing interests