
Research

Discrepancies in sample size calculations and data analyses reported in randomised trials: comparison of publications with protocols

BMJ 2008; 337 doi: https://doi.org/10.1136/bmj.a2299 (Published 04 December 2008) Cite this as: BMJ 2008;337:a2299
An-Wen Chan, clinical fellow,1 Asbjørn Hróbjartsson, senior researcher,2 Karsten J Jørgensen, researcher,2 Peter C Gøtzsche, director,2 Douglas G Altman, director3

1 Mayo Clinic, Rochester, USA
2 Nordic Cochrane Centre, Copenhagen, Denmark
3 Centre for Statistics in Medicine, University of Oxford, Oxford

Correspondence to: A-W Chan chan.anwen@mayo.edu

Accepted 10 September 2008

Abstract

Objective To evaluate how often sample size calculations and methods of statistical analysis are pre-specified or changed in randomised trials.

Design Retrospective cohort study.

Data source Protocols and journal publications of published randomised parallel group trials initially approved in 1994-5 by the scientific-ethics committees for Copenhagen and Frederiksberg, Denmark (n=70).

Main outcome measure Proportion of protocols and publications that did not provide key information about sample size calculations and statistical methods; proportion of trials with discrepancies between information presented in the protocol and the publication.

Results Only 11/62 trials described existing sample size calculations fully and consistently in both the protocol and the publication. The method of handling protocol deviations was described in 37 protocols and 43 publications. The method of handling missing data was described in 16 protocols and 49 publications. 39/49 protocols and 42/43 publications reported the statistical test used to analyse primary outcome measures. Unacknowledged discrepancies between protocols and publications were found for sample size calculations (18/34 trials), methods of handling protocol deviations (19/43) and missing data (39/49), primary outcome analyses (25/42), subgroup analyses (25/25), and adjusted analyses (23/28). Interim analyses were described in 13 protocols but mentioned in only five corresponding publications.

Conclusion When reported in publications, sample size calculations and statistical methods were often explicitly discrepant with the protocol or not pre-specified. Such amendments were rarely acknowledged in the trial publication. The reliability of trial reports cannot be assessed without having access to the full protocols.

Introduction

Sample size calculations and data analyses have an important impact on the planning, interpretation, and conclusions of randomised trials. Statistical analyses often involve several subjective decisions about which data to include and which tests to use, producing potentially different results and conclusions depending on the decisions taken.1 2 3 4 5 6 7 Methods of analysis that are chosen or altered after preliminary examination of the data can introduce bias if a subset of favourable results is then reported in the publication.

The study protocol plays a key role in reducing such bias by documenting a pre-specified blueprint for conducting and analysing a trial. Explicit descriptions of methods before a trial starts help to identify and deter unacknowledged, potentially biased changes made after reviewing the study results.8 9 10 To evaluate the completeness and consistency of reporting, we reviewed a comprehensive cohort of randomised trials and compared the sample size calculations and data analysis methods described in the protocols with those reported in the publications.

Methods

We included all published parallel group randomised trials approved by the scientific-ethics committees for Copenhagen and Frederiksberg, Denmark, from 1 January 1994 to 31 December 1995. We defined a randomised trial as a research study that randomly allocated human participants to healthcare interventions. We manually reviewed the committees’ files in duplicate to identify eligible studies as part of a separate study of outcome reporting.9 We confirmed journal articles for each trial by surveying investigators and searching PubMed, Embase, and the Cochrane Controlled Trials Register.9

For each trial, we reviewed the protocol, statistical appendices, amendments, and publications that reported the primary outcome measures. Two reviewers independently extracted data on trial design, sample size calculations, and statistical analyses in duplicate, using electronic forms; we resolved disagreements by discussion. No reviewer extracted data from both the protocol and the publication for the same trial.

The pre-specified primary outcomes of our study were the proportion of trial protocols and publications that did not provide key information (described below) about sample size calculations and statistical methods and the proportion of trials with discrepancies between the information presented in the protocol and the publication. We considered a 10% or greater difference between calculated sample sizes in the protocol and publication to be a discrepancy, as well as any qualitative or quantitative difference in other information we examined.

From publications, we noted the achieved sample size and any statements about early stopping or whether analyses were pre-specified or amended. From both protocols and publications, we categorised the design framework of the trial as superiority, non-inferiority, or equivalence (box 1). We also recorded the fundamental components of sample size calculations (table 1), the basis on which the minimum clinically important effect size (delta) and estimated event rates were derived (for example, previous literature, pilot data), and any reference to re-estimation of sample size on the basis of an interim analysis. When multiple sample size calculations were reported, we focused on the one for the primary outcome measure.

Box 1 Definitions of collected data

Design framework

Superiority trial—Explicitly described as a study designed to show a difference in effects between interventions or not explicitly described as an equivalence or non-inferiority trial

Non-inferiority trial—Explicitly described as a study designed to show that one intervention is not worse than another, or a non-inferiority margin is specified, or a one sided confidence interval is presented

Equivalence trial—Explicitly described as a study designed to show that one intervention is neither inferior nor superior to another or an equivalence margin is specified

Handling of protocol deviations

Intention to treat analysis—All participants with available data are analysed in the original group to which they were randomly assigned (as randomised), regardless of adherence to the protocol. No data are excluded for reasons other than loss to follow-up

Per protocol analysis—Participants with available data are analysed as randomised provided they meet some defined level of adherence to the protocol

As treated analysis—Participants are analysed in the group corresponding to the actual intervention received (ignoring original randomisation)

Primary outcome

Main outcome(s) of interest, in the following hierarchical order:

1. Explicitly defined as primary or main
2. Outcome used in the power calculation
3. Main outcome stated in the trial objectives

Table 1 Reporting of sample size calculations in trial protocols and publications

We also documented how deviations from the treatment protocol were handled in terms of both the stated type of analysis (such as intention to treat) and any additional text description (box 1). For the primary outcome measure of the trial (box 1), we recorded the method of handling missing data and the type of statistical test used. We recorded the factors and outcome measures used in any subgroup analyses; the covariates and outcome measures used in any adjusted analyses; and the number of statistical comparisons described or reported between randomised groups, excluding baseline comparisons. Finally, we recorded the use of interim analyses and data monitoring boards.

Results

We identified 70 parallel group randomised trials that received ethics approval in 1994-5 and were subsequently published (table 2).9 The median publication year was 1999 (range 1995-2003). Fifty two of the trials evaluated drugs, and 56 were funded in full or in part by industry. Most trials involved two study arms (n=47), multiple centres (46), and some form of blinding (49). The median achieved sample size per study arm was 66 (10th-90th centile range 13-324). The most common specialty fields were endocrinology (n=11), anaesthesiology (5), cardiology (5), infectious diseases (5), and oncology (5).

Table 2 Characteristics of published parallel group randomised trials

Sixty nine trials were designed and reported as superiority trials. One trial was stated to be an equivalence trial in the protocol but reported as a superiority trial in the publication; no explanation was given for the change.

Sample size calculation

Overall, only 11 trials fully and consistently reported all of the requisite components of the sample size calculation in both the protocol and the publication.

Completeness of reporting—An a priori sample size calculation was reported for 62 trials; 28 were described only in the protocol and 34 in both the protocol and the publication. Thirty seven protocols and 21 publications reported all of the components of the sample size calculation (figure). Individual components were reported in 74-100% of protocols and 48-75% of publications (table 1). Nine protocols provided only the calculated sample size without any further details about the calculation. Among trials that reported an estimated minimum clinically important effect size (delta), 20/53 protocols and 10/33 publications stated the basis on which the figure was derived.

Fig 1 Reporting of sample size calculations and data analyses in publications compared with protocols

Comparison of calculated and actual sample sizes—Sixty two trials provided a calculated sample size in the protocol. Of these, 30 subsequently recruited a sample size within 10% of the calculated figure from the protocol; 22 trials randomised at least 10% fewer participants than planned as a result of early stopping (n=3), poor recruitment (2), and unspecified reasons (17); and 10 trials randomised at least 10% more participants than planned as a result of lower than anticipated average age (1), a higher than expected recruitment rate (1), and unspecified reasons (8). A calculated sample size was as likely to be reported accurately in the publication if there was a discrepancy with the actual sample size compared with no discrepancy (11/32 v 14/30).

Discrepancies between publications and protocols—Both the publications and the protocols for 34 trials described a sample size calculation. Overall, we noted discrepancies in at least one component of the published sample size calculation when compared with the protocol for 18 trials (figure). Publications for eight trials reported components that had not been pre-specified in the protocol, and 16 had explicit discrepancies between information contained in the publication and protocol (table 3, box 2). None of the publications mentioned any amendments to the original sample size calculation.

Table 3 Discrepancies in sample size calculations reported in trial publications compared with protocols

Box 2 Anonymised examples of unacknowledged discrepancies in sample size calculations and statistical analyses reported in publications compared with protocols

Sample size calculation
Changed delta (1)
  • Outcome: disease progression or death rate

  • Protocol: delta 10%; event rates unspecified

  • Publication: delta 6%; event rates 16% and 10%

Changed delta (2)
  • Outcome: mean number of active joints

  • Protocol: delta 2.5 joints

  • Publication: delta 5 joints

Changed standard deviation
  • Outcome: mean symptom score

  • Protocol: 1.4

  • Publication: 0.49

Changed power
  • Outcome: survival without disease progression

  • Protocol: 90%

  • Publication: 80%

Changed sample size estimate
  • Outcome: thromboembolic complication rate

  • Protocol: 2200

  • Publication: 1500

Statistical analyses
Changed primary outcome analysis
  • Outcome: global disease assessment

  • Protocol: χ2 test

  • Publication: analysis of covariance

New subgroups added to publication
  • Outcome: time to progression or death

  • Protocol: baseline disease severity

  • Publication: duration of previous treatment*, type of previous treatment*, blood count*, disease severity

Omitted covariates for adjusted analysis in publication
  • Outcome: neurological score at six months

  • Protocol: baseline neurological score, pupil reaction, age, CT scan classification, shock, haemorrhage

  • Publication: no adjusted analysis reported

*Described explicitly as pre-specified despite not appearing in the protocol
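
To see why the changes in box 2 matter, the sketch below recomputes the required sample size for the "Changed delta (1)" example with the standard normal approximation formula for comparing two proportions. This is an illustration, not a reconstruction of that trial's actual calculation: the two sided alpha of 0.05, the 90% power, and the 20% v 10% event rates assumed for the protocol's 10% delta were not stated in the anonymised documents.

```python
# Illustrative sketch: participants needed per group to compare two
# proportions, n = (z_alpha + z_beta)^2 [p1(1-p1) + p2(1-p2)] / (p1-p2)^2.
# Alpha, power, and the event rates for the 10% delta are assumptions.
import math
from scipy.stats import norm

def n_per_group(p1, p2, alpha=0.05, power=0.90):
    """Approximate number of participants per group to detect p1 v p2."""
    z_alpha = norm.ppf(1 - alpha / 2)  # critical value for a two sided test
    z_beta = norm.ppf(power)          # quantile corresponding to the power
    variance = p1 * (1 - p1) + p2 * (1 - p2)
    return math.ceil((z_alpha + z_beta) ** 2 * variance / (p1 - p2) ** 2)

print(n_per_group(0.16, 0.10))  # published delta 6% (rates 16% v 10%): 655 per group
print(n_per_group(0.20, 0.10))  # protocol delta 10% (assumed rates): 263 per group
```

Because the required n scales roughly with 1/delta², even modest changes to the assumed effect size alter the calculated sample size substantially, which is why unacknowledged amendments to delta are consequential.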

Protocol deviations

The specific method of handling protocol deviations in the primary statistical analysis (as defined in box 1) was named or described in 37 protocols and 43 publications (figure). Overall, the primary method described for handling protocol deviations in the publication differed from that described in the protocol for 19/43 trials; table 4 provides details. None of these discrepancies was acknowledged in the journal publication.

Table 4 Discrepancies in primary method of handling protocol deviations, as reported in publications compared with protocols

Thirty protocols and 33 publications used the term “intention to treat” analysis and applied a variety of definitions (table 5). Few of these protocols (n=7) and publications (3) made it explicit whether study participants were analysed in the group to which they were originally randomised. Most protocols (22) and publications (18) incorrectly excluded participants from the intention to treat analysis for reasons other than loss to follow-up (table 5).

Table 5 Definitions of “intention to treat” analysis used in protocols and publications

Missing data

The method of handling missing data was described in only 16 protocols and 49 publications (figure). Methods reported in publications differed from the protocol for 39/49 trials. Published methods were often not pre-specified in the protocol (38/49). For one trial, the protocol stipulated that missing data would be counted as failures, whereas in the publication they were excluded from the analysis.

Primary outcome analysis and overall number of tests

Fifty four trials designated at least one outcome measure as primary in the protocol (n=49) or publication (43). The statistical method for analysing the primary outcome measure was described in 39 protocols and 42 publications. Overall, for 25 trials the statistical test for the primary outcome measure described in the publication differed from that specified in the protocol (figure, box 2).

The median number of between group statistical tests defined in 44 protocols was 30 (10th-90th centile range 8-218); the other 26 protocols contained insufficient statistical detail. Publications for all 70 trials reported a median of 22 (8-71) tests. Half of the protocols (n=36) and publications (34) did not define whether hypothesis testing was one or two sided. We found one neurology trial that reported two sided P values in one publication (all P values >0.1) and a one sided P value in another (P=0.028).
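
The switch in sidedness matters because, for a symmetric test statistic, a one sided P value in the favourable direction is half the two sided value. The short sketch below back-calculates the z statistic implied by a one sided P of 0.028; it is an illustration under that assumption, not the trial's actual analysis, and shows how the same result can sit on either side of the conventional 0.05 threshold.

```python
# Illustrative only: for a symmetric test statistic, the one sided P value
# (in the favourable direction) is half the two sided value.
from scipy.stats import norm

p_one_sided = 0.028
z = norm.isf(p_one_sided)     # z statistic implied by the one sided P value
p_two_sided = 2 * norm.sf(z)  # the corresponding two sided P value
print(round(z, 2), round(p_two_sided, 3))  # 1.91 0.056: not significant at two sided 0.05
```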

Subgroup analysis

Overall, 25 trials described subgroup analyses in the protocol (n=13) or publication (20). All had discrepancies between the two documents (figure, box 2). Twelve of the trials with protocol specified analyses reported only some (n=7) or none (5) in the publication. Nineteen of the trials with published subgroup analyses reported at least one that was not pre-specified in the protocol. Protocols for 12 of these trials specified no subgroup analyses, whereas seven specified some but not all of the published analyses. Only seven publications explicitly stated whether the analyses were defined a priori; four of these trials claimed that the subgroup analyses were pre-specified even though they did not appear in the protocol.

Adjusted analysis

Overall, 28 trials described adjusted analyses in the protocol (n=18) or publication (18). Of these, 23 had discrepancies between the two documents (figure, box 2). Twelve of the trials with protocol specified covariates reported no adjustment (n=10) or omitted at least one pre-specified covariate (2) from the published analysis. Twelve of the trials with published adjusted analyses used covariates that were not pre-specified in the protocol. Ten of these trials did not mention any adjusted analysis in the protocol, whereas two trials added new covariates to those specified in the protocol. Publications for only one trial explicitly stated whether the covariates were defined a priori.

Interim analyses and data monitoring boards

Interim analyses were described in 13 protocols, but reported in only five corresponding publications. An additional two trials reported interim analyses in the publications, despite the protocol explicitly stating that there would be none. A data monitoring board was described in 12 protocols but in only five of the corresponding publications.

Discussion

We identified a high frequency of unacknowledged discrepancies and poor reporting of sample size calculations and data analysis methods in an unselected cohort of randomised trials. To our knowledge, this is the largest review of sample size calculations and statistical methods described in trial publications compared with protocols. We reviewed key methodological information that can introduce bias if misrepresented or altered retrospectively. Our broad sample of protocols is a key strength, as unrestricted access to such documents is often very difficult to obtain.11 Previous comparisons have been limited to case reports,6 small samples,12 13 specific specialty fields,14 and specific journals.15 Other reviews of reports submitted to drug licensing agencies did not have access to protocols.4 16 17

One limitation is that our cohort may not reflect recent protocols and publications, as this type of review can be done only several years after protocol submission to allow time for publication. Whether the widespread adoption of CONSORT and other reporting guidelines for publications has improved the quality of protocols or reduced the prevalence of unacknowledged amendments in recent years is unclear.18 However, our results are consistent with more recent examples of discrepancies.3 12 13 Furthermore, we previously found that the prevalence of publication restrictions stated in industry initiated trial protocols did not change between 1995 and 2004.19

We also acknowledge that detailed statistical analysis plans may not always be included in the application for scientific or ethical review as a result of varying standards, even though this information has a role in evaluating the validity of a study. However, this does not explain the frequent discrepancies we found between explicit descriptions in protocols and publications (box 2).

We found that sample size calculations and important aspects of statistical analysis methods were often incompletely described in protocols and publications. When reported in the publication, they were discrepant with the protocol for 44-100% of trials. The 70 trials in this study were part of a larger review that found unacknowledged changes to primary outcome measures in more than half of 102 trials,9 so we are not surprised to find frequent discrepancies in other aspects of study conduct.

Because the choice of parameters for sample size calculations and statistical analyses involves somewhat subjective judgments, it is important to specify them a priori to guard against selective reporting or revisions made on the basis of the data. This includes defining each component of the sample size calculation20; the analysis plan for primary outcome measures8; the primary population for analysis21; and subgroup,22 23 adjusted, and interim analyses.24 Selective reporting of a favourable subset of the analyses done can lead to inaccurate interpretation of results, similar to selective publication or selective outcome reporting.25 26

Even when analysis plans are reported, concerns may exist that published plans were constructed after exploration of the data. Accurate reporting of sample size calculations and data analysis methods is important not only for the sake of transparency but also because the choice of methods and the reasons for choosing them can directly influence the interpretation and conclusions of study results.1 2 3 4 5 6 27 Public access to full protocols is thus needed to reliably appraise trial publications. Several journals have recognised this principle and require submission of protocols with manuscripts.28 29 30

We identified two types of discrepancies: lack of pre-specification, where published information about the sample size calculation or data analysis methods did not appear in the protocol; and unacknowledged amendments, where published information differed from the protocol. Whereas the first type of discrepancy could be explained by varying standards in protocol content, the second represents an explicit change from the time of ethical approval to analysis and publication of the trial. Retrospective amendments to methods of statistical analysis can sometimes be justifiable before unblinding of the trial, but no good reason exists to misrepresent the pre-trial sample size calculation or misleadingly describe the methods as pre-specified when the protocol suggests otherwise. Substantive amendments should be submitted to ethics committees and explicitly acknowledged in the publication so that readers can judge the potential for bias.8

Sample size calculations

Although good reasons exist for reporting sample size calculations in publications,20 31 previous reviews of publications have found poor reporting of sample size calculations and their underlying assumptions.14 32 33 34 35 36 These reviews involved specific specialty fields or journals, and only one had access to trial protocols.14

Inadequate reporting and unacknowledged changes in sample size calculations can introduce bias and lead to misinterpretation of trial results. For example, favourable results could be highlighted by changing the primary outcome measure stated to have been used to calculate sample size,9 or by modifying the pre-specified minimum clinically important effect size (delta), which helps to determine whether one study intervention can be declared superior or equivalent to another.6 31 In addition, publications of studies with poor recruitment might report a modified sample size calculation to give the appearance that a smaller sample size had originally been planned. Although methods for valid changes to sample size parameters mid-trial are emerging,37 no such method was mentioned in any study in our sample.

Data analyses

Although results for primary outcome measures are key determinants of an intervention’s benefits or harms profile, we found that the analysis method for primary outcome measures was often undefined in the protocol or altered in the publication. Such unacknowledged changes compound previously identified discrepancies in primary outcome measures.9 10

The recommended primary analysis population for superiority trials is defined by the intention to treat principle, whereby data from all trial participants are analysed according to their randomly allocated study arm, irrespective of the degree of compliance or crossover to other interventions.38 Missing data from losses to follow-up can be handled with various statistical methods that can each produce different results.39 Previous reviews of high impact journals have found that a high proportion of publications provide insufficient details about methods of handling protocol deviations or missing data.40 41 In addition, a comparison of publications with protocols of cancer trials revealed that intention to treat analysis was done more often than was reported in the publications.14
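
The practical difference between these analysis populations is easy to see in a toy example. The sketch below (invented data, not from any trial in our cohort) applies the box 1 definitions to five hypothetical participants and shows that intention to treat, per protocol, and as treated analyses select different subsets of the data.

```python
# Toy example of the analysis populations defined in box 1; all data invented.
# Each record: (randomised arm, intervention received, adherent to protocol,
# outcome), with None marking loss to follow-up.
participants = [
    ("A", "A", True,  1),
    ("A", "B", False, 0),     # crossed over to the other intervention
    ("A", "A", False, None),  # lost to follow-up
    ("B", "B", True,  0),
    ("B", "B", False, 1),     # protocol deviation, but outcome available
]

# Intention to treat: all participants with data, analysed as randomised
itt = [(arm, y) for arm, _, _, y in participants if y is not None]

# Per protocol: as randomised, restricted to adherent participants
per_protocol = [(arm, y) for arm, _, ok, y in participants if ok and y is not None]

# As treated: grouped by the intervention actually received
as_treated = [(rx, y) for _, rx, _, y in participants if y is not None]

print(itt)           # [('A', 1), ('A', 0), ('B', 0), ('B', 1)]
print(per_protocol)  # [('A', 1), ('B', 0)]
print(as_treated)    # [('A', 1), ('B', 0), ('B', 0), ('B', 1)]
```

Because the three populations contain different participants, they can yield different effect estimates, which is why an unacknowledged switch between them creates scope for selective reporting.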

We found omissions from the trial protocol as well as retrospective choices of analysis populations in almost half of the publications in our cohort. This creates the potential for preferential reporting of per protocol analyses over less favourable intention to treat analyses.4 5 The definition of the term intention to treat was also highly variable and often inaccurately applied in our cohort, as has been found in other reviews.40 41 42 To avoid ambiguity, the study arm in which participants are analysed and the criteria for excluding participants from analyses should be explicitly defined in the protocol and publication.8

Likewise, decisions to examine particular subgroups or to include particular covariates in adjusted analyses may be influenced by extensive exploratory analyses of the data.22 Subgroup analyses in publications are often over-interpreted, poorly reported, and lacking in clarity about whether they were pre-specified.13 27 43 44 45 Although exploratory analyses should be identified as such, we found that assertions about analyses being pre-specified were often misleading when compared with protocols. In addition, most interim analyses described in protocols were not mentioned in the publication, which deprives readers of important information about the decisions to stop or continue a trial as planned.24

Conclusions

Our findings support the need to improve the content of trial protocols and, through research training, to encourage transparent reporting of amendments in publications. In collaboration with journal editors, trialists, methodologists, and ethicists, we have launched the SPIRIT (standard protocol items for randomised trials) initiative to establish evidence based recommendations for the key content of trial protocols.46 Public availability of trial protocols and submissions to regulatory agencies is also necessary to ensure transparent reporting of study methods.16 17 47 48 Prospective trial registration is an effective means of ensuring public access to protocol information, although a limited amount of methodological information is currently recorded on registries.49 50 51

To improve the reliability of published results, investigators should document the sample size calculations and full analysis plans before the trial is started and should then analyse the results with fidelity to the study protocol or describe major amendments in the publication.8 As the guardians of clinical research before study inception, scientific and ethical review committees can help to ensure that statistical analysis plans are well documented in protocols. Only with fully transparent reporting of trial methods and public access to protocols can the results be properly appraised, interpreted, and applied to care of patients.

What is already known on this topic

  • The results and conclusions of randomised trials are influenced by the choice of statistical analysis methods and individual components of sample size calculations

  • If these methodological choices are defined or altered after examination of the data, the potential for biased reporting of favourable results is substantial

What this study adds

  • Trial protocols and publications are often missing important methodological information about sample size calculations and statistical analysis methods

  • When described, methodological information in journal publications is often discrepant with information in trial protocols


Footnotes

  • Contributors: All authors contributed to the study design, data collection, data interpretation, and drafting of the manuscript. A-WC also did the statistical analyses and is the guarantor.

  • Funding sources: None.

  • Competing interests: A-WC chairs and AH, DGA, and PCG are members of the steering committee for the SPIRIT (standard protocol items for randomised trials) initiative, a project that aims to define key protocol content for randomised trials.

  • Ethical approval: The scientific-ethics committees for Copenhagen and Frederiksberg granted unrestricted access to trial protocols. The research did not undergo the usual ethical review process, as patients’ data were not reviewed.

  • Provenance and peer review: Not commissioned; externally peer reviewed.

This is an open-access article distributed under the terms of the Creative Commons Attribution Non-commercial License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
