The influence of study characteristics on reporting of subgroup analyses in randomised controlled trials: systematic review
BMJ 2011; 342 doi: https://doi.org/10.1136/bmj.d1569 (Published 28 March 2011) Cite this as: BMJ 2011;342:d1569
- Xin Sun, research fellow 1 2,
- Matthias Briel, assistant professor 1 3,
- Jason W Busse, scientist 1 4,
- John J You, assistant professor 1 5,
- Elie A Akl, associate professor 1 6,
- Filip Mejza, research fellow 7,
- Malgorzata M Bala, research fellow 8,
- Dirk Bassler, associate professor 9,
- Dominik Mertz, research fellow 1 10,
- Natalia Diaz-Granados, doctoral candidate 1,
- Per Olav Vandvik, researcher 11 12,
- German Malaga, associate professor 13,
- Sadeesh K Srinathan, assistant professor 14,
- Philipp Dahm, associate professor 15,
- Bradley C Johnston, postdoctoral fellow 1,
- Pablo Alonso-Coello, researcher 16,
- Basil Hassouneh, research fellow 1,
- Jessica Truong, undergraduate student 17,
- Neil D Dattani, medical student 18,
- Stephen D Walter, professor 1,
- Diane Heels-Ansdell, statistician 1,
- Neera Bhatnagar, librarian 19,
- Douglas G Altman, professor 20,
- Gordon H Guyatt, professor 1
- 1 Department of Clinical Epidemiology and Biostatistics, McMaster University, 1200 Main Street, Hamilton, ON, Canada L8N 3Z5
- 2 Chinese Evidence-Based Medicine Center, West China Hospital, Sichuan University, Chengdu, China
- 3 Basel Institute for Clinical Epidemiology and Biostatistics, University Hospital Basel, Switzerland
- 4 Institute for Work and Health, Toronto, ON, Canada
- 5 Department of Medicine, McMaster University, Hamilton, ON, Canada
- 6 Departments of Medicine and Family Medicine, State University of New York at Buffalo, NY, USA
- 7 Department of Pulmonary Diseases, Jagiellonian University School of Medicine, Krakow, Poland
- 8 II Department of Internal Medicine, Jagiellonian University School of Medicine, Krakow, Poland
- 9 Department of Neonatology and Center for Pediatric Clinical Studies, University Children’s Hospital Tuebingen, Tuebingen, Germany
- 10 Michael G DeGroote Institute for Infectious Diseases Research, McMaster University, Hamilton, ON, Canada
- 11 Norwegian Knowledge Centre for the Health Services, Oslo, Norway
- 12 Department of Medicine, Innlandet Hospital Trust, Gjøvik, Norway
- 13 Universidad Peruana Cayetano Heredia, Lima, Peru
- 14 Department of Surgery, University of Manitoba, Winnipeg, MB, Canada
- 15 Department of Urology, College of Medicine, University of Florida, Gainesville, FL, USA
- 16 IberoAmerican Cochrane Centre, Clinical Epidemiology and Public Health Department, Institute of Biomedical Research-CIBER of Epidemiology and Public Health, Barcelona, Spain
- 17 Bachelor of Health Sciences Program, McMaster University, Hamilton, ON, Canada
- 18 Doctor of Medicine Programme, University of Toronto, Toronto, ON, Canada
- 19 Health Sciences Library, McMaster University, Hamilton, ON, Canada
- 20 Center for Statistics in Medicine, University of Oxford, UK
- Correspondence to: G H Guyatt
- Accepted 24 December 2010
Objective To investigate the impact of industry funding on reporting of subgroup analyses in randomised controlled trials.
Design Systematic review.
Data sources Medline.
Study selection Randomised controlled trials published in 118 core clinical journals (defined by the National Library of Medicine) in 2007. We randomly sampled 1140 study reports in a 1:1 ratio from high impact journals (the five general medicine journals with the largest number of total citations in 2007) and lower impact journals. Two reviewers, independently and in duplicate, used standardised, piloted forms to screen study reports for eligibility and to extract data. They also used explicit criteria to determine whether a randomised controlled trial reported subgroup analyses. Logistic regression was used to examine the association of prespecified study characteristics with reporting versus not reporting of subgroup analyses.
Results 469 randomised controlled trials were included, of which 207 (44%) reported subgroup analyses. High impact journals (adjusted odds ratio 2.64, 95% confidence interval 1.62 to 4.33), non-surgical (versus surgical) trials (2.10, 1.26 to 3.50), and larger sample size (3.38, 1.64 to 6.99) were associated with more frequent reporting of subgroup analyses. The strength of association between trial funding and reporting of subgroups differed in trials with and without statistically significant primary outcomes (interaction P=0.02). In trials without statistically significant results for the primary outcome, industry funded trials were more likely to report subgroup analyses (2.29, 1.30 to 4.72) than non-industry funded trials. This was not true for trials with a statistically significant primary outcome (0.79, 0.46 to 1.36). Industry funded trials were associated with less frequent prespecification of subgroup hypotheses (31.3% v 38.0%, adjusted odds ratio 0.49, 0.26 to 0.94), and less use of the interaction test for analyses of subgroup effects (41.4% v 49.1%, 0.52, 0.28 to 0.97) than non-industry funded trials.
Conclusion Industry funded randomised controlled trials, in the absence of statistically significant primary outcomes, are more likely to report subgroup analyses than non-industry funded trials. Industry funded trials less frequently prespecify subgroup hypotheses and less frequently test for interaction than non-industry funded trials. Subgroup analyses from industry funded trials with negative results for the primary outcome should be viewed with caution.
Subgroup analyses are common in randomised controlled trials.1 2 3 4 5 6 Previous studies have found that 60% of trials published in high impact general medical journals,1 6 61% of cardiovascular trials,3 and 37% of surgical trials2 report subgroup analyses. Investigators carry out subgroup analyses to examine whether observed treatment effects differ across baseline characteristics. Once reported, these analyses can have substantial influence on clinical and public health decision making. This influence may be misleading, as subsequent studies have shown that many subgroup findings are spurious.7 For instance, a randomised trial showed that aspirin was ineffective in the secondary prevention of stroke in women8; a subsequent large collaborative meta-analysis, however, showed that aspirin was beneficial in both men and women.9 In another example, a subgroup analysis in a randomised trial found ticlopidine to be superior to aspirin for preventing recurrent stroke, myocardial infarction, or vascular death in black patients but not in white patients,10 whereas a subsequent larger trial showed no statistically significant difference between the drugs in preventing stroke, myocardial infarction, and vascular death in black patients.11
An understanding of factors underlying reporting of subgroup analyses may aid the interpretation and appropriate use of subgroup findings in trials. However, the investigation of factors associated with reporting of subgroup analyses has thus far been limited. Two studies have shown an association of larger sample size with reporting of subgroup analyses,3 6 one of which found that the rate of subgroup reporting varied among high impact medical journals.6 These two studies were, however, restricted to trials published in high impact general medical journals or selected cardiovascular journals, and one was restricted to trials reporting interaction tests for subgroup analyses.6 These restrictions limit the generalisability of their results.
Existing studies have left several potentially important factors unexplored. A number of studies have reported evidence on the influence of sponsors on aspects of trial design, conduct, and reporting other than the use of subgroup analysis.12 13 14 15 16 17 18 19 20 Funding by industry may also influence the reporting of subgroup analyses. One hypothesis would suggest that, in the absence of a statistically significant primary outcome, industry funded trials may, looking for a positive effect, seek statistically significant findings in patient subgroups. Were this the case, the influence of industry would have an effect on subgroup reporting in trials with negative findings but not positive findings. Other factors that may influence subgroup reporting include clinical area (for example, surgical v medical) and journal types (high impact journals v others).1 2 3 5
We systematically reviewed randomised controlled trials to investigate the association of prespecified study characteristics with reporting of subgroup analyses. In particular we examined the impact of industry funding on the reporting of subgroups.
The protocol for our study, detailing the design and analysis, is published elsewhere.21 We included any randomised controlled trial carried out on humans unless it focused on a subset of the original population enrolled, was explicitly labelled as a phase I trial, was exclusively a pharmacokinetic study, or was reported as a research letter. We applied no restrictions to study design (parallel, factorial, crossover), number of trial arms, unit of randomisation, type of study (superiority, non-inferiority, equivalence), or study sample size.
We applied a predefined search strategy (see web extra appendix 1),21 developed with the help of an experienced librarian, to the core clinical journals in 2007 in Medline through Ovid. The search strategy combined MeSH terms and free text terms and was highly sensitive for identifying randomised controlled trials. The core clinical journals defined by the National Library of Medicine, known as the Abridged Index Medicus, included 118 journals in 2007, covering all specialties of clinical medicine and public health sciences (see web extra appendix 2).22 We stratified these journals into high and lower impact groups according to the total citations in 2007 defined by the Web of Science.23 The five high impact journals, with the highest number of total citations, were the Annals of Internal Medicine, BMJ, JAMA, Lancet, and New England Journal of Medicine. After removing duplicate articles, our search resulted in 3662 journal reports.
Sample size and random sampling
Prior to our definitive study, we carried out a pilot study of 139 randomised trials and found that 62 (45%) reported subgroup analyses and 27 (19%) claimed subgroup effects.
Our sample size estimation for the definitive study was based on the examination of study characteristics associated with the claim of subgroup effects for any outcome. In our regression analysis of study characteristics with the claim of subgroup effects, we planned to include six study characteristics, a total of nine categories of variables. Setting a criterion of 10 events (that is, the claim of subgroup effect) for each category resulted in a total of 90 events (and at least 90 total non-events). Given the results of our pilot study, we determined we would require a total of 464 trials.21
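The arithmetic behind this estimate can be reproduced with a short sketch. It assumes that the required number of trials is the target number of events divided by the pilot event rate (27 claims among 139 trials) and rounded up; the text does not spell out this step, and the function name is hypothetical.

```python
import math

def required_trials(n_categories, events_per_category, pilot_events, pilot_total):
    """Return (target events, trials needed to expect that many events).

    Assumption: trials needed = target events / pilot event rate, rounded up.
    """
    target_events = n_categories * events_per_category
    trials = math.ceil(target_events * pilot_total / pilot_events)
    return target_events, trials

# Nine categories, 10 events each, pilot rate of 27 claims in 139 trials.
events, trials = required_trials(9, 10, 27, 139)
print(events, trials)
```

Under this assumption the sketch returns 90 events and 464 trials, matching the figures reported above.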
We used the random sampling procedure available in the Stata statistical program to randomly select, in a 1:1 ratio, study reports from each journal group—that is, high versus lower impact journals. We repeated the random sampling until the planned sample size was reached. At each sampling process we excluded previously sampled reports from the database. We ultimately chose 570 reports in each of the high and lower impact journals, resulting in 1140 reports.
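The sampling procedure described above can be sketched as follows. This is an illustration in Python rather than the Stata routine that was actually used; the report identifiers, pool sizes, round sizes, and seed are all hypothetical, because the text gives only the final count of 570 reports per journal group.

```python
import random

def sample_in_rounds(pool, round_sizes, rng):
    """Draw several batches without replacement, removing each batch
    from the pool before the next draw."""
    remaining = list(pool)
    chosen = []
    for size in round_sizes:
        batch = rng.sample(remaining, size)
        chosen.extend(batch)
        drawn = set(batch)
        # Exclude previously sampled reports from later rounds.
        remaining = [r for r in remaining if r not in drawn]
    return chosen

rng = random.Random(2007)  # hypothetical fixed seed, for reproducibility

# Hypothetical report identifiers; per-group totals are not given in the text.
high_pool = [f"high-{i}" for i in range(1200)]
lower_pool = [f"lower-{i}" for i in range(2400)]

# Hypothetical round sizes that add up to 570 reports per group (1:1 ratio).
high_sample = sample_in_rounds(high_pool, [400, 170], rng)
lower_sample = sample_in_rounds(lower_pool, [400, 170], rng)
```

Removing each batch from the pool before the next round is what guarantees that no report is selected twice.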
Study screening and data extraction
Eight pairs of reviewers trained in methodology used standardised, previously piloted forms with detailed written instructions on screening the title, abstract, and full text and extracting data, independently and in duplicate.21 To ensure consistency across reviewers we carried out calibration exercises before starting the review.
While screening the title and abstract, reviewers determined if the studies were randomised controlled trials enrolling humans. The reviewers independently screened the full text of potentially eligible trials to determine eligibility. At the stage of full text screening, the reviewers selected a primary outcome for eligible studies using prespecified criteria (see web extra appendix 3) and identified a pairwise comparison if the studies included three or more study arms (see web extra appendix 4).
We defined a subgroup as a subset of a trial population that was identified on the basis of a characteristic of a patient or intervention that was measured either at baseline or after randomisation. We defined a subgroup analysis as a statistical analysis that explored whether effects of the intervention (experimental v control) differed according to status of a subgroup variable.
We judged a subgroup analysis to be present if the study reported one of the following: a point estimate and an associated confidence interval or a P value for one or more subgroups, the magnitude of difference in the effect between patient subgroups, the results from an interaction test, or an explicit statement that a subgroup analysis had been done.
The reviewers extracted data on study characteristics, including funding sources; clinical area; and type of intervention, and determined whether results for the primary outcome were statistically significant using a threshold of P<0.05. Reviewers recorded whether trials reported subgroup analyses for any outcomes (primary or secondary), number of outcomes for which subgroup analyses were reported, type of outcomes reported, number of subgroup analyses reported, whether any subgroup analysis was specified a priori, and whether any subgroup effect was stated to have been analysed by a test of interaction. We used detailed written instructions for extracting this information.
We defined a priori subgroup analyses as those that prespecified subgroup hypotheses—that is, prespecified subgroup variables for examination of a subgroup effect. We defined the source of funding based on statements reported in the methods, disclosure of conflicts of interest, acknowledgements, and funding sections of the study report. We categorised the source of funding as governmental agencies, private not for profit organisations, industry funding, explicit statement of no funding, or funding source not reported. When the reviewers were unclear as to the category of declared funding source, we searched the websites of funding agencies for clarification.
Teams of reviewers resolved discrepancies by consensus or, if a discrepancy remained, through discussion with one of two arbitrators (XS or GHG). The inter-rater agreement was high for initial opinions on study eligibility (observed agreement=95%, κ=0.80) and reporting of a subgroup analysis (observed agreement=91%, κ=0.82).
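For readers unfamiliar with the two agreement statistics, the sketch below shows how observed agreement and κ relate for a pair of reviewers making a yes/no judgment. It assumes Cohen's κ for two raters (the text does not name the variant), and the counts in the example are hypothetical, not data from this review.

```python
def agreement_and_kappa(both_yes, yes_no, no_yes, both_no):
    """Observed agreement and Cohen's kappa from a 2x2 table of two raters."""
    n = both_yes + yes_no + no_yes + both_no
    observed = (both_yes + both_no) / n
    rater1_yes = (both_yes + yes_no) / n
    rater2_yes = (both_yes + no_yes) / n
    # Agreement expected by chance if the raters judged independently.
    expected = rater1_yes * rater2_yes + (1 - rater1_yes) * (1 - rater2_yes)
    kappa = (observed - expected) / (1 - expected)
    return observed, kappa

# Hypothetical counts for 100 reports screened by two reviewers.
observed, kappa = agreement_and_kappa(both_yes=40, yes_no=5, no_yes=5, both_no=50)
print(round(observed, 2), round(kappa, 2))
```

κ is lower than the observed agreement because it discounts the agreement that would be expected by chance alone.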
We calculated the proportion of trials reporting subgroup analyses. To examine the association of reporting versus not reporting of subgroup analyses with study characteristics, we carried out univariable and multivariable logistic regression analyses, with reporting of a subgroup analysis as the dependent variable.
We prespecified six study characteristics: journal type (high v lower impact), study area (non-surgical v surgical), mean sample size per arm, number of prespecified primary outcomes, source of funding (industry v other), and statistical significance of the primary outcome. We also prespecified the interaction between the statistical significance of the primary outcome and funding source. These seven factors were included in the regression model as independent variables. We also prespecified direction of these hypotheses: trials are more likely to report subgroup analyses if they have a larger sample size, are published in high impact journals, investigate non-surgical interventions, and report more prespecified primary outcomes. Trials funded by industry are more likely to report subgroup results when the primary outcome is not statistically significant than if the primary outcome is statistically significant.
To test the interaction between statistical significance of the primary outcome (significant v not significant) and type of funding (industry v non-industry), we included the six independent variables and the interaction term in our regression model. Conditional on the finding of the statistically significant interaction (P<0.05), we further calculated the association of funding source with subgroup reporting in two subgroups (presence v absence of statistically significant main effect).
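One way to write down the model described above is given below. The symbols and the 0/1 coding are assumptions introduced here for illustration: S indicates reporting of a subgroup analysis, J a high impact journal, A a non-surgical trial, N the sample size per arm (possibly categorised), K the number of prespecified primary outcomes, F industry funding, and P a statistically significant primary outcome.

```latex
\[
\operatorname{logit}\,\Pr(S = 1)
  = \beta_0 + \beta_1 J + \beta_2 A + \beta_3 N + \beta_4 K
    + \beta_5 F + \beta_6 P + \beta_7 (F \times P)
\]
\[
\text{OR}_{\text{funding} \mid P = 0} = e^{\beta_5},
\qquad
\text{OR}_{\text{funding} \mid P = 1} = e^{\beta_5 + \beta_7}
\]
```

Under this coding, the test of interaction is the test of whether β7 differs from zero, and the two odds ratios are the funding associations in trials without and with a statistically significant primary outcome.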
We compared the reporting of subgroup analyses in industry funded versus non-industry funded trials, including the number of subgroup analyses reported and the number of variables for patients or interventions, as well as the number of outcomes used for subgroup analyses. We also examined whether authors specified a subgroup hypothesis a priori and reported a test of interaction for subgroup analysis in their trial reports. Given that trial investigators probably report a smaller number of subgroup analyses than are actually carried out, we also estimated the number of subgroup analyses that were likely to have been done by trial investigators according to the information provided in the study reports. Typically, if authors stated that they specified a number of variables and used a number of outcomes for the subgroup analyses, we would multiply these together to estimate the number of subgroup analyses. We used the Wilcoxon rank sum test for the analysis of continuous data and the χ2 test for binary data.
To further examine whether industry funded trials versus non-industry funded trials differed in the rate of a priori specification of subgroup hypotheses and use of an interaction test, we also carried out multivariable logistic regression, including the six variables and the interaction term (funding×statistical significance of the primary outcomes) in our model.
In our analysis we defined a study as industry funded if it received partial or full funding from industry. We considered a study as non-industry funded if it received other sources of funding, had no funding, or did not report a funding source. To examine the influence of trials that did not report a funding source, we did sensitivity analyses excluding those studies. We used Stata 11.0 for all analyses. All comparisons were two tailed, and P<0.05 was considered statistically significant.
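The funding classification and the sensitivity analysis set can be illustrated with a minimal sketch. The category labels, trial identifiers, and function names below are hypothetical; only the rules (any industry funding counts as industry funded, and trials with no reported funding source are dropped in the sensitivity analysis) come from the text.

```python
INDUSTRY = "industry"
NOT_REPORTED = "not reported"

def is_industry_funded(sources):
    """Partial or full industry funding counts as industry funded."""
    return INDUSTRY in sources

def split_for_analysis(trials):
    """Return the industry flag per trial for the primary analysis, and
    the same flags restricted to trials that reported a funding source."""
    primary = {tid: is_industry_funded(src) for tid, src in trials.items()}
    sensitivity = {tid: flag for tid, flag in primary.items()
                   if NOT_REPORTED not in trials[tid]}
    return primary, sensitivity

# Hypothetical trials, each with the set of funding categories it declared.
trials = {
    "T1": {"industry"},
    "T2": {"industry", "government"},   # partial industry funding
    "T3": {"private not for profit"},
    "T4": {"no funding"},
    "T5": {"not reported"},
}
primary, sensitivity = split_for_analysis(trials)
```

In this example T5 counts as non-industry funded in the primary analysis and is excluded from the sensitivity analysis.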
This study included 469 eligible trials reported in 459 articles (figure, also see web extra appendix 5), of which 207 (44%) reported subgroup analyses. Table 1 presents study characteristics of trials that did and did not report subgroup analyses. Of these 469 trials, 186 were funded by industry, 66 did not report a funding source, and the other 217 had other sources of funding or received no funding (see web extra appendix 6). In the 66 trials that did not report a funding source, 15 (23%) reported subgroup analyses (see web extra appendix 7); these trials generally had small sample sizes (interquartile range 16-51 in mean size per arm).
Univariable analyses showed that high impact journals, non-surgical trials, larger sample size, and industry funding were statistically associated with more frequent reporting of subgroup analyses (table 2). Multivariable analyses showed more frequent reporting of subgroup analyses with high impact journals, non-surgical trials, and larger sample size (table 2). A differential strength of association of industry funding with subgroup reporting was present in trials with and without significant primary outcomes (interaction P=0.021): when the primary outcome was not significant, the likelihood of reporting subgroup analyses in industry funded trials was higher than in non-industry funded trials (67% v 40%, adjusted odds ratio 2.29, 95% confidence interval 1.30 to 4.72, P=0.005). By contrast, industry funding was not statistically associated with reporting subgroup analysis when the primary outcome was significant (37% v 48% in other trials, 0.79, 0.46 to 1.36).
In the 207 trials reporting subgroup analyses, 99 (48%) were funded by industry. No statistically significant differences were present in characteristics of subgroup reporting between trials funded or not funded by industry in the unadjusted analyses (table 3). The proportion of trials prespecifying subgroup hypotheses (<40%) and reporting an interaction test for analysis of subgroup effect (<50%) was low in both industry and non-industry funded trials. The total number of subgroup analyses that were probably carried out seemed to be higher in industry funded trials than in non-industry funded trials (P=0.063). Multivariable analyses found that industry funded trials were associated with less prespecification of subgroup hypotheses (adjusted odds ratio 0.49, 95% confidence interval 0.26 to 0.94, P=0.032, table 4) and less use of the interaction test for analyses of subgroup effects (0.52, 0.28 to 0.97, P=0.039, table 5) than non-industry funded trials.
Excluding those studies that failed to clearly report funding sources did not change the association of study characteristics with reporting of subgroup analyses (see web extra appendix 8). The magnitude of association of industry funding with subgroup reporting, in the absence of a statistically significant primary outcome, was larger (3.23, 1.57 to 6.66) than in our primary analysis. Industry funding remained statistically associated with less prespecification of subgroup hypotheses (0.52, 0.27 to 0.99, see web extra appendix 9) and non-significantly associated with a lower likelihood of reporting a test of interaction for subgroup analyses (0.58, 0.32 to 1.08, see web extra appendix 10).
Randomised controlled trials with larger sample sizes, studying non-surgical topics, and in high impact journals were associated with more frequent reporting of subgroup analyses. The higher rate of reporting in high impact journals may be a result of the independent efforts of the trials’ investigators. Alternatively, editors and reviewers in high impact journals may be more inclined to request such analyses than those in journals with a lower impact. Without direct correspondence with authors, the true explanation remains speculative.
We also found a differential strength of association of trial funding with subgroup reporting in trials with and without statistically significant primary outcomes. If results for the primary outcome were not statistically significant, the odds of reporting subgroup analyses in industry funded trials were 2.3 times that of other trials.
The implication of our results is that particular caution is needed in interpreting subgroup analyses of otherwise negative, and thus possibly unexciting, studies when they are funded by industry. It is perhaps ironic that this finding comes from a subgroup analysis of a study that could be viewed as having otherwise unexciting results. Applying our previously published criteria for the credibility of a subgroup analysis,24 we note that our hypothesis was prespecified and that we correctly prespecified the direction of effect. This subgroup finding was the only subgroup hypothesis tested. The interaction P value (0.02) was statistically significant, and the magnitude of subgroup effect (the difference of the associations in the presence versus absence of a statistically significant primary outcome) was large. Our results are consistent with a large body of literature suggesting that positive “spin” is commonly applied in industry funded studies12 13 15 16 17 19 25 and are supported with a clear rationale (corresponding to biological rationale in randomised trials). The rationale of our a priori hypothesis is as follows. If a study has positive findings, clinicians are likely to consider the intervention for all eligible patients. Under these circumstances, there is no motivation for industry sponsors to carry out subgroup analyses. If, however, the trial has negative findings, clinicians are unlikely to consider the intervention for any patients unless an analysis suggests benefit in a subgroup of patients. Thus, this subgroup analysis meets our previously suggested criteria for credibility.
Conduct of subgroup analyses in randomised controlled trials
We found that in both industry and non-industry funded trials, the proportion prespecifying subgroup hypotheses and reporting a test of interaction was low (table 3), suggesting that many fail to meet key methodological criteria in carrying out subgroup analyses. One study26 found that some so called “prespecified” subgroup analyses were not actually defined in the study protocols, suggesting that the proportion of real prespecified subgroup analyses may be even lower than reported. It is possible that trial investigators may, blinded to the trial data, prespecify subgroup hypotheses in the detailed statistical analysis plan, but not in the study protocol, before the trial is closed out, which represents an appropriate approach to a priori specification of subgroup hypotheses. However, in our sample, whatever was done elsewhere, most trial investigators failed to include this information in the study reports.
We also found that industry funded trials were less likely to prespecify subgroup hypotheses and were less likely to carry out the test of interaction, irrespective of whether the primary outcome was or was not statistically significant. These findings further support our hypothesis that trials funded by industry are more likely to look for positive subgroup findings, and suggest that, compared with non-industry funded trials, the quality of carrying out subgroup analyses is more questionable. On the other hand, with industry funded trials, subgroup analyses reported in journals may differ from those submitted to regulatory authorities. Typically, regulators require a prespecified statistical plan including subgroup analyses and caution about the claim of subgroup effects. Industry funded trials may choose to only report prespecified subgroup analyses and provide more details on the conduct and interpretation of subgroup analyses when submitting to regulatory authorities. However, the failure to disclose some of these trials’ results publicly has still hampered the unbiased assessment of the effect of treatments.27 28
Instead of using the test of interaction, many trials tested whether the results of each subgroup met the threshold for statistical significance.1 6 This approach to analysing subgroup effects fails to deal with the critical null hypothesis of subgroup analysis—that is, there is no difference in treatment effect between subgroups. The interaction test, which addresses the likelihood that chance explains the apparent differences in effect across subgroups, helps avoid spuriously positive subgroup findings.
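The difference between the two approaches can be seen in a small worked example. The sketch below uses one common form of interaction test, a normal approximation comparing two independent log odds ratios; the text does not say which test individual trials used, and the subgroup estimates are hypothetical numbers chosen for illustration.

```python
import math

def two_sided_p(z):
    """Two sided P value for a standard normal test statistic."""
    return math.erfc(abs(z) / math.sqrt(2))

def within_subgroup_p(log_or, se):
    """Tests 'no treatment effect in this subgroup' (not the interaction)."""
    return two_sided_p(log_or / se)

def interaction_p(log_or_a, se_a, log_or_b, se_b):
    """Tests 'no difference in treatment effect between the two subgroups'."""
    diff = log_or_a - log_or_b
    se_diff = math.sqrt(se_a ** 2 + se_b ** 2)
    return two_sided_p(diff / se_diff)

# Hypothetical subgroup estimates (log odds ratio, standard error).
p_a = within_subgroup_p(-0.40, 0.18)            # subgroup A alone
p_b = within_subgroup_p(-0.15, 0.20)            # subgroup B alone
p_int = interaction_p(-0.40, 0.18, -0.15, 0.20)  # A versus B

print(p_a < 0.05, p_b < 0.05, p_int < 0.05)
```

With these numbers the effect is statistically significant in subgroup A and not in subgroup B, yet the interaction test gives no evidence that the two effects differ, so a claim that treatment works only in subgroup A would not be supported.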
Interpretation of subgroup analyses
Subgroup analyses represent an effort to tackle heterogeneity of treatment effects. With appropriate design, conduct, and interpretation of studies, findings from subgroup analyses can provide crucial information that ultimately improves the management of patients. Subgroup analyses, however, pose many challenges. On the one hand, trials are rarely planned to detect subgroup effects, resulting in false negative findings for subgroups. On the other hand, trial investigators may carry out a large number of subgroup analyses without prespecifying subgroup hypotheses and without testing for interaction; as a result, subgroup analyses are often associated with false positive findings.4 29 30 31
The purposes of doing subgroup analyses vary. They may serve to generate important hypotheses (exploratory subgroup analyses). Indeed, there are examples of subgroup analyses that have generated important hypotheses that proved real when tested in subsequent randomised trials.32 33 Because, more often, such preliminary apparent subgroup findings ultimately prove spurious, a much higher standard is necessary to make definitive claims for subgroups—that is, confirmatory subgroup analyses.
To distinguish between true and spurious subgroup effects we have suggested a set of criteria that can systematically be applied.24 These criteria cover aspects of the design, conduct, and context of subgroup analyses. The more criteria subgroup analyses fulfil, the more likely the apparent subgroup effect is real. For instance, in an industry funded randomised trial examining the effect of ivabradine versus placebo for patients with coronary artery disease and left ventricular systolic dysfunction on the composite primary outcome of cardiovascular mortality, admission to hospital for acute myocardial infarction or heart failure,34 the authors claimed a likely effect of treatment on reduction of admission to hospital for acute myocardial infarction, admission to hospital for acute myocardial infarction or unstable angina, and coronary revascularisation (all of which were secondary outcomes) in a subgroup of patients with a baseline heart rate of 70 or more beats/min. Applying our criteria, we found that, although authors prespecified subgroup hypotheses, provided external evidence consistent with their findings, and justified the biological rationale of their findings, they carried out a large number of subgroup analyses (probably 99), and failed to report the P value associated with the test for interaction. They also did not prespecify the direction of the interaction or check the independence of multiple significant subgroup effects. Failure to meet most criteria, particularly the large number of subgroup hypotheses tested and the absence of a statistically significant interaction, suggests the subgroup claim warrants a high degree of scepticism.
Limitations of the study
Our study has several limitations. Firstly, we did not search all medical journals and therefore our findings may not be applicable to journals outside our sample. We did, however, include all core clinical journals, which is a much wider spectrum of journals than previously studied. Secondly, all trials in our study were published in 2007, and our results may not be generalisable to other years. A previous study has, however, suggested a similar relative frequency of subgroup reporting from 1994 to 2004.6 Thirdly, we categorised trials as positive or negative according to the P value threshold of 0.05, and the approach to categorising trials may be questioned. However, most editors and authors still use such categorisation. Fourthly, we dichotomised the journals as high versus lower impact according to the total number of citations, and trials as industry funded versus non-industry funded. These categorisations ignore gradients both in impact and in industry influence. For instance, it may be expected that industry initiated projects would have substantial influence from industry on the interpretation and reporting of studies, whereas investigator initiated grants that obtained some industry support would have much less influence from industry. Our dichotomisation approach precluded exploring the impact of such gradients. Strengths of our study include the identification of a large cohort of randomised controlled trials acquired through a systematic search, use of standardised screening and data extraction forms as well as calibration exercises to enhance the consistency between reviewers, and prespecified hypotheses to guide our analyses.21
Randomised controlled trials published in high impact journals, with larger sample size, studying non-surgical topics, and with industry funding—if the primary outcome is not statistically significant—are associated with more frequent reporting of subgroup analyses. The proportion of trials prespecifying subgroup hypotheses and carrying out interaction tests for subgroup analyses is low in both industry funded and non-industry funded trials. Industry funded trials, regardless of the statistical significance of primary outcomes, less often prespecify subgroup hypotheses and less often use the interaction test for analyses of subgroup effects compared with trials that are not funded by industry. Our findings suggest that clinicians, reviewers, and journal editors should view all subgroup analyses with caution. Particular attention is warranted in industry funded trials with negative results for the primary outcome.
What is already known on this topic
Trial authors often report subgroup analyses
Larger trials are more likely to report subgroup analyses
A small proportion of trials prespecify subgroup hypotheses and use the formal test of interaction for subgroup analyses
What this study adds
Trials published in high compared with lower impact journals and non-surgical trials are more likely to report subgroup analyses
Subgroup analyses are more likely to be reported by trials funded by industry than by non-industry funded trials, but only if the primary outcome is not statistically significant
Industry funded trials prespecify subgroup hypotheses and use the interaction test for analyses of subgroup effects less often than non-industry funded trials
We thank Monica Owen for administrative assistance and Aravin Duraikannan for developing the electronic data abstraction forms.
Contributors: XS and GHG conceived the study, had full access to all of the data in the study, and take responsibility for the integrity of the data and the accuracy of the data analysis. GHG is the guarantor. XS, GHG, MB, JWB, EAA, SDW, DGA, and DH-A designed the study. MB, EAA, JWB, ND-G, JJY, FM, MMB, DB, DM, POV, GM, SKS, PD, BCJ, PA-C, BH, XS, JT, NDD, and NB acquired the data. XS, GHG, SDW, and DH-A analysed and interpreted the data. XS drafted the manuscript. All authors critically revised the manuscript. XS provided administrative and technical support. The funder had no role in the study design, writing of the manuscript, or the decision to submit this manuscript for publication.
Funding: This study was supported by the National Natural Science Foundation of China (project No 70703025). XS is supported by the Ontario graduate scholarship and the National Natural Science Foundation of China. MB is supported by santésuisse and the Gottfried and Julia Bangerter-Rhyner Foundation. JWB is funded by a new investigator award from the Canadian Institutes of Health Research and the Canadian Chiropractic Research Foundation. DB is supported by the European Union (grant award health-F5-2009-223060). DM is supported by a research scholarship from the Swiss National Science Foundation (PBBSP3-124436). PD is supported by a Dennis W Jahnigan career development award by the American Geriatrics Society. BCJ holds a SickKids Foundation post-doctoral fellowship. PA-C is funded by a Miguel Servet contract by the Instituto de Salud Carlos III (CP09/00137). JJY is supported by a career scientist award from the Ontario Ministry of Health and Long-Term Care.
Competing interests: All authors have completed the Unified Competing Interest form at www.icmje.org/coi_disclosure.pdf (available on request from the corresponding author) and declare: This study was supported by the National Natural Science Foundation of China (project No 70703025); no financial relationships with any organisations that might have an interest in the submitted work in the previous three years, no other relationships or activities that could appear to have influenced the submitted work.
Ethical approval: Not required.
Data sharing: No additional data available.
This is an open-access article distributed under the terms of the Creative Commons Attribution Non-commercial License, which permits use, distribution, and reproduction in any medium, provided the original work is properly cited, the use is non commercial and is otherwise in compliance with the license. See: http://creativecommons.org/licenses/by-nc/2.0/ and http://creativecommons.org/licenses/by-nc/2.0/legalcode.