Methotrexate monotherapy and methotrexate combination therapy with traditional and biologic disease modifying antirheumatic drugs for rheumatoid arthritis: abridged Cochrane systematic review and network meta-analysis
BMJ 2016; 353 doi: https://doi.org/10.1136/bmj.i1777 (Published 21 April 2016) Cite this as: BMJ 2016;353:i1777
- Glen S Hazlewood, assistant professor1 2 3 4,
- Cheryl Barnabe, assistant professor1 2 4,
- George Tomlinson, associate professor5,
- Deborah Marshall, associate professor2 4,
- Dan Devoe, research assistant4,
- Claire Bombardier, professor5 6 7
- 1Department of Medicine, University of Calgary, Calgary, AB, Canada, T2N4Z6
- 2McCaig Institute for Bone and Joint Health, University of Calgary, Calgary, AB, Canada, T2N4Z6
- 3Institute of Health, Policy, Management and Evaluation, University of Toronto, Toronto, ON, Canada, M5T3M6
- 4Department of Community Health Sciences, University of Calgary, Calgary, AB, Canada, T2N4Z6
- 5Department of Medicine and Institute of Health Policy, Management, and Evaluation, University of Toronto, Toronto, ON, Canada, M5G2C4
- 6Toronto General Research Institute, University Health Network, Toronto, ON, Canada, M6J3S3
- 7Mount Sinai Hospital, Division of Rheumatology, Toronto, ON, Canada, M5T3L9
- Correspondence to: G S Hazlewood, 3330 Hospital Dr NW, Calgary, AB, Canada T2N1N1
- Accepted 22 March 2016
Objective To compare methotrexate based disease modifying antirheumatic drug (DMARD) treatments for rheumatoid arthritis in patients naive to or with an inadequate response to methotrexate.
Design Systematic review and Bayesian random effects network meta-analysis of trials assessing methotrexate used alone or in combination with other conventional synthetic DMARDs, biologic drugs, or tofacitinib in adult patients with rheumatoid arthritis.
Data sources Trials were identified from Medline, Embase, and Central databases from inception to 19 January 2016; abstracts from two major rheumatology meetings from 2009 to 2015; two trial registers; and hand searches of Cochrane reviews.
Study selection criteria Randomized or quasi-randomized trials that compared methotrexate with any other DMARD or combination of DMARDs and contributed to the network of evidence between the treatments of interest.
Main outcomes American College of Rheumatology (ACR) 50 response (major clinical improvement), radiographic progression, and withdrawals due to adverse events. A comparison between two treatments was considered statistically significant if its credible interval excluded the null effect, indicating >97.5% probability that one treatment was superior.
Results 158 trials were included, with between 10 and 53 trials available for each outcome. In methotrexate naive patients, several treatments were statistically superior to oral methotrexate for ACR50 response: methotrexate plus sulfasalazine and hydroxychloroquine (“triple therapy”), methotrexate plus several biologics (abatacept, adalimumab, etanercept, infliximab, rituximab, tocilizumab), and methotrexate plus tofacitinib. The estimated probability of ACR50 response was similar between these treatments (range 56-67%), compared with 41% with methotrexate. Methotrexate combined with adalimumab, etanercept, certolizumab, or infliximab was statistically superior to oral methotrexate for inhibiting radiographic progression, but the estimated mean change over one year with all treatments was less than the minimal clinically important difference of 5 units on the Sharp-van der Heijde scale. Triple therapy had statistically fewer withdrawals due to adverse events than methotrexate plus infliximab. After an inadequate response to methotrexate, several treatments were statistically superior to oral methotrexate for ACR50 response: triple therapy, methotrexate plus hydroxychloroquine, methotrexate plus leflunomide, methotrexate plus intramuscular gold, methotrexate plus most biologics, and methotrexate plus tofacitinib. The probability of response was 61% with triple therapy and ranged widely (27-70%) with other treatments. No treatment was statistically superior to oral methotrexate for inhibiting radiographic progression. Methotrexate plus abatacept had a statistically lower rate of withdrawals due to adverse events than several treatments.
Conclusions Triple therapy (methotrexate plus sulfasalazine plus hydroxychloroquine) and most regimens combining biologic DMARDs with methotrexate were effective in controlling disease activity, and all were generally well tolerated in both methotrexate naive and methotrexate exposed patients.
Methotrexate based treatments form the core of rheumatoid arthritis treatment. Methotrexate is recommended as the first disease modifying antirheumatic drug (DMARD) for most patients with rheumatoid arthritis, and methotrexate co-prescription is recommended when using biologic DMARDs or the recently approved tofacitinib.1 2 Combining methotrexate with other conventional synthetic DMARDs, however, is more controversial. A trial of conventional synthetic DMARD combination therapy before biologic therapy is not recommended by either major rheumatology guideline, although each provides the option.1 2 Understanding the comparative benefits and harms of these treatments is essential to inform decision making, as treatment with biologic DMARDs or tofacitinib costs 10-20 times more than treatment with methotrexate and most conventional synthetic DMARDs.
Network (mixed treatment) meta-analyses are a natural avenue of comparative effectiveness research, as they combine all direct and indirect evidence to estimate treatment effects between all treatments of interest.3 If treatments A and B are in the same study, direct evidence links A and B. If they are compared in separate studies with a common comparator C, then the A-C and B-C studies allow an indirect comparison of A and B. Longer chains of indirect comparisons (A-B, B-C, C-D) are also possible. Considering indirect evidence is critical if a treatment decision must be made and the treatments have not been directly compared in a head-to-head trial. Indirect evidence is also important to consider when head-to-head trials are available, as it adds to the entire body of evidence and may help to refine the precision in estimation of the treatment effect.3
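The A-C and B-C logic described above can be illustrated with a simple adjusted indirect comparison. This is a minimal sketch, not the Bayesian model used in this review: it assumes effects on an additive scale (such as log odds ratios), independent sets of trials, and a normal approximation. The function name and all input values are hypothetical.

```python
import math

def indirect_comparison(d_ac, se_ac, d_bc, se_bc):
    """Indirect estimate of A versus B through the common comparator C.

    Inputs are effects on an additive scale (for example, log odds ratios)
    with their standard errors; the two sets of trials are assumed independent.
    """
    d_ab = d_ac - d_bc                      # effects relative to C subtract
    se_ab = math.sqrt(se_ac**2 + se_bc**2)  # variances add, so precision is lost
    return d_ab, se_ab

# Hypothetical inputs: log odds ratios of A vs C and of B vs C.
d_ab, se_ab = indirect_comparison(math.log(2.0), 0.20, math.log(1.5), 0.25)
low, high = d_ab - 1.96 * se_ab, d_ab + 1.96 * se_ab
print(f"Indirect OR, A vs B: {math.exp(d_ab):.2f} "
      f"(95% CI {math.exp(low):.2f} to {math.exp(high):.2f})")
```

Because the variances add, an indirect estimate is always less precise than either of the direct estimates it is built from, which is why combining it with any available head-to-head evidence can tighten the overall estimate.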
A previous Cochrane network meta-analysis examined the relative effects of different biologic therapies through indirect comparisons and found some differences between agents.4 Our review expands on this, by including combination therapy with methotrexate plus conventional synthetic DMARDs. A previous traditional (non-network) Cochrane meta-analysis did not find an additional overall benefit with combination therapy over methotrexate alone.5 By including indirect evidence, we expand the evidence base available to draw from. For example, three recently published trials compared combination therapy with methotrexate plus sulfasalazine plus hydroxychloroquine versus methotrexate plus anti-tumor necrosis factor (TNF) therapy.6 7 8 The inclusion of these trials in a network meta-analysis adds indirect evidence on the relative effects of methotrexate plus sulfasalazine plus hydroxychloroquine compared with all other treatments in the network.
This is an abridged version of a Cochrane systematic review. The protocol and Cochrane review (once published) can be accessed from the Cochrane Library.9
We included randomized trials or quasi-randomized trials (where allocation to treatment groups was not truly random—for example, alternate patients) of at least 12 weeks’ duration that contained any intervention of interest (defined in detail below) and could be linked within the network through a shared comparator. For example, if we identified a trial comparing methotrexate with hydroxychloroquine, the trial was included if another trial was available that compared hydroxychloroquine with any other treatment of interest. We divided trials into groups based on patients’ previous exposure to methotrexate: either methotrexate naive or inadequate response to methotrexate. We excluded trials that required all patients to have failed to respond to anti-TNF therapy.
The interventions of interest were oral methotrexate; parenteral (intramuscular or subcutaneous) methotrexate; methotrexate in combination with conventional synthetic DMARD therapy including antimalarials (hydroxychloroquine/chloroquine), sulfasalazine, leflunomide, ciclosporin, intramuscular gold, and azathioprine; methotrexate in combination with biologic DMARD therapy (adalimumab, certolizumab pegol, etanercept, golimumab, infliximab, abatacept, rituximab, tocilizumab); and methotrexate in combination with tofacitinib. We applied no dose restriction to conventional synthetic DMARDs, given the variability of dosing in clinical practice. Biologic DMARDs and tofacitinib were limited to currently recommended doses or dose equivalents.
The major outcomes of the review were American College of Rheumatology (ACR) 50 response, a composite measure of improvement in disease activity (dichotomous outcome)13; radiographic progression, measured by Larsen, Sharp, or modified Larsen/Sharp scores (continuous outcome)14; and withdrawals due to adverse events, including death (dichotomous outcome). Multiple secondary outcomes were evaluated and are reported in the full Cochrane review.9
We did an electronic database search in Medline, Embase, and Cochrane Central from inception to 19 January 2016. The search strategies contained subject headings and keywords for “rheumatoid arthritis”, “methotrexate”, and “randomized controlled trial”. The search strategy is available in the full Cochrane review.9 We also searched the trial registries ClinicalTrials.gov (http://clinicaltrials.gov/) and the International Clinical Trials Registry Platform (http://apps.who.int/trialsearch/) by using the search term “rheumatoid arthritis AND methotrexate”. Finally, we hand searched for abstracts from American College of Rheumatology and European League Against Rheumatism conferences from 2009 to 2015 and reviewed existing Cochrane reviews. All languages were included.
Two review authors (GH, ChB) independently screened articles for inclusion by title or abstract and full text if necessary. Disagreements were resolved by consensus or discussion with a third review author (ClB).
Data extraction and handling of missing data
Three review authors working in pairs (GH, ChB, DD) abstracted relevant data from included studies into an Excel spreadsheet. Characteristics of trials and baseline characteristics of patients were extracted by one author (GH) and confirmed by a second (ChB or DD); outcome data were extracted independently, with disagreements resolved through discussion. For all trials, we also sought data from clinical trial registries (clinicaltrials.gov, www.who.int/ictrp/en/), the US Food and Drug Administration, the European Medicines Agency, and drug manufacturers’ websites. For continuous measures, if standard deviations were not available, we calculated them from available variance data (for example, standard errors) if possible. For toxicity outcomes, if the drug exposure was not available, we calculated it; we assumed withdrawals to occur at a constant rate, unless specific information was available to permit a more accurate calculation. If data were presented only in graphical format, two independent reviewers (GH, ChB) extracted numbers digitally and averaged or corrected them if an obvious discrepancy was apparent.
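The two derived quantities mentioned above can be sketched as follows. The conversion from a standard error to a standard deviation is standard. The exposure calculation is only one possible reading of "withdrawals at a constant rate" (patients who withdraw are assumed to contribute half of the trial duration on average) and is an assumption, not the review's documented formula. All function names are hypothetical.

```python
import math

def sd_from_se(se, n):
    """Standard deviation of a measure from the standard error of its mean."""
    return se * math.sqrt(n)

def sd_from_ci(lower, upper, n, z=1.96):
    """Standard deviation from a 95% confidence interval around a mean."""
    se = (upper - lower) / (2 * z)
    return sd_from_se(se, n)

def approx_exposure_years(n_start, n_withdrawn, duration_years):
    """Approximate patient years of exposure in one arm.

    Assumption: withdrawals are spread evenly over the trial, so each
    withdrawn patient contributes half of the trial duration on average.
    """
    completers = n_start - n_withdrawn
    return completers * duration_years + n_withdrawn * duration_years / 2

print(sd_from_se(0.5, 100))                 # SE 0.5 with 100 patients
print(approx_exposure_years(100, 20, 1.0))  # 100 patients, 20 withdrawals, 1 year
```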
Time point of outcome assessment
For parallel group trials, we used end of trial data for all outcomes. For crossover trials or trials comparing different DMARD sequences, we used data at the time of the initial switch in treatment for efficacy outcomes (ACR50 response and radiographic progression). For withdrawals due to adverse events in crossover and strategy trials, we abstracted data from the longest follow-up available, provided the adverse event was assigned to the treatment received at the time of the adverse event (on-treatment data). In certain trials, a switch in treatment was allowed or required for patients who failed to achieve a certain response at a given time (usually 12-16 weeks). Patients who receive early escape therapy are typically treated as treatment failures at all future time points. For early escape trials, we used end of trial (carried forward) data for ACR50 and radiographic progression. For withdrawals due to adverse events, we used on-treatment data from the longest follow-up available, similar to the approach for crossover trials.
Risk of bias of studies
The methodological quality of included trials was independently assessed using the Cochrane Collaboration’s tool for assessing risk of bias by two review authors or by one author (GH) if his assessment agreed with a published Cochrane review. Studies were graded as having a “low risk,” “high risk,” or “unclear risk” of bias across the seven specified domains.15 The risk of bias was assessed separately for the three major outcomes for domains in which the risk of bias could differ. The domains “blinding of participants” and “blinding of outcome assessment” were assessed separately for radiographic progression; “incomplete outcome data” was assessed separately for each of the three major outcomes. For each of the three major outcomes, we also judged an overall risk of bias.
For the primary analyses, we excluded trials with a high risk of bias for that outcome. We evaluated treatment effects by using odds ratios for ACR50 response and a standardized mean difference for radiographic progression. We estimated standardized mean differences by dividing the modeled change in each arm by the within trial pooled standard deviation of the change value. We summarized withdrawals due to adverse events as rate ratios to allow for differences in exposure between arms in early escape and crossover trials.
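For readers less familiar with these three effect measures, the sketch below computes each from arm level summary data. It shows the textbook two arm formulas only. In the review itself these quantities were estimated within the network model, and the function names and example numbers here are hypothetical.

```python
import math

def pooled_sd(sd1, n1, sd2, n2):
    """Within trial pooled standard deviation of two arms."""
    return math.sqrt(((n1 - 1) * sd1**2 + (n2 - 1) * sd2**2) / (n1 + n2 - 2))

def standardized_mean_difference(mean1, sd1, n1, mean2, sd2, n2):
    """Difference in mean change, in units of the pooled standard deviation."""
    return (mean1 - mean2) / pooled_sd(sd1, n1, sd2, n2)

def odds_ratio(events1, n1, events2, n2):
    """Odds ratio for a dichotomous outcome such as ACR50 response."""
    return (events1 / (n1 - events1)) / (events2 / (n2 - events2))

def rate_ratio(events1, exposure1, events2, exposure2):
    """Rate ratio; exposure is follow-up time (for example, patient years)."""
    return (events1 / exposure1) / (events2 / exposure2)

# Hypothetical arm level data.
print(standardized_mean_difference(1.0, 2.0, 50, 2.0, 2.0, 50))
print(odds_ratio(50, 100, 25, 100))
print(rate_ratio(10, 100.0, 5, 100.0))
```

The rate ratio divides events by follow-up time rather than by the number of patients, which is what allows arms with unequal exposure (as in early escape and crossover trials) to be compared fairly.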
We fitted random effects bayesian network meta-analyses for each outcome measure.16 17 18 The outcome measures were standardized mean differences for the continuous outcome radiographic progression, logarithms of odds ratios for the dichotomous outcome ACR50 response, and logarithms of rate ratios for withdrawals due to adverse events. The model code was based on published work and is presented in annotated form in appendix A.16 17 18 In this model, each trial of a pair of treatments is assumed to estimate a particular treatment effect, which varies around a mean effect with a shared between study variance (in fixed effects models in our sensitivity analyses, the between study variance around the mean effect was set to zero). This mean treatment effect is then broken down into “basic parameters” that are unique for each treatment. The basic parameter was set to “0” (no effect) for oral methotrexate, so that the basic parameter for the other treatments provided the treatment effect relative to oral methotrexate. From these basic parameters, we determined the treatment effect between every pair of treatments. This model assumes consistency between the indirect and direct evidence, such that both contribute to estimation of the same treatment effect.
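The step from basic parameters to every pairwise comparison follows directly from the consistency assumption: the effect of treatment B relative to treatment A is the basic parameter for B minus the basic parameter for A. The sketch below shows only that bookkeeping step. The treatment labels and effect values are hypothetical, and the annotated model code in appendix A remains the authoritative description of the analysis.

```python
from itertools import combinations

def pairwise_effects(basic):
    """All pairwise effects implied by basic parameters under consistency.

    `basic` maps each treatment to its effect relative to the reference
    treatment (whose own entry is 0). The effect of b relative to a is
    basic[b] - basic[a].
    """
    return {(a, b): basic[b] - basic[a] for a, b in combinations(basic, 2)}

# Hypothetical log odds ratios relative to oral methotrexate (the reference).
basic = {"oral MTX": 0.0, "MTX + X": 0.8, "MTX + Y": 0.5}
effects = pairwise_effects(basic)
for (a, b), d in effects.items():
    print(f"{b} vs {a}: {d:+.2f}")
```

In a Bayesian fit the same subtraction is applied to each posterior draw, so every pairwise comparison gets its own full posterior distribution.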
We used uninformative prior probability distributions for all parameters. We used 10 000 burn-in iterations followed by 10 000 monitoring iterations. We assessed convergence by running three chains, inspecting the sampling history plots, and calculating Gelman-Rubin-Brooks statistics.19 We assessed model fit by using residual deviance and the deviance information criterion. We used R statistical software version 3.1.2 (www.r-project.org) for all data analyses, with rjags package version 3-14 running Just Another Gibbs Sampler (JAGS) version 3.4.0.20
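The convergence check can be illustrated with the basic form of the potential scale reduction factor, which compares the variance between chains with the variance within chains. This is a sketch under stated assumptions: it omits the degrees of freedom correction of the Brooks and Gelman version, and the chains are simulated stand-ins, not output from the review's models.

```python
import random
import statistics

def gelman_rubin(chains):
    """Basic potential scale reduction factor for equal-length chains."""
    n = len(chains[0])
    chain_means = [statistics.fmean(chain) for chain in chains]
    # W: average within-chain variance; B: between-chain variance scaled by n.
    within = statistics.fmean([statistics.variance(chain) for chain in chains])
    between = n * statistics.variance(chain_means)
    pooled = (n - 1) / n * within + between / n
    return (pooled / within) ** 0.5

rng = random.Random(2016)
# Simulated stand-ins for three chains of 10 000 monitored draws each.
mixed = [[rng.gauss(0.0, 1.0) for _ in range(10_000)] for _ in range(3)]
stuck = [[rng.gauss(mu, 1.0) for _ in range(10_000)] for mu in (0.0, 0.0, 3.0)]
r_hat_mixed = gelman_rubin(mixed)
r_hat_stuck = gelman_rubin(stuck)
print(f"well mixed: {r_hat_mixed:.3f}; poorly mixed: {r_hat_stuck:.3f}")
```

Values close to 1 suggest that the chains are sampling the same distribution, whereas values well above 1 indicate that at least one chain has not converged to it.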
Presentation of results
We report the posterior median and 95% credible interval for all pairwise treatment effects. To facilitate interpretation of the results, we considered an effect to be “significant” if its 95% credible interval excluded the null effect, which equates to a 97.5% probability that one of the treatments is superior to the other. We recognized, however, that this cut-off was arbitrary and therefore also calculated the probability that each treatment was superior to each other and the rank ordering of treatments.
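The summaries described above can all be computed directly from posterior draws. The following sketch uses simulated draws of a log odds ratio in place of real model output. The function name and the simulated effect are hypothetical.

```python
import math
import random
import statistics

def summarize_effect(log_or_draws):
    """Posterior median, 95% credible interval, and probability of superiority."""
    # With n=40, the cut points fall at 2.5%, 5%, ..., 97.5%.
    cuts = statistics.quantiles(log_or_draws, n=40)
    lower, upper = cuts[0], cuts[-1]
    prob_superior = sum(d > 0 for d in log_or_draws) / len(log_or_draws)
    return {
        "median_or": math.exp(statistics.median(log_or_draws)),
        "lower_or": math.exp(lower),
        "upper_or": math.exp(upper),
        "prob_superior": prob_superior,
        # "Significant" here means the 95% interval excludes a log odds ratio of 0.
        "significant": lower > 0 or upper < 0,
    }

rng = random.Random(1)
draws = [rng.gauss(0.8, 0.3) for _ in range(10_000)]  # simulated posterior draws
summary = summarize_effect(draws)
print(summary)
```

If the lower limit of the 95% credible interval lies above the null, at least 97.5% of the draws favor the treatment, which is the equivalence stated in the text.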
We converted the average treatment effect for each outcome into an absolute response by using an assumed (baseline) value for oral methotrexate. For all analyses, the assumed baseline value was the median from a bayesian random effects model of the oral methotrexate arms. For ACR50 response, we used all trials in the analysis to estimate the assumed probability of response. For radiographic progression, we calculated the assumed mean over one year on the Sharp-van der Heijde scale from the trials that reported this outcome for oral methotrexate. We then calculated the absolute effect for each treatment by using this assumed value and the mean differences for each treatment relative to oral methotrexate on the Sharp-van der Heijde scale, which we calculated by multiplying the standardized mean differences by the pooled within arm standard deviation for studies that used the Sharp-van der Heijde scale. For withdrawals due to adverse events, we estimated the assumed rate at one year from the available trials and converted it to an absolute probability by using the rate ratio, assuming that the time to withdrawal over one year was exponentially distributed for each person.
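The three conversions described above can be sketched as below. The baseline response of 40.5% and the baseline progression of 2.34 points are the values reported for oral methotrexate in methotrexate naive patients. The odds ratio, rate, rate ratio, standardized mean difference, and standard deviation are hypothetical placeholders, and the function names are illustrative.

```python
import math

def probability_from_odds_ratio(baseline_prob, odds_ratio):
    """Absolute probability of response implied by an odds ratio."""
    odds = baseline_prob / (1 - baseline_prob) * odds_ratio
    return odds / (1 + odds)

def probability_from_rate(rate_per_year, years=1.0):
    """Probability of at least one event, assuming exponential time to event."""
    return 1 - math.exp(-rate_per_year * years)

def mean_difference_from_smd(smd, pooled_sd):
    """Convert a standardized mean difference back to scale units."""
    return smd * pooled_sd

# ACR50: reported baseline of 40.5% and a hypothetical odds ratio of 2.0.
p_response = probability_from_odds_ratio(0.405, 2.0)

# Withdrawals: hypothetical baseline rate of 0.05 per year and rate ratio of 2.0.
p_withdrawal = probability_from_rate(0.05 * 2.0)

# Radiographic progression: reported baseline of 2.34 points over one year,
# with a hypothetical SMD of -0.2 and pooled standard deviation of 5 points.
progression = 2.34 + mean_difference_from_smd(-0.2, 5.0)

print(f"{p_response:.3f} {p_withdrawal:.3f} {progression:.2f}")
```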
Quality of evidence (GRADE)
We used GRADE (Grading of Recommendations Assessment, Development and Evaluation) guidance to assess the quality of evidence from a network meta-analysis, with the quality of evidence graded from high (best) to very low (worst).21 This approach considers the quality of both the direct and indirect evidence, as well as the consistency (coherence) of the indirect and direct evidence and likelihood of “intransitivity,” which exists if heterogeneity is present in the trials that form the different comparisons within the indirect evidence. We did “node splitting” to separate the indirect evidence from the direct evidence to inform these evaluations.22 We calculated a statistical measure of inconsistency by comparing the treatment effects from the indirect and direct evidence where both were available and calculating the probability that they were the same; lower values indicate a higher likelihood of inconsistency.22
Meta-regression and sensitivity analyses
We did meta-regression for several pre-specified trial level characteristics. Selected post-hoc sensitivity analyses were added, including fixed effect models for the major outcome “radiographic progression,” as few trials were available to estimate a random effect. Full details of these analyses are provided in the full review.
We also did sensitivity analyses around several modeling assumptions. In the protocol, we had planned to use odds ratios to pool withdrawals due to adverse events. However, we changed the analyses to rate ratios given the differences in exposure between arms in early escape and crossover trials. As a sensitivity analysis, we compared the rate ratios with odds ratios, in which we used the total exposure (in patient months) in each arm as the denominator, instead of the number of patients. The model estimates the effect on the monthly odds of an outcome, assuming independence between months, and should approximate the rate ratio from a Poisson model.
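A small numerical illustration shows why the two measures should agree when events are rare relative to exposure. The counts and exposures below are hypothetical, and the calculation is a crude two arm comparison, not the network model itself.

```python
def rate_ratio(events1, months1, events2, months2):
    """Ratio of event rates per patient month."""
    return (events1 / months1) / (events2 / months2)

def monthly_odds_ratio(events1, months1, events2, months2):
    """Odds ratio using patient months, not patients, as the denominator."""
    odds1 = events1 / (months1 - events1)
    odds2 = events2 / (months2 - events2)
    return odds1 / odds2

# Hypothetical arms: 12 withdrawals in 1200 patient months
# versus 6 withdrawals in 1150 patient months.
rr = rate_ratio(12, 1200, 6, 1150)
monthly_or = monthly_odds_ratio(12, 1200, 6, 1150)
print(f"rate ratio {rr:.3f}, monthly odds ratio {monthly_or:.3f}")
```

Because the monthly probability of withdrawal is small, the odds and the rate in each arm are nearly equal, so the two ratios differ only slightly. The gap widens as events become more frequent relative to exposure.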
The choice of prior distribution for the between study variance may affect the estimated treatment effects, although this effect has been found to be small in analyses of 10 or more studies.23 For the primary analysis, we followed published guidance and chose a prior that was vague but realistic.23 We then did sensitivity analyses using an additional uninformative prior and potentially informative priors of Turner et al (odds ratio for ACR50 response and rate ratio for withdrawals due to adverse events)24 and Rhodes et al (radiographic progression).25
No patients were involved in setting the research question or the outcome measures, nor were they involved in the design and implementation of the study. There are no plans to involve patients in the dissemination of results.
Search results and description of included studies
From 9817 unique records, we identified 158 trials including more than 37 000 patients meeting our inclusion criteria (fig 1). Methotrexate dosing varied across studies and was variably reported (table 1; full list of studies in appendix B). Most trials enrolled patients with high disease activity (median baseline swollen joint count 15.1), with a similar distribution across drug classes (table 1).
In the methotrexate naive network (fig 2, top), most comparisons of methotrexate plus biologic DMARDs were against methotrexate, with no head-to-head comparisons between different biologic DMARDs. Trials evaluating conventional synthetic DMARD therapy were generally smaller but more interconnected than trials of biologic DMARDs. In the methotrexate inadequate response network (fig 2, bottom), connections between methotrexate plus biologic DMARDs and methotrexate were large in size (more patients), whereas connections between conventional synthetic DMARDs were few and small in size. Four head-to-head trials of biologic therapy formed links between several biologic therapies,26 27 28 29 and all four trials that compared methotrexate plus biologic therapy against methotrexate plus conventional synthetic DMARDs were included in this network.6 7 8 30 The network diagrams show all trials; the actual numbers of trials for each outcome varied and are reported below.
Methodological quality of included studies
The risk of bias of the trials varied considerably across each domain (fig 3). The overall risk of bias was high in 30% of trials for ACR50 response, in 21% for radiographic progression, and in 17% for withdrawals due to adverse events. These trials were excluded from the primary analysis.
Effects of interventions: major outcomes
Tables 2 and 3 summarize the main results; the pairwise results for each comparison and ranking of treatments are in appendix C. Appendix D shows details of the appraisal of the quality of evidence using the GRADE approach.
Methotrexate naive patients
ACR50—Twenty nine trials with 10 697 patients were included in this analysis. The combination of methotrexate plus several biologic DMARDs (intravenous abatacept, adalimumab, etanercept, infliximab, rituximab, tocilizumab 8 mg/kg) and methotrexate plus tofacitinib were statistically superior to oral methotrexate, with the 95% credible interval excluding the null effect (table 2). A high probability existed that methotrexate plus subcutaneous abatacept and methotrexate plus tocilizumab 4 mg/kg were also superior to oral methotrexate (97% for both) (table 2). “Triple therapy” (methotrexate plus sulfasalazine plus hydroxychloroquine) was the only conventional synthetic DMARD combination that had a statistically significant higher odds of ACR50 response than oral methotrexate. This comparison was based on indirect evidence and was judged to be moderate quality. The magnitude of the estimated probability of ACR50 response was similar between triple therapy (61.2%, 95% credible interval 44.2 to 76.5) and the other DMARDs that had a statistically significant benefit relative to oral methotrexate (point estimate range 56-67%). In comparison, the estimated probability of ACR50 response with oral methotrexate was 40.5%. In pairwise comparisons, we found no statistically significant difference between triple therapy and methotrexate plus any biologic DMARD, although we could not rule out an important difference, as the credible intervals were wide for some comparisons (appendix C).
Radiographic progression—Eighteen trials with 7594 patients were included in this analysis. The combinations of methotrexate plus several biologic DMARDs (adalimumab, certolizumab, etanercept, infliximab) were associated with a statistically significant reduction in radiographic progression relative to oral methotrexate (table 2). The probability of superiority to oral methotrexate was more than 95% with methotrexate plus rituximab and tocilizumab 8 mg/kg, with the 95% credible interval narrowly including the null effect. The sizes of the effects for all interventions relative to oral methotrexate were small. The expected radiographic progression was 2.34 points over one year with oral methotrexate (the reference treatment) and lower for all other treatments, which is below the minimal clinically important difference of 5 units on the Sharp-van der Heijde scale.31 We found no statistically significant differences between treatments in pairwise comparisons (appendix C).
In post-hoc sensitivity analyses using fixed effects models, the point estimates were nearly identical to the random effects model but the credible intervals were not as wide, resulting in several biologic DMARDs (plus methotrexate) for which the treatment effects reached statistical significance relative to oral methotrexate (appendix C). Methotrexate plus sulfasalazine plus hydroxychloroquine was not statistically superior to oral methotrexate in either the random effects or fixed effect models.
Withdrawals due to adverse events—Thirty seven trials with a total follow-up of 10 528 patient years were included in this analysis. The combination of methotrexate plus azathioprine had a statistically significant increase in withdrawals due to adverse events compared with oral methotrexate and several other treatments (table 2 and appendix C). A high probability also existed that intramuscular/subcutaneous methotrexate plus ciclosporin, methotrexate plus infliximab, or methotrexate plus tocilizumab 8 mg/kg had a higher rate of withdrawals due to adverse events than oral methotrexate (97%, 97%, and 95%), with the results narrowly failing to reach statistical significance. We found no statistically significant differences in pairwise comparisons between different biologic DMARDs (plus methotrexate). Methotrexate plus sulfasalazine plus hydroxychloroquine was associated with a statistically lower rate of withdrawals due to adverse events than methotrexate plus infliximab (rate ratio 0.26, 95% credible interval 0.06 to 0.91).
Methotrexate inadequate response patients
ACR50—Forty five trials with 12 549 patients were included in this analysis. Several treatments were superior to oral methotrexate for ACR50 response (table 3). The results reached statistical significance for the combination of methotrexate and several conventional synthetic DMARDs (sulfasalazine plus hydroxychloroquine, hydroxychloroquine, leflunomide, or intramuscular gold), methotrexate plus all biologic DMARDs with available evidence, and methotrexate plus tofacitinib. The estimated probability of an ACR50 response with triple therapy was 60.5% (95% credible interval 39.4% to 81.8%) and varied widely for other treatments (point estimate range 27-70%). We found no evidence for certolizumab, as the available trials were judged to be at high risk of bias. In general, the credible intervals in the pairwise comparisons between different treatment combinations were wide, although some estimates reached statistical significance (appendix C): methotrexate plus etanercept was superior to the combination of methotrexate plus most biologic DMARDs, and methotrexate plus sulfasalazine plus hydroxychloroquine was superior to methotrexate plus the biologic DMARDs intravenous abatacept, infliximab, and tocilizumab 4 mg/kg.
The quality of the evidence for triple therapy (methotrexate plus sulfasalazine plus hydroxychloroquine) versus methotrexate was judged to be moderate, as some minor inconsistencies existed in the findings of the two trials that compared triple therapy with methotrexate plus etanercept,6 8 and because the study design of one of the trials was judged to indirectly address the comparison of interest.8 This trial randomized patients at baseline to a step-up to triple therapy versus a step-up to methotrexate plus etanercept only if an inadequate response to methotrexate was found after six months.8
Radiographic progression—Ten trials with 3238 patients were included in this analysis. We found no statistically significant differences between any treatment and oral methotrexate, although the probability of superiority ranged from 76% (methotrexate plus subcutaneous golimumab) to 94% (methotrexate plus infliximab) (table 3). Similar to the analysis in methotrexate naive patients, the credible intervals were more precise in the post-hoc fixed effect model, resulting in several treatments that reached statistical significance relative to oral methotrexate (methotrexate plus abatacept (intravenous and subcutaneous), adalimumab, etanercept, intravenous golimumab, and infliximab) (appendix C). A 96% probability existed that methotrexate plus sulfasalazine plus hydroxychloroquine was superior to oral methotrexate in the fixed effects model (standardized mean difference −0.40, 95% credible interval −0.84 to 0.04).
Withdrawals due to adverse events—Fifty three trials with a total follow-up of 9950 patient years were included in this analysis. Methotrexate plus ciclosporin and methotrexate plus tocilizumab 8 mg/kg were the only treatments with statistically significant higher rates of withdrawals due to adverse events relative to oral methotrexate (table 3). In pairwise comparisons, methotrexate plus subcutaneous abatacept and methotrexate plus intravenous abatacept were associated with a statistically significant lower rate of withdrawals due to adverse events than several treatments, including methotrexate plus biologic DMARDs and methotrexate plus sulfasalazine plus hydroxychloroquine (appendix C).
GRADE quality assessment
Appendix D shows the overall quality of evidence for each outcome. The assessment of consistency in the evidence was limited, as few comparisons had both indirect and direct evidence available. The only comparison that was downgraded for inconsistency was the comparison of tocilizumab 8 mg/kg versus methotrexate in the analysis of ACR50 response in patients with inadequate response to methotrexate. The odds ratio for the direct evidence was 1.68 (0.62 to 4.56), compared with 4.15 (1.72 to 9.63) for the indirect evidence, with a P value of 0.17 for the difference between them. We did not detect evidence of publication bias, although this was difficult to assess, as we had too few trials for any direct comparison to permit formal tests of funnel plot asymmetry.
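The size of the disagreement between the two tocilizumab estimates can be checked roughly with a normal approximation on the log odds ratio scale, treating 4.15 (1.72 to 9.63) as the indirect estimate. This sketch is a frequentist approximation under assumed independence and symmetric intervals on the log scale. It is not the Bayesian node splitting calculation used in the review, so it should be expected to land near, not exactly on, the reported value of 0.17.

```python
import math

def log_se_from_interval(lower, upper, z=1.96):
    """Standard error of a log odds ratio recovered from its 95% interval."""
    return (math.log(upper) - math.log(lower)) / (2 * z)

def inconsistency_p_value(direct, indirect):
    """Two-sided P value for the difference between two log odds ratios.

    Each argument is a tuple of (odds ratio, lower limit, upper limit).
    """
    (or_d, lo_d, hi_d), (or_i, lo_i, hi_i) = direct, indirect
    diff = math.log(or_i) - math.log(or_d)
    se = math.hypot(log_se_from_interval(lo_d, hi_d),
                    log_se_from_interval(lo_i, hi_i))
    z = diff / se
    return math.erfc(abs(z) / math.sqrt(2))  # two-sided normal tail probability

p_value = inconsistency_p_value((1.68, 0.62, 4.56), (4.15, 1.72, 9.63))
print(f"approximate P value: {p_value:.2f}")
```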
Results for the minor outcomes are reported in detail in the full Cochrane review. Compared with oral methotrexate, some treatments had a statistically significant higher rate of certain adverse events: methotrexate plus sulfasalazine plus hydroxychloroquine and methotrexate plus sulfasalazine had a higher rate of total gastrointestinal events (excluding oral and liver toxicity) in the analysis of methotrexate naive patients (rate ratio 2.10 (95% credible interval 1.19 to 3.96) and 1.90 (1.18 to 2.99)), methotrexate plus leflunomide had a higher rate of alanine aminotransferase elevations in the analysis of patients with inadequate response to methotrexate (4.75, 1.16 to 20.70), and methotrexate plus tocilizumab 8 mg/kg had a higher rate of leukopenia in this analysis (16.25, 1.48 to 206).
In meta-regression analyses, we found a significant association between the odds ratio for ACR50 response and certain study level covariates (appendix C), but the adjusted treatment effects were largely unchanged (appendix C). When all studies (both methotrexate naive and methotrexate inadequate response patients) were included in the same network meta-analysis and the network assignment was specified with a meta-regression covariate, the odds ratios of methotrexate inadequate response trials were 2.05 (1.70 to 2.48) times higher.
Sensitivity analyses for ACR50 response
When we excluded studies with partial methotrexate exposure from the methotrexate naive analysis, the comparison of triple therapy against oral methotrexate for ACR50 response was no longer statistically significant (appendix C). The point estimate, however, was slightly higher than in the main analysis (favoring triple therapy) and higher than the comparison of any other treatment versus methotrexate. When we included trials at high risk of bias in the methotrexate inadequate response ACR50 response analysis, methotrexate plus certolizumab and subcutaneous/intramuscular methotrexate had a statistically significant higher odds of ACR50 response compared with oral methotrexate (appendix C). Little change in the point estimates for ACR50 response occurred at different time points of assessment, although the credible intervals were wider for several comparisons (appendix C). This supports the meta-regression results in which no association was found between trial duration and ACR50 response.
The rate ratios for withdrawals due to adverse events were similar to the odds ratios for both the methotrexate naive and methotrexate inadequate response analyses (appendix E, tables E1 and E2). For results with wide credible intervals, the point estimates for the rate ratios and odds ratios showed a larger difference, as expected, but this would not change the interpretation of the results. The choice of prior distribution around the between study variance had little effect on the posterior distribution of the between study variance (appendix E, table E3). The results of ACR50 response were very similar for the alternative uninformative prior and slightly more precise for the informative prior, but this would also not affect the interpretation of our results (appendix E, tables E4 and E5).
Our systematic review and network meta-analysis compared methotrexate and all currently used combinations of DMARDs with methotrexate. The main new finding from our review was that methotrexate plus sulfasalazine plus hydroxychloroquine (“triple therapy”) was superior to oral methotrexate for ACR50 response, in both methotrexate naive and methotrexate inadequate response populations. We found a statistically significant benefit for other conventional synthetic DMARD combinations compared with oral methotrexate, but only after an inadequate response to methotrexate, and the magnitude of effect or quality of evidence was graded lower than for triple therapy. Most biologic DMARDs, in combination with methotrexate, had a statistically significant benefit compared with oral methotrexate for ACR50 response in both methotrexate naive and methotrexate inadequate response populations. Importantly, the magnitude of effect for ACR50 response was higher for triple therapy than for most biologic DMARDs, and we did not find any statistically significant benefit for methotrexate plus biologic therapy compared with triple therapy. This has important policy implications given the difference in cost between these treatments. Methotrexate combined with adalimumab, certolizumab, etanercept, or infliximab had a statistically significant benefit for inhibiting joint damage compared with oral methotrexate, but the effect was small and observed only in methotrexate naive patients. Most treatments were well tolerated, although some combinations of treatments with methotrexate (azathioprine, ciclosporin, tocilizumab 8 mg/kg) had a statistically significant increase in the rate of withdrawals due to adverse events compared with oral methotrexate in either the methotrexate naive or methotrexate inadequate response populations.
Completeness and applicability of evidence
We did not evaluate the effect of glucocorticoids, which are known to have a disease modifying effect, particularly in early rheumatoid arthritis.32 We did not exclude trials with corticosteroids, however, so our findings relate to the effects of DMARD therapy independent of a corticosteroid effect. Our results should not be generalized to patients who have had an inadequate response to biologic therapy, as we did not include these trials. Our study was also not designed to directly compare treatment strategies. Specifically, we did not directly compare the approach of starting with methotrexate monotherapy in methotrexate naive patients and progressing to triple therapy versus the strategy of starting with triple therapy directly. The estimates of absolute risk with each treatment can help to inform this decision. On the basis of the included trials, about 40% of patients naive to methotrexate are expected to have an ACR50 response to oral methotrexate, compared with 60% for triple therapy. Patients may accept this difference in risk and choose methotrexate monotherapy as initial treatment, reserving combination therapy for use if they fail to respond adequately. Triple therapy was also associated with an increase in total gastrointestinal events in methotrexate naive patients, which may influence patients’ decisions.
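The absolute figures quoted above (about 40% versus 60% ACR50 response in methotrexate naive patients) can be turned into a few standard derived quantities. The sketch below is illustrative arithmetic only. The two proportions come from the text; the implied odds ratio is back-calculated from those rounded proportions and is not the estimate reported by the review, and the "patients per additional response" figure is a conventional number needed to treat calculation that the text itself does not report. All function names are hypothetical.

```python
def odds(probability: float) -> float:
    """Convert a probability to odds."""
    return probability / (1.0 - probability)


def risk_from_odds_ratio(baseline_risk: float, odds_ratio: float) -> float:
    """Absolute risk on treatment, given a baseline risk and an odds ratio."""
    treated_odds = odds(baseline_risk) * odds_ratio
    return treated_odds / (1.0 + treated_odds)


# Approximate ACR50 response proportions quoted in the text
# (methotrexate naive patients).
P_METHOTREXATE = 0.40
P_TRIPLE_THERAPY = 0.60

# Absolute difference in the expected proportion of responders.
risk_difference = P_TRIPLE_THERAPY - P_METHOTREXATE

# Patients treated with triple therapy instead of oral methotrexate
# for one additional ACR50 response (number needed to treat).
patients_per_additional_response = 1.0 / risk_difference

# Odds ratio implied by the two rounded proportions (an assumption-laden
# back-calculation, not the review's reported estimate).
implied_odds_ratio = odds(P_TRIPLE_THERAPY) / odds(P_METHOTREXATE)

print(f"risk difference: {risk_difference:.2f}")
print(f"patients per additional response: {patients_per_additional_response:.1f}")
print(f"implied odds ratio: {implied_odds_ratio:.2f}")
print(f"round trip check: {risk_from_odds_ratio(P_METHOTREXATE, implied_odds_ratio):.2f}")
```

The `risk_from_odds_ratio` helper shows the direction in which such figures are usually produced: a relative effect is applied to an assumed baseline risk to give an absolute risk that patients and clinicians can weigh.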
Strengths and weaknesses of review
This review included 158 trials with more than 37 000 patients. We used a rigorous approach to identification of trials and abstraction of outcomes, so we have confidence that the results encompass the best evidence from randomized controlled trials of the comparative benefits and harms of the treatments of interest.
The extent to which indirect evidence is considered can affect the results of a network meta-analysis.33 With our search strategy, we included all direct and first order indirect evidence comparing the treatments of interest; we did not attempt to capture all second order indirect evidence. The contribution of the indirect evidence to the overall estimate from the network meta-analysis decreases quite rapidly as the “order” of the comparison increases.33 In addition, most of the treatments that would potentially form second order indirect evidence were monotherapy with conventional synthetic DMARDs, for which the few trials that exist are generally small and have rarely measured ACR responses.32 We therefore expect a minimal effect of the exclusion of these trials.
An “early escape” design was common in trials of methotrexate plus biologic DMARDs and methotrexate plus tofacitinib, particularly in more recent trials. Although this allows trials to ethically include a placebo arm, it presents challenges in interpreting and synthesizing the results. The proportion of patients remaining on the control treatment at the end of the trial can be very low. We chose to extract end of trial data for the efficacy outcomes, considering patients who required rescue treatment to be treatment failures. We included a sensitivity analysis for ACR50 response using pre-rescue data and found few differences in treatment effects. Synthesizing adverse events in early escape trials is also challenging. Patients who cross over from placebo to active therapy often represent substantial patient years of exposure; excluding these patients may obscure important safety signals. We therefore chose to summarize all toxicity data as rate ratios with exposure adjusted estimates, using the on-treatment data from early escape trials. This could potentially bias the estimates, as patients who cross over may differ in certain ways from patients assigned to the original treatment. We believed this method to have less potential bias than excluding the patients who crossed over and excluding trials that reported only exposure adjusted data.
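An exposure adjusted rate ratio of the kind described above divides events by patient years on each treatment, so that patients who cross over contribute their on-treatment time to the active arm. The sketch below shows only this crude calculation for a single trial; it is not the Bayesian network model used in the review, and every count and name in it is hypothetical.

```python
def pooled_rate(events: list[int], patient_years: list[float]) -> float:
    """Events per patient year, pooling all on-treatment exposure segments."""
    return sum(events) / sum(patient_years)


def rate_ratio(active_rate: float, control_rate: float) -> float:
    """Ratio of exposure adjusted event rates (active versus control)."""
    return active_rate / control_rate


# Hypothetical early escape trial (illustration only, not data from the review).
# Active arm: patients randomized to active therapy, plus the on-treatment
# exposure of patients who crossed over from placebo.
active_events = [8, 4]               # randomized, crossed over
active_patient_years = [200.0, 100.0]

# Control arm: exposure accrued on placebo before any crossover.
control_events = [4]
control_patient_years = [150.0]

active_rate = pooled_rate(active_events, active_patient_years)
control_rate = pooled_rate(control_events, control_patient_years)
example_rate_ratio = rate_ratio(active_rate, control_rate)

print(f"active rate per patient year: {active_rate:.4f}")
print(f"control rate per patient year: {control_rate:.4f}")
print(f"rate ratio: {example_rate_ratio:.2f}")
```

Because the crossover segment is counted under the treatment actually received, it adds both events and exposure to the active arm rather than being discarded, which is the trade-off the paragraph above describes.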
Through meta-regression, we showed that treatments in trials of patients with an inadequate response to methotrexate were associated with odds ratios for ACR50 response that were twice as high as those in trials of methotrexate naive patients. Thus, previous response to methotrexate is a strong effect modifier of the clinical response, and pooling studies in methotrexate naive and methotrexate inadequate response patients will yield biased estimates that are difficult to relate to clinical practice. This supports our decision to analyze trials of methotrexate naive and methotrexate inadequate response patients separately. We used end of trial data for all outcomes, pooling studies with variable follow-up (from three months to two years). For ACR50 response, our results were robust to sensitivity analyses using six or 12 month data, and we did not detect an association between trial duration and treatment effects through meta-regression. We had too few trials available for radiographic progression to be able to do meaningful sensitivity analyses or meta-regression, so this may represent a source of heterogeneity.
Agreements and disagreements with other studies or reviews
Multiple network meta-analyses of biologic therapy in rheumatoid arthritis, including a Cochrane review, have been reported.4 34 To our knowledge, this is the first review that has systematically compared all methotrexate based conventional synthetic DMARD and biologic DMARD/tofacitinib treatment approaches. A previous network meta-analysis by Graudal and colleagues compared combination DMARD therapy with conventional synthetic DMARDs and biologic DMARDs for radiographic outcomes.35 Overall, Graudal found that one conventional synthetic DMARD plus one biologic DMARD was not superior to combination therapy with two or three conventional synthetic DMARDs for radiographic outcomes. Several important differences from our study exist. Firstly, we evaluated a range of outcomes beyond radiographic progression, covering multiple domains relevant to decision making. Secondly, Graudal grouped conventional synthetic DMARD combination therapy according to the number of drugs (two or three), whereas each biologic agent was considered separately. Grouping DMARD combinations that are commonly used with those that are rarely used (for example, combinations with bucillamine or auranofin) adds heterogeneity to the estimates and makes application of the results to clinical practice difficult. Trials in DMARD naive and DMARD inadequate response patients were also grouped, which, as we showed for ACR50 response, may bias the estimated treatment effects. Finally, Graudal included trials in which corticosteroids were part of the intervention (that is, applied differently between arms). The results therefore address a different research question.
Other traditional (non-network) systematic reviews have evaluated combination therapy with conventional synthetic DMARDs.5 36 37 The reviews differed in the outcomes considered and inclusion criteria, particularly around the inclusion and exclusion of interventions with corticosteroids. Combined withdrawal due to inefficacy or adverse events was used as the primary outcome for two of the reviews,5 37 as it is commonly reported. Trials are not designed around this outcome, however, and it does not allow the separation of benefits and harms necessary to inform clinical decisions.
In contrast to other systematic reviews, our review evaluated the risk of bias separately for different outcomes, which is the approach recommended by GRADE.38 We also used recently published GRADE guidance for grading the quality of the evidence.21 Although this approach requires subjective decisions, it should increase the transparency of these choices.
Implications for practice
On the basis of all available direct and indirect evidence, our results suggest that triple therapy (methotrexate plus sulfasalazine plus hydroxychloroquine) is effective in both methotrexate naive and methotrexate inadequate response patients and not statistically different from methotrexate plus biologic therapy for controlling disease activity. As triple therapy costs 10-20 times less than biologic therapy and is not currently recommended strongly by international guidelines, this has important policy implications. Specifically, our results suggest that triple therapy should be considered as a low cost, effective treatment option either as initial treatment or after an inadequate response to methotrexate.
What is already known on this topic
- Meta-analyses have shown that most biologic disease modifying antirheumatic drugs (DMARDs) combined with methotrexate are superior to methotrexate alone for controlling disease activity
- However, the benefits of combining methotrexate with conventional synthetic DMARDs are uncertain
- A Cochrane network meta-analysis of biologic treatments for rheumatoid arthritis showed some differences but did not compare biologic therapy against combination therapy with conventional synthetic DMARDs
- Understanding the comparative benefits and harms of these treatments is essential, given that biologic therapy costs 10-20 times more than most conventional synthetic DMARDs
What this study adds
- This Cochrane network meta-analysis included 158 trials with more than 37 000 patients
- “Triple therapy” (methotrexate plus sulfasalazine plus hydroxychloroquine) was superior to methotrexate alone and not statistically different from methotrexate plus any biologic DMARD or tofacitinib for controlling disease activity, either as initial therapy or after an inadequate response to methotrexate
- Given the low cost of triple therapy relative to biologic DMARDs and tofacitinib, these findings support a therapeutic trial of triple therapy as initial treatment or after an inadequate response to methotrexate
Contributors: GH developed the concept for the study, wrote the protocol, and participated in all stages of the study including literature search and data abstraction; he did all analyses and wrote and revised the manuscript. ChB edited and revised the protocol and manuscript and assisted with the literature search and data abstraction. GT edited and revised the protocol and manuscript and assisted with developing the concept and the analysis and interpretation of the data. DM edited and revised the protocol and manuscript and assisted with developing the concept for the study. DD edited and revised the manuscript and assisted with the data abstraction. ClB edited and revised the protocol and manuscript and assisted with developing the concept for the study. All authors had full access to all of the data (including statistical reports and tables) in the study and can take responsibility for the integrity of the data and the accuracy of the data analysis. GH is the guarantor.
Funding: Partial funding was provided by the Arthur J E Child chair in rheumatology outcomes research.
Competing interests: All authors have completed the ICMJE uniform disclosure form at www.icmje.org/coi_disclosure.pdf and declare: no support from any organization for the submitted work; GH has received fellowship funding supported by the Canadian Rheumatology Association, the Arthritis Society, and UCB Pharma, honorariums and travel expenses from Abbott, and honorariums from UCB Pharma and has participated in an advisory board meeting for Amgen; GH was supported by an Alberta Innovates health solutions clinical fellowship; ChB holds the Canadian Rheumatology Association/Arthritis Society clinician investigator salary award and is a Canadian Institutes of Health Research new investigator (community-based primary healthcare); in the past two years, ChB has participated in advisory boards for Roche and UCB and received honorariums from UCB and Amgen and an unrestricted travel grant from Celgene; DM is supported by a Canada Research chair in health systems and services research and an Arthur J E Child chair in rheumatology; in the past year, DM has received honorariums from Abbvie for a seminar; ClB has received grant support from Janssen, Pfizer, Amgen, Abbott/Abbvie Canada, BMS, Celgene, Eli Lilly, Fresenius Kabi, Hoffman La Roche, Sanofi, UCB, and Calea, has acted as a consultant for AstraZeneca, Abbott/Abbvie Canada, and BMS, and has participated in advisory board meetings for Janssen, Pfizer, Amgen, and AstraZeneca. ClB also holds a Pfizer chair and a Canada Research chair in knowledge transfer for musculoskeletal care; no other relationships or activities that could appear to have influenced the submitted work.
Ethics approval: Not needed.
Transparency declaration: The lead author (the manuscript’s guarantor) affirms that this manuscript is an honest, accurate, and transparent account of the study being reported; that no important aspects of the study have been omitted; and that any discrepancies from the study as planned (and, if relevant, registered) have been explained.
Data sharing: No additional data available.
This is an Open Access article distributed in accordance with the Creative Commons Attribution Non Commercial (CC BY-NC 3.0) license, which permits others to distribute, remix, adapt, build upon this work non-commercially, and license their derivative works on different terms, provided the original work is properly cited and the use is non-commercial. See: http://creativecommons.org/licenses/by-nc/3.0/.