- Peter C Gøtzsche, director (, )
- Helle Krogh Johansen, senior researcher
- Nordic Cochrane Centre, Rigshospitalet, Department 7112, Tagensvej 18 B, DK-2200 Copenhagen N, Denmark
- Correspondence to: Dr Gøtzsche
- Accepted 18 November 1997
Objective: To determine whether short term, oral low dose prednisolone (⩽15 mg daily) is superior to placebo and non-steroidal anti-inflammatory drugs in patients with rheumatoid arthritis.
Design: Meta-analysis of randomised trials of oral corticosteroids compared with placebo or a non-steroidal anti-inflammatory drug.
Setting: Trials conducted anywhere in the world.
Subjects: Patients with rheumatoid arthritis.
Main outcome measures: Joint tenderness, pain, and grip strength. Outcomes measured on different scales were combined by using the standardised effect size (difference in effect divided by SD of the measurements).
Results: Ten studies were included in the meta-analysis. Prednisolone had a marked effect over placebo on joint tenderness (standardised effect size 1.31; 95% confidence interval 0.78 to 1.83), pain (1.75; 0.87 to 2.64), and grip strength (0.41; 0.13 to 0.69). Measured in the original units the differences were 12 (6 to 18) tender joints and 22 mm Hg (5 mm Hg to 40 mm Hg) for grip strength. Prednisolone also had a greater effect than non-steroidal anti-inflammatory drugs on joint tenderness (0.63; 0.11 to 1.16) and pain (1.25; 0.26 to 2.24), whereas the difference in grip strength was not significant (0.31; −0.02 to 0.64). Measured in the original units the differences were 9 (5 to 12) tender joints and 12 mm Hg (−6 mm Hg to 31 mm Hg). The risk of adverse effects during moderate and long term use seemed acceptable.
Conclusion: Prednisolone in low doses (⩽15 mg daily) may be used intermittently in patients with rheumatoid arthritis, particularly if the disease cannot be controlled by other means.
Prednisolone in low doses—that is, no more than 15 mg daily—is highly effective in patients with rheumatoid arthritis
The risk of adverse effects is acceptable in short, moderate, or long term use
Oral low dose prednisolone may be used intermittently in patients with rheumatoid arthritis, particularly if the disease cannot be controlled by other means
Further short term placebo controlled trials to study the clinical effect of prednisolone or other oral corticosteroids are no longer necessary
Editorial by Dennison and Cooper
Corticosteroids were first shown to be effective in patients with rheumatoid arthritis in 1949 in an uncontrolled study.1 In 1959, a two year randomised trial showed that an initial dose of prednisolone 20 mg daily was significantly superior to aspirin 6 g daily.2 Important adverse effects were also noted, however, and the authors concluded that the highest acceptable dose for long term treatment was probably in the region of 10 mg daily.
Corticosteroids have received renewed interest in recent years because of their possible beneficial effect on radiological progression.3 Tendencies towards such an effect were noted both in the early trials and in a recent report.4
These findings are interesting, but oral corticosteroids are still being used mainly for their symptomatic effect—for example, for acute exacerbations of rheumatoid arthritis and as “bridge therapy” before slow acting drugs have taken effect.5 The effect of low doses has been variable, however, and was questioned as late as 1995 when the most recent trial of low dose steroids was published.6 We therefore performed a systematic review of randomised trials that compared corticosteroids, given at a dose equivalent to no more than 15 mg prednisolone daily, with placebo or with non-steroidal anti-inflammatory drugs. Our review is limited to the short term effect—that is, as recorded within the first weeks of treatment. In an analysis of the adverse effects of steroids, however, we also included long term trials and matched cohort studies.
All randomised studies that compared an oral corticosteroid with placebo or a non-steroidal anti-inflammatory drug in patients with rheumatoid arthritis were eligible if they reported clinical outcomes within 1 month after the start of treatment. When there were data from several visits, the data that came closest to 1 week of treatment were used for the analyses. We excluded studies with high dose steroids (exceeding an equivalent of 15 mg prednisolone daily); studies of combination treatments—for instance, of a steroid and a non-steroidal anti-inflammatory drug; and studies that used quasi-randomisation methods, such as allocation by date of admission or by toss of a coin (no such studies were actually found). The outcome variables were joint tenderness (usually Ritchie's joint index), pain, and grip strength.
Medline was searched from 1966 onwards and most recently updated in September 1997. We used the Explode option (which searches for a broad term plus related narrower items) for “glucocorticoids” or “glucocorticoids, -synthetic” (for all subheadings) combined with Explode “arthritis-rheumatoid” (for all subheadings) and with “placebos” or “comparative study” in MeSH. The reference lists were scanned for additional trials, and an archive in possession of one of the authors was searched. As most of the retrieved trials were very old and the steroid drugs were non-proprietary ones authors and companies were not asked about possible unpublished studies. We did not handsearch journals for relevant trials as this work is already being organised by the Cochrane Collaboration for all medical journals, including specialist rheumatological journals. The results of these handsearches are made available in the Cochrane Controlled Trials Register in The Cochrane Library,7 which we searched with prednisolone and prednisone as text words combined with rheumatoid.
Decisions on which trials to include were taken independently by two observers based only on the methods sections of the trials; disagreements were resolved by discussion. Details on the nature and dose of treatments, number of randomised patients, the randomisation and blinding procedures, and exclusions after randomisation were noted. When an outcome was measured on the same scale in all trials we calculated the weighted mean difference as the summary estimate for the effect. As the outcomes were often measured on different scales, however, even when they referred to the same quality—for example, tender joints—we also calculated standardised effect measures.8 With this method the difference in effect between two treatments is divided by the standard deviation of the measurements. By that transformation the effect measures become dimensionless, and outcomes from trials which have used different scales may therefore often be combined. As an example, the tender joint count may be recorded either as the number of tender joints or as Ritchie's index, in which each joint is scored on a scale from 0 to 3 for pain on firm palpation and the scores added. Often the two types of counts will give similar values, but if the patients have very severe disease Ritchie's index may be higher. The standard deviation will then also be higher, however, and by dividing the counts with their standard deviations (for example, of the baseline measurements) the effect sizes will be of the same magnitude.
The random effects model9 was used if P<0.10 for the test of heterogeneity; otherwise a fixed effects analysis was performed. As data from crossover trials were reported in only summary form, as if they had been generated from a group comparative trial, we analysed them accordingly. We therefore assumed that no important carryover effects had occurred.
Twenty eight randomised trials were initially identified, several of which had been published more than once. Eighteen trials were excluded for various reasons. 2 4 5 10-31 Nine trials did not fulfil the inclusion criteria for the meta-analysis: five had studied combinations of drugs 10 17-19 27 31; two used too high a dose 2 20-23; in one, 4 mg methylprednisolone was given to all the patients in the placebo group28; and one concerned patients with juvenile rheumatoid arthritis (this trial found prednisolone to be significantly better than placebo).24
The other nine excluded studies were potentially eligible for the meta-analysis. However, one was a five way crossover trial with a grossly unbalanced design—for instance, placebo was given to 9, 13, 3, 6, and 6 patients during weeks 1, 2, 3, 4, and 5, respectively.12 Because of regression towards the mean we found it inappropriate to include this trial. Another trial was also unbalanced as the steroid group was kept mobile whereas the control group received bed rest and splints for the inflamed joints.25 Two trials were too poorly reported to be usable for the meta-analysis, 15 16 26 and one reported only on joint size.29 Three of these four trials found prednisolone or prednisone to be significantly more effective than placebo; the fourth compared prednisolone and indomethacin and gave no numerical data but just reported that there was “no significant difference in response.”26 The four other excluded trials were long term studies that did not report short term data. 4 5 11 13 14 We contacted the authors of these studies to make sure that no short term data had been recorded without being reported. This was confirmed in two cases 4 11; we were unable to contact any of the authors of the other two studies or of the study that reported only joint size29 to ensure that no further variables had been recorded.
Ten studies were included in the meta-analysis (table 1). 6 32-43 Most of the studies were quite old and rather small. In all but one 35 36 the criteria of the American Rheumatism Association for classical or definite rheumatoid arthritis were fulfilled. Age, proportion of women, and duration of disease were reported in only half of the studies but they were typical for studies in rheumatoid arthritis: mean age was 55 years, two thirds were women, and the mean (range) duration of disease was 6 (2.1 to 9.6) years. As expected for patients enrolled in steroid trials the severity of the disease, expressed as number of tender joints or Ritchie's tender joint index, was quite pronounced (see fig 1). Prednisolone was used in six trials and prednisone in four. 6 32 34 40 41 43 As prednisone is equipotent with prednisolone and is a pro-drug of prednisolone we have used “prednisolone” as a general term throughout the paper. The doses were 2.5, 3.0, and 7.5 mg in one study each, 10 mg in three studies, and 15 mg in four. The median length of treatment was 1 week.
The randomisation method was not described in any of the trial reports but details were obtained from the authors for one of the studies in which the treatment allocation seemed to have been adequately concealed. 6 32 These authors also provided short term data from their long term trial. All studies were double blind apart from a single blind study in which the patients seemed to have been blinded. 40 41 Eight of the studies were of a crossover design but only one of them reported having tested for sequence effects.43 Apart from one study43 the tender joint count was recorded as Ritchie's index; pain was recorded on a ranking scale with 4 or 5 classes in two studies, 35 36 40 41 on a visual analogue scale in two studies, 6 32 33 and as a composite pain index in two studies. 38 43
The results of the meta-analysis are shown in figures 1 and 2. It should be noted that if prednisolone is better than control, the standardised effect size is negative for joint tenderness and pain but positive for grip strength.
Prednisolone had a clear effect over placebo on joint tenderness (standardised effect size −1.31; 95% confidence interval −1.83 to −0.78), pain (−1.75; −2.64 to −0.87), and grip strength (0.41; 0.13 to 0.69). Measured in the original units, in an analysis of the weighted mean difference the difference between prednisolone and placebo was 12 tender joints (95% confidence interval 6 to 18; test for heterogeneity χ2 46.42, df=6; P<0.00001). The effect on grip was always measured in mm Hg or in kPa. After conversion of kPa to mm Hg the superiority of prednisolone over placebo was 22 mm Hg (95% confidence interval 5 mm Hg to 40 mm Hg; test for heterogeneity χ2 5.47, df=5; P=0.36).
Prednisolone also had a greater effect than non-steroidal anti-inflammatory drugs on joint tenderness (−0.63; −1.16 to −0.11), pain (−1.25; −2.24 to −0.26), and grip strength, although the difference in grip strength was not significant (0.31; −0.02 to 0.64). Measured in the original units the difference between prednisolone and non-steroidal anti-inflammatory drugs was 9 tender joints (5 to 12; test for heterogeneity χ2 4.06, df=3; P=0.26). The effect on grip strength showed a non-significant superiority of prednisolone over non-steroidal anti-inflammatory drugs of 12 mm Hg (−6 mm Hg to 31 mm Hg; test for heterogeneity χ2 3.03, df=3; P=0.39).
Our meta-analysis has shown that low dose prednisolone is not only highly effective but also significantly more effective than non-steroidal anti-inflammatory drugs. The point estimate for the difference in effect between prednisolone and non-steroidal anti-inflammatory drugs on grip strength was 12 mm Hg. It is interesting that the point estimate for the difference in effect between non-steroidal anti-inflammatory drugs and placebo was also found to be 12 mm Hg in an earlier meta-analysis.44 It was not surprising that the difference in effect on grip strength between prednisolone and non-steroidal anti-inflammatory drugs was not significant as this effect measure is considerably less sensitive to change than pain and joint tenderness.45
We used a random effects model for some of the analyses because of heterogeneity. Which model to use is a matter of dispute among statisticians, but the results were not too different if analysed with a fixed effects model, which gave standardised effect sizes for prednisolone versus placebo of −1.23 (−1.51 to −0.95) for joint tenderness and −1.35 (−1.63 to −1.08) for pain, and for prednisolone versus non-steroidal anti-inflammatory drugs of −0.61 (−0.95 to −0.27) for joint tenderness and −0.97 (−1.32 to −0.63) for pain.
It is always important to try to explain heterogeneity. Our attempts to do so, however, have been rather unsuccessful. As most of the studies were done more than 20 years ago an obvious reason for the heterogeneity could be that the earlier trials had overestimated the effect—for instance, because of insufficiently concealed randomisation methods.46 The methodological quality of the trials was acceptable in the whole time span of nearly 30 years, however, and it was, for example, similar to the quality of comparative non-steroidal anti-inflammatory drug trials.47 In accordance with this there were no time trends for the differences in joint tenderness and pain between prednisolone and placebo. There was marginal heterogeneity (P=0.08) for the difference between prednisolone and non-steroidal anti-inflammatory drugs in joint tenderness, but the heterogeneity disappeared when the analysis was performed in the original units (P=0.26).
Blinding did not seem to have been important for heterogeneity. Only one trial was not double blind, and this trial did not yield larger effect estimates than the other trials. Small trials may exaggerate the effect because of publication bias. 48 49 This possibility could not be studied as the trials were all rather small and contributed similar weights to the meta-analysis. The effect was so pronounced, however, that it would have been unreasonable to plan large trials; in this respect steroid trials resemble trials of non-steroidal anti-inflammatory drugs that have also shown convincingly their superiority over placebo in small crossover trials.45 One would need to postulate that an unrealistically large number of unpublished trials existed that had shown no effect before the positive effect shown in our meta-analysis would become nullified.
An obvious cause for the heterogeneity could be varying degrees of concomitant treatment with additional non-steroidal anti-inflammatory drugs. Although sometimes stated in trial protocols, it may be difficult to ensure in practice that patients do not take additional drugs. As there was very sparse information on drug intake in the reports this possibility could not be evaluated. Another source could be the use of different measurement scales. Pain, for example, was measured on three different types of scale. They were all ranking scales, and we would therefore definitely have preferred to analyse pain with rank sum tests or as binary data after reduction of the level of measurement. The problem in analysing rank data with parametric methods is not only that they are often far from being normally distributed but also that we do not know the “distances” between the levels on the scale. As the original authors had used parametric statistics we decided to do so as well because our only other option was to discard the data.
Surprisingly, there was no clear relation between dose and effect despite the fact that the doses varied from 2.5 mg to 15 mg daily. It was not the aim of our review, however, to study dose-response relations, which are elucidated more reliably in studies where patients are randomised to different doses. A remarkable effect was seen in a study in which the average dose was only 3 mg daily but where the patients were allowed to start on 7.5 mg when they experienced flares of the arthritis and were advised to take nothing when they were well.43 This study suggests that it could be an advantage to take steroids intermittently, which would also diminish their adverse effects.
We could be criticised for including crossover trials for which we assumed but could not test that no important carryover effects had occurred. Our arguments for doing this were threefold. Firstly, it is not uncommon in statistical analyses to make necessary assumptions which cannot be properly tested in the data at hand—for example, in multiple regression analyses. Secondly, the problem with crossover trials is not only of a statistical nature, it also has an important ethical dimension. As crossover trials almost without exception are poorly reported and do not allow checks of the assumptions for this design,47 we would have to discard a vast amount of useful information in the literature in practically all areas of health care if we chose to behave as statistical purists. This would lead to much superfluous research being done, which is not in the best interest of patients or society. Thirdly, and most importantly, one would not expect carryover problems for drugs with relatively quick and reversible symptomatic effects such as steroids or non-steroidal anti-inflammatory drugs in patients with rheumatoid arthritis. In fact in a meta-analysis of non-steroidal anti-inflammatory drugs very similar results were obtained with the two trial designs.44 For these reasons we believe our approach is justified. Only two studies were of a group comparative design, and the heterogeneity we found could not be explained by type of design.
The titles of the included trials were generally quite uninformative and some of the them were not easy to find as they were performed within experiments designed to study other factors. Several of the studies were retrieved from an archive in possession of one of the authors assembled during work on a thesis50 before the electronic data searches were performed. The authors of the most recent study in this topic 6 32 had found only one of five trials comparing steroids with placebo in long term studies and none of the nine short term trials included in our review. These short term trials were described in 11 reports that were all indexed in Medline with the term for rheumatoid arthritis; in addition, all but one38 contained the terms for clinical trial or comparative study. Further, all nine trials were identifiable by using the search term “placebo*” and (“prednisone” or “prednisolone”). This illustrates the value of a systematic and careful search of the literature before starting new clinical trials, and funding bodies and ethical review committees should demand a systematic review of the relevant literature before approving of new clinical research.51
Recently, another meta-analysis of low dose corticosteroids (⩽15 mg prednisolone daily) in rheumatoid arthritis was published.52 This meta-analysis looked at moderate term effectiveness and focused on the outcome after 6 months; only two of the included trials were the same as in our meta-analysis. 6 32 43 These authors also noted heterogeneity, but they did not explore possible reasons for it or show the individual results for each trial; they only showed the combined result for each outcome. The weighted mean difference between steroid and placebo was surprisingly small, corresponding to only 2.4 tender joints (four trials, 95% confidence interval 0.3 to 4.6), while the standardised effect size of 0.90 (−0.18 to 2.00), although not significant, was more comparable to the one we found.
It is not easy to get a clear picture of the adverse effects of low dose steroids. Five of our short term studies did not report on side effects; one study reported that no side effects occurred38; two patients on prednisone had “subjective reactions” in one study34; and one patient developed acute psychosis while on prednisone in one study. 40 41 The two remaining studies were moderate term studies from which we extracted short term efficacy data. 6 32 43 These studies did not report short term side effects but are included in the analysis of moderate or long term adverse effects below.
The meta-analysis of moderate term low dose steroid trials did not examine adverse effects at all.52 The information in the most recently conducted two year placebo controlled trial is also sparse4; the aim of this study was to assess the progression of radiological damage, but films were taken only of the hands not of the lumbar spine, which could have detected any compression fractures. We reviewed moderate and long term randomised trials that had compared low dose steroids with placebo or a non-steroidal anti-inflammatory drug. We also identified cohort studies of rheumatoid arthritis that had compared patients treated with steroids with a matched, untreated control group. For this purpose we limited our broad search strategy to Explode “glucocorticoids, -synthetic” (adverse-effects) or Explode “glucocorticoids” (adverse-effects), combined with Explode “arthritis, -rheumatoid” (for all subheadings).
We found eight trials and two matched cohort studies (table 2). Spinal x ray photographs were taken of all patients in three of the trials; four fractures were detected in a total of 83 patients randomised to prednisolone and one in 75 patients randomised to placebo. In the five remaining trials, comprising a total of 193 patients taking prednisolone and 190 taking placebo or aspirin, only one fracture with prednisolone and one with placebo were reported. No cases of cataract were reported in the trials. One of the trials was highly atypical as the starting dose was 300 mg cortisone, equivalent to 60 mg prednisolone.20-22 Its high number of adverse effects may therefore not be representative.
One of the cohort studies used a survival-type analysis and found a large difference in time to first adverse event, with a total of 92 events in the steroid group and 31 in the untreated group.53 The risk of fracture increased with increasing doses: odds ratio 32.3 (95% confidence interval 4.6 to 220) for >10-15 mg prednisolone daily, 4.5 (2.1 to 9.6) for 5-10 mg, and 1.9 (0.8 to 4.7) for less than 5 mg daily. The overall risks for first event were 3.9 (0.8 to 18.1) for fracture, 8.0 (1.0 to 64.0) for infection, and 3.3 (0.9 to 12.1) for gastrointestinal bleed or ulcer. This study also included patients who received oral steroid “pulses,” which do not necessarily lead to the same incidence and severity of adverse effects as continuous low dose treatment. The other cohort study followed two groups of 122 patients for 10 years54. Fractures were noted in 31 versus 19 patients, osteonecrosis in 5 versus 2, and cataracts in 36 versus 22 (table 2).
The main problem with studies of matched cohorts is of course that the two groups can never be completely comparable as patients treated with steroids must be expected to be more severely affected than those not treated. This fact may escape notice by traditional measures of morbidity or the difference may be significant for one54 or more53 indicators of severity of disease, as in the two cohort studies we reviewed. It is noteworthy, for example, that the first study found a similarly increased risk for fractures as for ulcers,53 though five meta-analyses of around 100 randomised trials of steroids in various diseases have shown either no increase in risk or, at most, a marginally increased risk of ulcers, which lacks clinical significance.55 Another meta-analysis of 71 randomised trials, which looked at the risk of infectious complications, showed no increase in risk in patients given less than 10 mg prednisolone daily, and the relative risk for a mean dose under 20 mg was only 1.3 (1.0 to 1.6), which contrasts with the eightfold increased risk in the cohort study.56 Although the confidence intervals were wide in the cohort study, this illustrates the well known dangers of non-randomised comparisons.
Other treatments for rheumatoid arthritis—, that is, non-steroidal anti-inflammatory drugs and slow acting antirheumatic drugs—have important adverse effects, which may occasionally even be life threatening. We therefore suggest that short term prednisolone in low doses—, that is, not exceeding 15 mg daily—may be used intermittently in patients with rheumatoid arthritis, particularly if they have flares in their disease that cannot be controlled by other means. This suggestion is in accordance with a recent detailed review of the adverse effects of low dose steroids.57 As prednisolone is highly effective, short term placebo controlled trials to study the clinical effect of low dose prednisolone or other oral corticosteroids are no longer necessary. If additional relevant trials are performed in future—, for example, comparison of steroids with non-steroidal anti-inflammatory drugs—they will be included in the electronic version of this meta-analysis,58 which will be continuously updated.
We are grateful for the unpublished data provided by Anke van Gestel and Roland Laan.
Contributors: PCG wrote the draft meta-analysis protocol and the draft manuscript. HKJ commented on the drafts. Both authors contributed to selection of studies and extraction of data. PCG is guarantor for the study.
Funding: Danish Medical Research Council.
Conflict of interest: None.