Results

Alex R Kemper; Remy Coeytaux; Gillian D Sanders; Heather Van Mater; John W Williams; Rebecca N Gray; R Julian Irvine; Amy Kendrick

NCBI Bookshelf. A service of the National Library of Medicine, National Institutes of Health.

Kemper AR, Coeytaux R, Sanders GD, et al. Disease-Modifying Antirheumatic Drugs (DMARDs) in Children With Juvenile Idiopathic Arthritis (JIA) [Internet]. Rockville (MD): Agency for Healthcare Research and Quality (US); 2011 Sep. (Comparative Effectiveness Reviews, No. 28.)

This publication is provided for historical reference only and the information may be out of date.

Literature Search and Screening

Searches of all sources identified a total of 4815 potentially relevant citations. Table 3 details the number of citations identified from each source.

Table 3

Sources of citations.

Figure 2 describes the flow of literature through the screening process. Of the 4815 citations identified by our searches, 3998 were excluded at the abstract screening stage. Of the 817 articles that passed the initial abstract screening, 313 were gray literature articles that were excluded from further review. The remaining 504 articles went on to full-text screening. Of these, 306 were excluded, leaving a total of 198 included articles. Appendix F provides a complete list of articles excluded at the full-text screening stage, with reasons for exclusion.

Figure 2

Literature flow diagram.
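The counts at each screening stage can be checked with simple subtraction. The following is a minimal sketch that uses only the numbers stated in the text above; the variable names are illustrative and do not come from the report.

```python
# Counts reported in the text describing the literature flow (Figure 2).
identified = 4815                # citations found by all searches
excluded_at_abstract = 3998      # excluded at abstract screening
gray_literature_excluded = 313   # gray literature excluded from further review
excluded_at_full_text = 306      # excluded at full-text screening

# Each stage is the previous stage minus the exclusions at that stage.
passed_abstract = identified - excluded_at_abstract
full_text_screened = passed_abstract - gray_literature_excluded
included = full_text_screened - excluded_at_full_text

print(passed_abstract, full_text_screened, included)
```

The three derived values correspond to the 817 articles passing abstract screening, the 504 articles screened in full text, and the 198 included articles.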

Figure 3 summarizes the treatment comparisons evaluated in the included efficacy studies (Key Questions 1, 2, and 4). Six non-biologic DMARDs and seven biologic DMARDs have been compared to conventional treatment with or without methotrexate. Two different sets of non-biologic DMARDs have been directly compared (leflunomide vs. methotrexate and hydroxychloroquine vs. penicillamine), and two biologic DMARDs have been directly compared (etanercept vs. infliximab). Three of the biologic DMARDs that have been compared to conventional treatment were in the same class (TNF inhibitors: adalimumab, etanercept, and infliximab). However, study heterogeneity precluded meta-analysis of this combined class versus conventional treatment. Details on the number of studies describing each treatment comparison are provided under the relevant Key Question, below.

Figure 3

Treatment comparisons evaluated in efficacy studies.

Key Question 1. In children with JIA, does treatment with DMARDs, compared to conventional treatment, improve laboratory measures of inflammation or radiological progression, symptoms (e.g., pain, symptom scores), or health status (e.g., functional ability, mortality)?

Key Points

Among the non-biologic DMARDs, there is some evidence that methotrexate is superior to conventional therapy and oral corticosteroids.
Among children who have responded to a biologic DMARD, randomized discontinuation trials suggest that continued treatment for 4 months to 2 years decreases the risk of having a flare. Although these studies evaluated DMARDs with different mechanisms of action (abatacept, adalimumab, anakinra, etanercept, intravenous immunoglobulin [IVIG], tocilizumab) and used varying comparators, followup periods, and descriptions of flare, the finding of a reduced risk of flare was precise and consistent.
Conventional treatment has changed over time (e.g., use of oral corticosteroids in older studies of non-biologic DMARDs versus more frequent use of methotrexate in more recent studies of biologic DMARDs). Comparing the effectiveness of biologic and non-biologic DMARDs is challenging because of variations in comparators and how these comparators are described.
There is significant variation in outcome measures and how these outcome measures are reported.

Detailed Analysis

Literature Identified

We identified 20 publications describing 18 unique studies and involving 1532 patients that compared DMARDs to conventional treatments with or without methotrexate. Among these were 10 studies that evaluated seven biologic DMARDs (abatacept, adalimumab, anakinra, etanercept, infliximab, IVIG, and tocilizumab; see Table 4) and eight studies that evaluated five non-biologic DMARDs (azathioprine, penicillamine, hydroxychloroquine, methotrexate, and sulfasalazine; see Table 5).

Table 4

Studies comparing biologic DMARDs versus conventional treatments with or without methotrexate.

Table 5

Studies comparing non-biologic DMARDs versus conventional treatments with or without methotrexate.

There were 10 RCTs, of which four (described in five papers) were of good quality,¹¹⁻¹⁵ four were of fair quality,¹⁶⁻¹⁹ and two were of poor quality.²⁰,²¹ Key problems in the fair- and poor-quality studies included unclear methods of allocating to therapy, questionable blinding, and incomplete followup. There were two open-label comparison studies of poor quality.²²,²³ Six studies were randomized discontinuation studies, of which three (described in four papers) were of good quality,²⁴⁻²⁷ two were of fair quality,²⁸,²⁹ and one was of poor quality.³⁰

A detailed summary of these studies, by DMARD evaluated, is provided below.

There were no good-quality RCTs comparing biologic DMARDs to conventional therapy. There were two good-quality RCTs comparing methotrexate, a non-biologic DMARD, to conventional therapy.¹³,¹⁴ However, in both studies, each group could also receive oral corticosteroids, which are not currently considered conventional therapy. A single good-quality trial of sulfasalazine showed better short-term (24-week) outcomes than treatment with NSAIDs.¹⁵

Biologic DMARDs Versus Conventional Treatment With or Without Methotrexate

Abatacept

One good-quality randomized discontinuation study evaluated abatacept.²⁴ During the 6-month double-blind period of this study, there was statistically significant improvement compared to placebo in the active joint count (4.4 vs. 6; p = 0.02), CHAQ score (0.8 vs. 0.7; p = 0.04), physician global assessment (14.7 vs. 12.5; p < 0.01), and ACR Pediatric 90 (40 percent vs. 16 percent; p < 0.01). There was no statistically significant improvement in parent/patient global assessment (17.9 vs. 23.9; p = 0.70) or erythrocyte sedimentation rate (ESR; 25.1 vs. 30.7; p = 0.96).

Adalimumab

We found one good-quality randomized discontinuation trial that compared adalimumab to conventional therapy.²⁵ The results were stratified by use of methotrexate. At the end of the 48-week double-blind phase, among patients not receiving methotrexate, the proportion who had a flare of disease was lower with adalimumab than with conventional treatment (43 percent vs. 71 percent; p = 0.03); the same was true among patients receiving methotrexate (37 percent vs. 65 percent; p = 0.02). The proportion who achieved an ACR Pediatric 50 response was higher with adalimumab than with conventional treatment both among patients not receiving methotrexate (53 percent vs. 32 percent; p = 0.01) and among those receiving methotrexate (63 percent vs. 38 percent; p = 0.03). Among patients not receiving methotrexate, the proportion who achieved an ACR Pediatric 90 response was higher with adalimumab than with conventional treatment (30 percent vs. 18 percent), but the difference was not statistically significant (p = 0.28). Similarly, among patients receiving methotrexate, the proportion who achieved an ACR Pediatric 90 response was higher with adalimumab than with conventional treatment, but the difference did not achieve statistical significance (42 percent vs. 27 percent; p = 0.17).

Anakinra

One randomized discontinuation trial compared anakinra to conventional therapy.³⁰ This study was rated as poor in quality because it did not have sufficient statistical power to evaluate efficacy and because randomization and allocation concealment were insufficiently reported. The main goal of the study was to evaluate safety. By week 28 of blinded treatment, 16 percent who received anakinra and 40 percent who received placebo had had a flare (p = 0.11). There was improvement in the CHAQ score in the anakinra group compared to placebo (-0.25 vs. 0.13; no p-value reported). Similarly, there was improvement in the ESR among those who were treated with anakinra (-2.21 vs. 13.73; no p-value reported).

Etanercept

Two studies evaluated etanercept versus placebo. One good-quality randomized discontinuation trial evaluated children with a polyarticular course of JRA.²⁶ In the double-blind component, fewer patients who received etanercept had a flare (28 percent vs. 81 percent; p = 0.003). There was also an improvement in the CHAQ score (-0.8 vs. -0.1). Overall, there was a 54 percent median improvement among those who received etanercept compared to no median change in the placebo group. There was an overall improvement in the number of active joints (7 vs. 13; no p-value reported); physician global assessment (2 vs. 5; no p-value reported); parent global assessment (3 vs. 5; no p-value reported); ESR (18 vs. 30; no p-value reported); and the proportion who achieved ACR Pediatric 50 (72 percent vs. 23 percent; no p-value reported).

The other study of etanercept was a fair-quality RCT that evaluated efficacy for the treatment of uveitis.¹⁶ This study had a small sample size. During the study, 6 of 12 in the test treatment arm and 2 of 5 in the conventional treatment arm improved. This was described by study investigators as no apparent difference.

Infliximab

One fair-quality RCT compared infliximab to conventional treatment.¹⁷ This study inconsistently and incompletely reported outcomes. The study did not find statistically significant differences between infliximab and conventional treatment in the ACR Pediatric 50 at 14 weeks (50 percent vs. 33.9 percent, respectively; p = 0.13) or the rate of clinical remission at 52 weeks (44.1 percent vs. 43.1 percent, respectively).

IVIG

Three studies compared IVIG to conventional treatment. One small (19 total in the double-blind phase), fair-quality, randomized discontinuation trial²⁸ found a 3 percent decrease in the active joint count among those who were treated compared to a 30 percent increase in the placebo group. Physician global assessment improved for 3 percent of patients in the treatment group and worsened for 91 percent in the placebo group. This study used a main outcome measure that has not been validated and provided no statistical significance testing; there was also a potential conflict of interest with the study sponsor.

Another study²² compared IVIG to methylprednisolone. This study was considered to be of poor quality because it was open-label and non-randomized, analyses were not adjusted for baseline differences, and the sample was not adequately described. Investigators found no statistically significant difference between the IVIG and methylprednisolone groups for ESR (59 at baseline and 21 at 6 months vs. 61 at baseline and 24 at 6 months, respectively).

A small RCT²⁰ found that IVIG compared to conventional therapy was associated with a non-statistically significant improvement in the median change in active joint count (-2 vs. -1) and in physician global assessment of improvement (50 percent improvement vs. 27 percent improvement; p > 0.3). This study was considered to be of poor quality because of the small sample size and high dropout rate.

Tocilizumab

One fair-quality randomized discontinuation trial evaluated tocilizumab.²⁹ The screening and randomization procedures were not described. No p-values were reported for the outcomes of interest in this review. From the RCT component, the active joint count in the tocilizumab group decreased from 3.5 to 0. Similarly, in the conventional treatment group it decreased from 4 to 0. There was improvement in the CHAQ score for each group (-0.5 vs. -0.25). Both physician global assessment (51.0 to 5.5 vs. 51 to 14) and parent global assessment (51.0 to 4.5 vs. 55 to 39) improved. The ESR decreased for both the tocilizumab and conventional treatment group (35 to 0.1 vs. 38 to 15). The ACR Pediatric scores were reported graphically. The ACR Pediatric 70 increased in the tocilizumab group from approximately 70 percent to approximately 80 percent, but decreased in the conventional treatment group from approximately 80 percent to approximately 30 percent.

Meta-Analysis of Randomized Discontinuation Trials

Randomized discontinuation trials include only patients who initially responded to a treatment and primarily assess the risk of worsening when treatment is withdrawn. These studies evaluate sustainability of treatment effects and not the potential treatment effect among those who have not yet begun treatment. The randomized discontinuation trials identified by our search evaluated only biologic DMARDs (abatacept, adalimumab, anakinra, etanercept, IVIG, tocilizumab).

Four of the trials reported flare of arthritis,²⁴⁻²⁶,³⁰ allowing us to calculate a summary measure of the risk of flare over the 4-month to 2-year durations of the studies. Other outcomes were too heterogeneous or were reported too incompletely to calculate a summary estimate. Although there were differences in the interventions, comparators, and duration of followup among the four studies, we found very little statistical heterogeneity. Figure 4 summarizes the risk ratio (RR) for flare (with 95 percent confidence interval [CI]) based on a random-effects model. Overall, the RR for having a flare among those who continued compared to those who discontinued was 0.48 (95 percent CI 0.36 to 0.63) over 4 months to 2 years. Although there is heterogeneity in study design, the RR for having a flare was similar across all studies (χ² = 3.18, df = 3, p = 0.36; I² = 6 percent). This suggests that among those who respond to a biologic DMARD, there is a significant risk of flare after discontinuation. There was insufficient evidence regarding the efficacy of the biologic DMARDs from the other studies that compared these treatments to conventional therapy with or without methotrexate.

Figure 4

Comparison of symptomatic flares in children with JIA randomized to continuing a biologic DMARD versus placebo. Flares are listed as “Events” in the figure.
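The heterogeneity statistics reported for the meta-analysis of flare risk can be reproduced from the χ² value and its degrees of freedom. The sketch below assumes the standard definition I² = max(0, (Q - df) / Q) and uses the closed-form upper tail of the chi-square distribution for 3 degrees of freedom. The function names are illustrative, and the report does not state which software or formulas were used.

```python
import math

def i_squared(q, df):
    """Percentage of variability attributed to heterogeneity, floored at zero."""
    return max(0.0, (q - df) / q) * 100.0

def chi2_upper_tail_df3(x):
    """Upper-tail probability of a chi-square variable with exactly 3 degrees of freedom."""
    return math.erfc(math.sqrt(x / 2)) + math.sqrt(2 * x / math.pi) * math.exp(-x / 2)

# Values reported in the text: chi-square = 3.18 with df = 3.
Q, DF = 3.18, 3
i2 = i_squared(Q, DF)
p_heterogeneity = chi2_upper_tail_df3(Q)

print(f"I^2 = {i2:.1f} percent, p = {p_heterogeneity:.3f}")
```

Under these assumptions the results agree, after rounding, with the reported I² of 6 percent and p-value of 0.36.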

Non-Biologic DMARDs Versus Conventional Treatment With or Without Methotrexate

Azathioprine

One fair-quality RCT evaluated azathioprine.¹⁸ Allocation was not specified; there were baseline differences between those who received and did not receive azathioprine; it was unclear if outcomes were assessed blinded to the intervention status of subjects; and the outcomes were not well described. At 16 weeks of treatment, this study found non-statistically significant improvements with azathioprine in the number of active joints (-7 vs. -1; p = 0.45), physician global assessment (-5 vs. -2; p = 0.12), and the proportion with 50 percent improvement in ESR (4/13 subjects vs. 2/11 subjects; p = 0.36).

Hydroxychloroquine

Two RCTs evaluated hydroxychloroquine. One (described in two publications¹¹,¹²) found no significant difference in the change in mean active joint count compared to placebo after 12 months (-6.7 [95 percent CI -9.4 to -4] vs. -5.4 [-8 to -2.8]). The physician global assessment appeared slightly better for hydroxychloroquine than for placebo (70 percent better, 26 percent same, 2 percent worse compared to 53 percent better, 41 percent same, 6 percent worse; no p-value reported). There was no difference in the mean ESR decrease at 12 months (10 each).

The other study was an open-label RCT that compared hydroxychloroquine to gold.²¹ This study was considered to be of poor quality because allocation concealment was not specified, there were important baseline differences between the treatment groups, it was unclear if outcomes were assessed blinded to the intervention, and the outcomes were not well described. At 50 weeks, there were no statistically significant differences in the active joint count (-4 vs. -5), median change in the physician global assessment (-8 vs. -9), or change in the ESR (-12 vs. -11). Similarly, the physician overall assessment of at least 50 percent improvement was not statistically significantly different between the hydroxychloroquine group and the gold group (12 of 17 improved vs. 10 of 15 improved, respectively).

Methotrexate

Three studies compared methotrexate to conventional treatment without methotrexate. One good-quality RCT compared low-dose methotrexate, very low-dose methotrexate, and placebo in a 6-month trial.¹³ The mean active joint count decreased with low-dose methotrexate (-7.5), very low-dose methotrexate (-5.2), and placebo (-5.2; p > 0.3 overall). Physician global assessment improved with low-dose methotrexate compared to placebo (p = 0.02), but there was no statistically significant difference between the low-dose and very low-dose methotrexate groups for this outcome (p = 0.06). Based on a composite index with at least 25 percent improvement in articular score and improvement according to physicians and parents, 63 percent of those in the low-dose methotrexate group improved, compared to 32 percent in the very low-dose methotrexate group and 36 percent in the placebo group (p = 0.013).

Another good-quality study¹⁴ compared methotrexate to placebo among children with extended oligoarticular JIA or systemic JIA in a double-blind RCT with crossover. Among those with oligoarticular JIA, there was statistically significant improvement in physician global assessment (p < 0.001) and ESR (p < 0.001) with methotrexate. The change in the number of joints with synovitis (-3) did not achieve statistical significance (p < 0.1). Similarly, among those with systemic JIA, there was improvement in physician global assessment (p < 0.001), but not in ESR (p = 0.06) or in the number of joints with synovitis (p = 0.06) in patients taking methotrexate.

A poor-quality, non-randomized study compared methotrexate to NSAIDs and to methylprednisolone.²³ In this study, the active joint count improved more in the methylprednisolone group than in either the methotrexate or NSAID groups (-7.1 vs. -4 vs. -0.8, respectively; p = 0.008). This study, however, had confounding by indication; the analysis did not adjust for potential confounders; outcomes were not assessed blinded to the treatment condition; and patients were not blinded to their treatment assignments.

Penicillamine

Four publications describing three distinct studies evaluated penicillamine. One good-quality RCT (described in two publications¹¹,¹²) found no statistically significant effect on the mean active joint count with penicillamine compared to placebo after 12 months (-3 [95 percent CI -4.8 to -1.1] vs. -5.4 [-8 to -2.8]); results were similar for physician global assessment (56 percent better, 28 percent same, 16 percent worse vs. 53 percent better, 41 percent same, 6 percent worse) and mean decrease in ESR (9.4 vs. 10).

A fair-quality RCT¹⁹ found no statistically significant effect on ESR in a 6-month study in patients treated with penicillamine compared to conventional treatment (-18 vs. -8). However, this study did find a statistically significant decrease in the number of painful joints in patients taking penicillamine (-3 vs. -1.6; p < 0.04). This study was of fair quality because the patients in the placebo group may have had worse disease.

A poor-quality, open-label RCT²¹ found no statistically significant effect for penicillamine compared to gold at 50 weeks in the active joint count (-2.5 vs. -5), median change in the physician global assessment (-7.5 vs. -9), change in ESR (-8 vs. -11), or the proportion of patients who had at least a 50 percent improvement based on physician assessment (8/12 vs. 10/15).

Sulfasalazine

One good-quality RCT evaluated sulfasalazine versus placebo.¹⁵ In this study, it was unclear which time points were compared. However, there was statistically significant improvement with sulfasalazine in active joint count (-5.54 vs. -0.78; p = 0.005), physician global assessment (-1.95 vs. -0.99; p = 0.0002), patient/parent global assessment (-0.98 vs. -0.44; p = 0.01), and decrease in ESR (-0.74 vs. -0.04; p < 0.001). The number of improved joints by x-ray findings was not statistically significantly different (0.71 vs. 0.53).

Key Question 2. In children with JIA, what are the comparative effects of DMARDs on laboratory markers of inflammation or radiological progression, symptoms (e.g., pain, symptom scores), or health status (e.g., functional ability, mortality)?

Key Point

There are few direct comparisons of DMARDs in children with JIA, and insufficient evidence to determine if any specific drug or drug class has greater beneficial effects.

Detailed Analysis

Literature Identified

We identified six reports describing five unique studies and involving 520 patients that directly compared various DMARDs with one another (Table 6). Among these studies were one that compared two biologic DMARDs (etanercept and infliximab) and four that compared various non-biologic DMARDs (penicillamine, hydroxychloroquine, leflunomide, methotrexate, and sulfasalazine). A detailed summary of these studies, by treatment comparison, is provided below. Of the five studies, one was an open-label, non-randomized comparison, and the rest were RCTs. However, only two of the studies were considered to be of good quality (one comparing penicillamine to hydroxychloroquine and another comparing leflunomide to methotrexate in a non-inferiority design study); the rest were poor in quality.

Table 6

Studies comparing various DMARDs with one another.

Comparisons of Biologic DMARDs

Etanercept vs. Infliximab

One poor-quality, non-randomized, open-label study compared etanercept to infliximab.³¹ This study was considered to be of poor quality because drug switching made it hard to interpret findings, few data were provided about the subjects, and assessment was not blinded to therapy. In addition, a total of 6 of the 24 subjects did not complete the study. Among the 10 receiving etanercept, 1 was withdrawn for non-compliance. Among the 14 receiving infliximab, 4 withdrew because of adverse events and 1 withdrew because of failure to reach the ACR Pediatric 50. After 12 months of treatment, the change in active joint count was similar between etanercept (-9.5 [95 percent CI -19 to -3]) and infliximab (-11.5 [95 percent CI -17 to -7.5]). Results were also similar in the two treatment groups for changes in the CHAQ score (-0.81 vs. -0.31; p = 0.12), physician global assessment (-29 vs. -35; p = 0.65), patient/parent global assessment (-24.5 vs. -27.5; p = 0.81), ACR Pediatric 75 (67 percent each), ACR Pediatric 50 (78 percent vs. 89 percent; p-value not reported, but calculated as 0.53), and ESR (-28.5 vs. -25; p = 0.37).
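The text does not say how the p-value of 0.53 for the ACR Pediatric 50 comparison was calculated. One calculation that gives this value is a Pearson chi-square test without continuity correction, sketched below. It rests on two assumptions that the text does not state: that the percentages refer to the 9 completers in each arm (10 minus 1 for etanercept, 14 minus 5 for infliximab), and that 78 percent and 89 percent correspond to 7 of 9 and 8 of 9 responders. The function name is illustrative.

```python
import math

def pearson_chi2_2x2(a, b, c, d):
    """Pearson chi-square (no continuity correction) and p-value for the table [[a, b], [c, d]]."""
    n = a + b + c + d
    chi2 = n * (a * d - b * c) ** 2 / ((a + b) * (c + d) * (a + c) * (b + d))
    # With 1 degree of freedom, the upper-tail probability is erfc(sqrt(chi2 / 2)).
    p = math.erfc(math.sqrt(chi2 / 2))
    return chi2, p

# Completers per arm, derived from the withdrawals described in the text.
etanercept_completers = 10 - 1
infliximab_completers = 14 - 4 - 1

# ASSUMED responder counts: 7 of 9 (78 percent) and 8 of 9 (89 percent).
chi2, p_value = pearson_chi2_2x2(7, etanercept_completers - 7, 8, infliximab_completers - 8)

print(f"chi-square = {chi2:.2f}, p = {p_value:.2f}")
```

With samples this small, an exact test would usually be preferred and would give a different p-value, so the agreement here only shows that these assumptions are consistent with the reported figure.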

Comparisons of Non-Biologic DMARDs

Penicillamine vs. Hydroxychloroquine

Two publications¹¹,¹² described a good-quality RCT that compared penicillamine and hydroxychloroquine to placebo (results described above, under Key Question 1) and to one another. At 12 months, neither active drug was superior to the other based on active joint count, ESR, or physician global assessment.

One poor-quality, open-label RCT²¹ compared hydroxychloroquine and penicillamine to gold (results described above, under Key Question 1) and to one another. At 50 weeks, there were no significant differences between the two DMARDs in active joint count, physician global assessment, or ESR.

Sulfasalazine vs. Hydroxychloroquine

One poor-quality RCT compared sulfasalazine to hydroxychloroquine.³² This study was considered to be of poor quality because there was an inadequate description of the subjects, it was unclear if the study was blinded, and many of the outcomes were not validated. After 6 months, the average number of affected joints decreased by 1.5 in the sulfasalazine group and by 0.6 in the hydroxychloroquine group (no p-value reported). During this time, the ESR decreased in both the sulfasalazine group (52.7 to 36.3; no p-value reported) and the hydroxychloroquine group (41.2 to 28.9; no p-value reported). Physician global assessment (9 better, 9 worse, 3 no effect for sulfasalazine vs. 8 better, 3 worse, 7 no effect for hydroxychloroquine; no p-value reported) and patient global assessment (10 better, 7 worse, 3 no effect for sulfasalazine vs. 7 better, 5 worse, 3 no effect for hydroxychloroquine; no p-value reported) were similar in the two groups.

Leflunomide vs. Methotrexate

One good-quality RCT compared leflunomide to conventional treatment with methotrexate.³³ This 16-week study with a 32-week blinded extension found improvements in both groups. The active joint count decreased for the leflunomide and conventional treatment groups (-8.1 vs. -8.9; p = not significant). Similarly, in both groups there were improvements in the CHAQ score (-0.44 vs. -0.39; p = not significant), physician global assessment (-31.5 vs. -32.1; p = not significant), parent global assessment (-15.9 vs. -22; p = not significant), and ESR (-6.5 vs. -7.2; p = not significant). As the trial proceeded, the methotrexate group appeared to have a greater improvement in the proportion of patients who had an ACR Pediatric 30, Pediatric 50, or Pediatric 70 response. For example, 70 percent of the leflunomide group and 83 percent of the methotrexate group achieved an ACR Pediatric 70 response at 48 weeks (compared with 16 weeks). The improvement was not statistically significant for either the leflunomide (p = 0.01) or methotrexate (p = 0.06) groups. No statistical comparison was made between the two groups.