Critical appraisal of company submissions

Tracey Jones-Hughes; Tristan Snowsill; Marcela Haasova; Helen Coelho; Louise Crathorne; Chris Cooper; Ruben Mujica-Mota; Jaime Peters; Jo Varley-Campbell; Nicola Huxley; Jason Moore; Matt Allwood; Jenny Lowe; Chris Hyde; Martin Hoyle; Mary Bond; Rob Anderson

NCBI Bookshelf. A service of the National Library of Medicine, National Institutes of Health.

Jones-Hughes T, Snowsill T, Haasova M, et al. Immunosuppressive therapy for kidney transplantation in adults: a systematic review and economic model. Southampton (UK): NIHR Journals Library; 2016 Aug. (Health Technology Assessment, No. 20.62.)

Immunosuppressive therapy for kidney transplantation in adults: a systematic review and economic model.

Show details

Contents

< Prev Next >

Chapter 5Critical appraisal of company submissions

Three companies submitted economic models to NICE: Astellas, Novartis and Bristol-Myers Squibb.

Astellas’ submission

Overview

The submission compared twice-daily immediate-release TAC (Prograf) with once-daily TAC-PR (Advagraf), and against BEL, EVL and SRL. Immediate-release TAC was considered to be the standard treatment of choice in adult renal transplantation immunosuppression, based on its UK market share, whereas the comparators investigated were deemed to be used infrequently. The submission cites evidence of improved outcomes for TAC-PR relative to the current standard regimen, immediate-release TAC, since the former became available in 2009. In addition, EVL was included in the evaluation despite its lack of market authorisation in the UK, as requested by the NICE scope.

The analysis found that immediate-release TAC resulted in reduced total costs and health benefits relative to the comparators, EVL and BEL; it was concluded that TAC-PR is cost-effective and should be the new standard of care. Although the health benefits of immediate-release TAC were found insufficient to compensate for its increased cost relative to SRL, the latter regimen was consider to apply to only a selected subgroup of adults receiving a kidney transplant.

The submission pointed to evidence on the relationship between treatment adherence and acute and long-term graft rejection, and graft failure as surrogate markers of outcomes. In particular, it stated that adherence to immunosuppressant regimens positively affects graft survival by preventing the development of de novo donor-specific antibodies, which have been associated with a reduction in 10-year graft survival.³⁴² This is then used to translate the observed improvement in adherence with PR TAC relative to immediate-release TAC into graft and patient survival benefits.³⁴³ In addition, the company claims that TAC-PR has a better pharmacokinetic profile than twice-daily TAC (lower intrapatient variability,³⁴⁴ which results in a lower risk of long-term graft failure.³⁴⁵ The company also cites analyses from the Collaborative Transplant Study for Europe presented at the 2014 World Transplant Congress, which shows that people who are treated with TAC-PR had higher patient and graft survival rates than people treated with immediate-release TAC over 12 months, following renal transplantation, in Collaborative Transplant Study data for 2011–13. However, this observation was not robust to the adjustment for multiple confounders [hazard ratio (HR) 0.76; p = 0.14; 95% CI were not stated].

The submission also cites the results of a meta-analysis pointing to increased risk of PTDM with TAC [relative risk (RR) at 12 months 1.72, 95% CI 1.17 to 2.52; RR at 36 months 2.71, 95% CI 1.61 to 4.57] relative to CSA, and acknowledges the evidence on the association between PTDM and reduced graft survival (RR 1.63, 95% CI 1.46 to 1.84).³⁴⁶ The company argues that these estimates may have been the result of people treated with high doses of TAC relative to current practice. To support this claim the submission cites the results of a Phase III study²⁰⁴ comparing TAC-PR with immediate-release TAC, which used lower doses of TAC and found lower incidence rates of PTDM than those in the studies included in the meta-analysis report. It is noted, however, that the latter evidence had no bearing on the meta-analysis finding of a higher RR of PTDM with TAC than CSA.

Efficacy and effectiveness evidence

The submission reports a systematic review of the RCT evidence of effectiveness of immunosuppression after first kidney-only transplant. The review involved an electronic search of bibliographic databases covering studies published during the period 2002–June 2014, and was complemented by relevant studies from two published reviews.⁶⁵^,³⁴⁷

Based on 6-month and 1-year pooled data from 19 RCTs including 3796 people, immediate-release TAC had a lower rate of BPAR than CSA ME (RR 0.69, 95% CI 0.57 to 0.82). However, based on data from 10 studies that reported the outcome in 1859 people, immediate-release TAC resulted in higher incidence of PTDM (1.57, 95% CI 1.16 to 2.12). In terms of other outcomes (graft survival, patient survival and death-censored graft survival) differences were found not to be statistically significant at the 5% level.

Pooled-effect estimates for immediate-release TAC, compared with SRL given as a CNI avoidance regimen, were obtained from four RCTs of 6–12 months’ follow-up involving 1397 people. Neither patient survival nor PTDM differed in statistically significant manner between the arms, whereas SRL produced a higher risk of developing AR (RR 2.28, 95% CI 1.37 to 3.79) and lower survival probability (RR 0.95, 95% CI 0.92 to 0.98). In the SRL CNI minimisation regimen, two studies were found, involving 461 people in the comparison of immediate-release TAC/SRL and steroids with immediate-release TAC/MMF and steroids. No differences were found in patient and graft survival, ARs and PTDM at 6–12 months post transplant, whereas more discontinuations were found in the former arm.

For the comparison between TAC-PR and CSA ME, the submission cites one multicentre study that compared these two options and an immediate-release TAC option, all in combination with MMF and steroids. The study found similar efficacy across the three treatment arms in terms of patient and graft survival and AR but there is no measure of uncertainty reported alongside the event rates presented.

Astellas presents results from its own meta-analysis of two studies comparing immediate-release TAC with TAC-PR for de novo kidney transplantation in terms of BPAR stratified for people with (RR 1.16, 95% CI 0.82 to 1.63) and without (RR 1.28, 95% CI 0.98 to 1.68) induction. It cites results of a published meta-analysis that included observational data³⁴⁸ as consistent with the claim that TAC-PR is as effective as immediate-release TAC in preventing BPAR and graft failure at 12 months post kidney transplantation.

For the TAC minimisation versus EVL comparison, no difference in patient and graft survival at 6–12 months was found in three studies involving 358 people (RR 1.01). The submission also cites results from the ASSET trial²⁵⁶ regarding a higher 12-month rate of BPAR (RR 2.19, 95% CI 0.20 to 23.77) with a low-dose TAC with EVL regimen versus standard-dose TAC with EVL regimen (both regimens were given from 3 months post transplantation after an initial 3-month regimen of standard TAC).²⁵⁶ For the comparison of TAC withdrawal with EVL introduction versus the continuation of an initial 3-month regimen of TAC, MPS and steroids, one study²⁵⁹ was cited as reporting no graft failure or patient death in either group at 12 months; renal function, as measured by eGFR of 53.38 ml/minute/1.73 m² in the TAC continuation group and 57.27 ml/minute/1.73 m² in the EVL group (p = 0.25); and no BPAR case in the TAC group and 17.5% incidence in the EVL group (RR 0.05, 95% CI 0.00 to 0.79). Given the absence of RCTs of TAC compared with EVL, Astellas estimated their relative effects indirectly from head-to-head studies of EVL plus low-dose CSA compared with standard CSA (two studies, reporting RR ratios between 0.98 and 1.01 for AR, graft and patient survival outcomes at 3–12 months) and studies of TAC compared with CSA.

Likewise for TAC compared with BEL, estimates were obtained from indirect comparisons, through studies of each of these regimens against CSA. The TAC studies have been described in this section. As for BEL, data from two Phase III trials with 3-year follow-up data were used for the indirect comparison: one included adults receiving a living donor or standard criteria deceased donor kidney (BENEFIT study⁵⁹) and the other was a study of similar design but included ECDs (BENEFIT-EXT study¹⁴²). The company presented separate and combined results of analyses of 1-year data from both trials stratified by a more-intensive and a less-intensive BEL regimen. In general, BEL was found to have higher BPAR rates, less chronic allograft nephropathy (for the more intensive BEL regimen) and improved renal function over CSA. BEL also reduced the incidence of NODAT.

Combining up to 1-year results from BENEFIT⁵⁹ and BENEFIT-EXT,¹⁴² the meta-analysis of immediate-release TAC compared with CSA (number of studies: AR 19, graft survival 11, patient survival 10, WMD in GFR, 2), and outcomes of TAC-PR compared with CSA from the Phase III trial reported by Silva et al.,²³⁹ TAC-PR was found to result in a lower ARR (RR 0.24, 95% CI 0.12 to 0.51) and lower WMDs in GFR (MD –10.50, 95% CI –16.57 to –4.43) than both the more-intensive and less-intensive BEL regimens.³⁴⁹ The company also cites the results of an indirect comparative analysis conducted by Bristol-Myers Squibb, which showed ‘no significant difference’ between BEL and TAC for mortality, graft loss or GFR at 12 and 36 months (All Wales Medicines Strategy Group 2012)³⁵⁰ and higher ARR and lower incidence of NODAT for BEL than for TAC.

Another indirect comparison by Astellas produced estimates of AR, graft survival and patient survival for immediate-release TAC relative to EVL. The RR ratios were, respectively, 0.70 (95% CI 0.48 to 1.03), 0.97 (95% CI 0.93 to 1.03) and 0.98 (95% CI 0.95 to 1.02).

Review of economic models and their results in the submission

The submission provides an overview of model structures and conclusions of previous CEAs of renal transplantation immunosuppressive regimens. From searches of electronic databases (NHS EED, The Cochrane Collaboration, MEDLINE and other database not specified) Astellas identified and included in its review 12 ‘representative studies because they met the inclusion criteria’ (Astellas’ submission, p. 28, chapter 8, Review of economic studies – it states that 11 studies were included in the review but 12 are actually cited). No details were provided about the inclusion criteria for the review of economic studies; such criteria, therefore, presumably refer to criteria employed for the effectiveness review in the submission. One of the included studies compared immediate-release TAC with TAC-PR (this study is reviewed in section 1.2 of the company’s submission).³²¹ Four studies compared TAC with CSA (three³⁰⁹^,³¹⁹^,³²⁰ of which met the criteria for inclusion in the review of section 1.2; the remaining study¹⁰⁰ was excluded from the review of section 1.2 because it measured costs only for medication) and seven studies³⁰⁶^–³⁰⁸^,³¹¹^,³⁵¹^–³⁵³ examined SRL in CNI avoidance or minimisation strategies compared with TAC (four studies³⁰⁷^,³⁰⁸^,³¹¹ included in the review of section 1.2) and three studies³⁵¹^–³⁵³ that were excluded from it as a result of the country to which they apply.

The submission briefly described the main results of these studies without critically assessing their validity and applicability to a UK setting, although a warning is issued about limited transferability of results from non-UK (10 out of the 12) studies. It concludes that the evidence supports the view that TAC is cost-effective relative to CSA, but that it is ambiguous in relation to the comparison against SRL in a CNI avoidance or minimisation strategy. The submission also includes a section in which three published models are described. No assessment of their strengths and weakness was presented. These models³⁰⁸^,³⁵¹^,³⁵² share the characteristics of models described and discussed in Assessment of cost-effectiveness (one of them³⁰⁸ is reviewed in that section).

Economic evaluation by the company

The CEA submitted by Astellas is an update of a published Markov model-based assessment of the cost-effectiveness of TAC, in either its prolonged-release formulation, TAC-PR, or the current standard therapy of immediate release (immediate-release TAC) by Muduma et al.,³¹⁸ reviewed in Chapter 4 (see Identified studies). The model describes the annual transitions between four health states, starting from kidney-only transplantation: functioning graft without history of AR; functioning graft having experienced AR; graft failure (dialysis); and death. The submission extends the effectiveness review for the model from June 2013, the cut-off date of the published study,³¹⁸ to June 2014. In addition, the analysis in the submission to NICE adds EVL in a CNI minimisation regimen to the list of treatments evaluated in the published paper.

Efficacy data used in the model

The model represents differences in outcomes between regimens as caused by their impact on BPAR. The model was based on the assumption that the effects of treatment on this surrogate outcome lasted for only the first year post transplantation. This assumption was combined with (1) the estimated RR of graft failure for a functioning graft with previous BPAR compared with no previous BPAR and (2) the 1-year post-transplant BPAR frequency, both from estimates reported by Opelz et al.,³⁵⁴ to derive the graft survival curves for grafts without prior AR and grafts with history of AR from the 5-year graft survival profile in UK registry data (NHSBT) 2013³⁵⁵ (Table 139). The model extrapolation was complemented by exponential survival curves to extend survival from 5 years up to 25 years post transplantation.

TABLE 139

One-year acute graft rejection rates used in the model

With regard to patient survival, the model used the 1-, 2- and 5-year post-transplantation survival rates from the NHSBT report 2012–13³⁵⁵ as the estimated survival rates with a functioning graft. To populate survival probabilities in the state of graft failure, the model used annual survival rates of people on dialysis followed for 10 years from the UK Renal Registry.³ The graft and patient survival rates were extrapolated to 25 years by estimating an exponential curve on the available data (including graft survival rates for years 3 and 4 derived by linear interpolation) and projecting survival rates from the last observed rate with the estimated curve. There is no mention in the submission about adjusting for increases in background mortality as the cohort in the model ages.

In addition to the difference in efficacy, measured in terms of ARRs, the model allowed for differences in effectiveness between the TAC arms through the differences in adherence induced by the once-daily, prolonged-release (Adagraf) compared with the twice-daily immediate-release formulations of the drug (immediate-release TAC). The model utilised comparative estimates of adherence with TAC-PR with immediate-release TAC of 88.2% vs. 78.8% from a published study³⁴³ and combined them with an estimated RR of graft failure in non-adherent versus adherent people of 3.47 derived from a meta-analysis,³⁵⁶ to obtain a RR of graft failure of 0.848, which was applied to the graft survival curves (until year 5, and by exponential curve extrapolation thereafter) that were common to all of the other immunosuppressive treatment strategies in the model.

There are two logical inconsistencies with this modelling procedure. First, accounting for the advantages in adherence with TAC-PR over immediate-release TAC makes comparison of TAC-PR with other immunosuppressive regimens in the model invalid, as no allowance was made for any effects of adherence on graft survival for the other regimens analysed in the model. Indeed, this undermines the fundamental assumption in the model that all significant differences in any drug regimen comparison may be accounted for by the effect through the surrogate, in this case the rate of AR.³⁵⁷ Thus, regardless of the validity of the comparative analysis of TAC-PR and immediate-release TAC, the indirect comparisons of model results between TAC-PR and SRL, EVL and BEL are then invalid.

Second, although the model was adjusted to include the effect of adherence on graft survival in the comparison of TAC-PR with immediate-release TAC, the patient survival curves (for the functioning and failed graft states) were left unchanged, so that the same set of patient survival curves was applied to all immunosuppressive options analysed. This implies the questionable assumption that improvements in graft survival, such as those obtained with TAC-PR relative to immediate-release TAC (and indeed relative to all other model arms), do not translate in direct patient survival benefits. This inconsistent logic in turn leads to underestimating the benefits of TAC-PR and overestimating its costs.

Inspection of the Microsoft Excel^® 2010 version 14 (Microsoft Corporation, Redmond, WA, USA) model spreadsheets revealed that the TAC drug regimen options (TAC-PR and immediate-release TAC) and EVL were the only treatment arms populated by data on actual immunosuppressive drug use (from the RCT sample on which the efficacy for the regimen was estimated); drug consumption values for BEL and SRL regimens were based on treatment guidelines (BNF or Summary of Product Characteristics).

Adverse events

The model allows for seven types of AE following transplantation: malignancy, diabetes mellitus, anaemia, CMV infection, hypertension, HMGCoA and wound-healing disorders. These events were assigned costs (except for the last type of event which had zero cost) but no disutility. The AE incidence rates in the model, reproduced in Table 140, differed across immunosuppressant treatment arms, although these had no influence on the probability of graft failure and patient death. Such differences only affected the costs differences between the treatments.

TABLE 140

Adverse events in the Astellas model (%)

The incidence rates of AEs were derived from a systematic review and meta-analysis published in 2006,³⁴¹ the values adopted by the published economic model for Germany by Jurgensen et al.³⁰⁶ reviewed in section 1.2 of the company’s submission and trial outcomes from the BENEFIT and BENEFIT-EXT trials.²⁰⁷

The rates of AEs were assumed to be the same with TAC-PR and immediate-release TAC and for the two SRL regimens (CNI avoidance and CNI minimisation). According to the incidence rates in this model, TAC has the lowest annual incidence of malignancy (except for SRL from the third post-transplantation year onwards), CMV, anaemia (except for BEL, which had the same annual incidence rates as those of TAC), dyslipidaemia and hypertension, but was associated with an excess incidence of PTDM over the other options.

Health-related quality of life and QALY outcomes were calculated from time spent in the graft functioning state and the graft failure state, which involved dialysis. Based on published estimates,³⁵⁸ the functioning state was associated with a utility value of 0.71, regardless of any prior experience of AR, and the graft failure state was associated with a utility of 0.459, which was equal to the weighted average of the utility of HD (0.44), experienced by 82% of people on dialysis, and PD (0.53) received by the rest.³⁵⁸

The model allows for the occurrence and effects of retransplantation, using the time to retransplantation data reported by McEwan et al.,³¹⁰^,³¹¹ which is reviewed above (see Chapter 4, Review of cost-effectiveness evidence). However, the states following the first retransplantation (i.e. functioning graft with prior AR on the current retransplant, functioning graft without prior AR on the current retransplant – regardless of AR of any previous transplant – and graft failure) face the same transition probabilities, utility values and costs as the corresponding states before retransplantation.³¹⁰^,³¹¹ This is likely to bias the analysis in favour of treatments with higher rejection rates in the model (as higher ARRs imply higher graft failure rates in this model) and may be interpreted as a conservative assumption of the relative effectiveness and incremental costs advantage of TAC over the comparators.

In addition, one incorrect calculation was identified in the Excel spreadsheets of the model submitted by Astellas. The problem was that the model used the data from the NHSBT from 2012 to 2013, on patient survival rates for kidney-only transplant recipients in the UK (p. 35, table 25, in the submission by Astellas) to populate the patient survival parameters of people with a functioning graft, ignoring the fact that such data on survival rates were likely to include deaths of both people with a functioning and those with a failed graft. Instead, the probability of death in the graft functioning state should have been calculated as the remainder of the annual probability of death from the NHSBT patient survival data minus the product of probability of mortality in the graft failure state and the proportion of people with a failed graft. In other words, the Astellas model is likely to overestimate mortality in the functioning graft states, which, in turn, underestimates the benefits of any gains in efficacy (i.e. reductions in AR in the model) that any regimen may have over another (e.g. TAC over the comparators).

Unit costs

The cost per mg of TAC-PR used was 23% lower than that of immediate-release TAC. (The authors present sensitivity analyses of discounts on TAC list prices limited to the first 90 days post transplantation.) Prices for other immunosuppressant regimens were based on BNF prices.

Treatment of ARs was assigned costs of i.v. steroids plus, for the 20% of steroid-resistant BPAR cases, the treatment costs of a regimen of rATG and an inpatient hospital stay for AKI without complications (£1737 overall mean cost). This assumed zero medical management costs for the 80% of people with steroid-sensitive AR and ignores any costs of follow-up to monitor treatment efficacy. The cost per year of dialysis was £38,387 and the cost of retransplant was £25,953. The costs of AEs adopted are presented in Table 141 (which reproduces table 35 in the Astellas submission).

TABLE 141

Costs of AEs (per year)

Results

The Astellas submission produces life expectancies (censored after 25 years) of 16.60 for TAC (immediate-release TAC), 16.57 for SRL CNI minimisation, 16.56 for EVL, 16.48 for SRL CNI avoidance, and 16.47 for BEL in a cohort of people of mean age 45 years, 37% of whom are women. The expected discounted (at 3.5%) QALYs were 8.01, 7.99, 7.99, 7.94 and 7.94, respectively. For TAC once-daily prolonged-release formulation (TAC-PR), total life expectancy was 16.96 and discounted QALY was 8.21.

In the base-case results, immediate-release TAC produced more QALYs than any of the comparators and lower costs than BEL and EVL, whereas it had higher cost against the SRL regimens. The ICER against SRL CNI minimisation strategy was in excess of £1M and the ICER against SRL CNI avoidance strategy was £174,842. In the comparison of TAC regimens, TAC-PR dominated immediate-release TAC, given its lower costs and higher QALYs (both discounted and undiscounted).

The results were found to be similar after changing assumptions, including the time horizon, from the base case of 25 years to 10, 15 and 20 years, the exclusion of discounting, AEs and half-cycle corrections. The results against SRL were found to change significantly when graft survival parameters in the model were populated with data from the SYMPHONY trial instead of the NHSBT data used in the base-case analyses: TAC-PR was found to dominate SRL as CNI avoidance regimen when both were given with DAC induction, 2 g MMF and steroids. In discussing these findings, the authors note that SYMPHONY trial has reported outcomes up to 3 years and is the largest prospective study in the de novo kidney transplantation to date, which showed TAC to result in lower AR, better renal function and graft survival outcomes at 1 year than the SRL regimen.

On the basis of these results, the company concludes that TAC is cost-effective and that TAC-PR should become the standard of care, as it produces lower costs and better health outcomes than immediate-release TAC. The latter statement is further supported, the submission claims, by the expected benefits, not accounted for in the Astellas model, arising from the improved pharmacokinetic profile of TAC-PR relative to immediate-release TAC. In addition, the authors argue that the results of the SYMPHONY trial have discouraged use of SRL, and that BEL’s high cost and high ARR may do likewise, citing a report by the All Wales Medicines Strategy Group³⁵⁰ as supportive evidence for this assertion.

Critical appraisal

The analysis presented by Astellas covers a number of appropriate comparators, including new regimens, BEL, and regimens with modes of action different from that of CNIs, that is, EVL and SRL. However, it omits one relevant comparator: CSA. There is no justification in the submission as to why this drug regimen option was not considered in the analysis. Muduma et al.³¹⁸ present the results of the same analysis based on data from the literature recorded in electronic databases up to 1 year earlier than the review in the Astellas submission (i.e. June 2013 vs. June 2014, respectively). The results reported by Muduma et al.,³¹⁸ who acknowledge employment by Astellas in the publication, are very similar to those presented by the Astellas submission for those drug regimens that were common to both reports (i.e. TAC-PR, immediate-release TAC, BEL, SRL CNI minimisation and SRL CNI avoidance). Unlike the Astellas submission, Muduma et al.³¹⁸ report results for CSA. The ICER of immediate-release TAC against CSA was £21,244 (table 1, base-case results³¹⁸) and the cost-effectiveness acceptability curve for the comparison showed that the TAC option had a 59.5% probability of being cost-effective at the £30,000 willingness to pay for a QALY threshold. The sensitivity analysis showed that the result of this comparison was sensitive to the inclusion of the AE costs, that is, when omitting them altogether the ICER for TAC increased to £35,446.

This evidence casts doubt on the robustness of the cost-effectiveness results and conclusions in the Astellas submission, and suggests that the results presented may be misleading owing to the exclusion of a relevant comparator. It is unfortunate that the submission did not include CSA, given the previous published degree of uncertainty in the cost-effectiveness of TAC.

There is use of inadequate data within the model. As discussed above, the estimates of patient survival in the functioning graft state may have been underestimated. This works against the more efficacious treatments, such as TAC, which had the lowest ARRs of all the regimens compared. Thus, the results reported by Astellas in the submission may be treated as conservative estimates of the costs and benefits of its TAC regimens. In relation to the evidence presented in support of TAC-PR, this may suffer from the previous criticism about the incomplete set of comparators, and the fact that the TAC-PR versus immediate-release TAC comparison is based on what is in effect a different model of the outcomes of renal transplantation from that used to compare immediate-release TAC against all the other regimens. In fact, the model used for comparing TAC-PR with immediate-release TAC contradicts the fundamental premise of the model used to compare immediate-release TAC with all regimens other than TAC-PR: that AR captures all important drivers of clinically meaningful outcomes.

One other issue relates to the way the model was structured. Although the model allowed repeat transplantation to occur for a given individual, the costs and HRQoL of subsequent dialysis were accounted for only the first transplantation. Although the proportion of people with more than one retransplantation may be small, this assumption could have been important to the conclusions derived from the comparison with CSA, had such comparator been included.

Another concern relates to how the timing of transplantation was implemented in the model. Markov models imply that transitions occur at the end of the period represented by each cycle. In the present case, the cycle length was 1 year and the authors of the Astellas model rightly decided on using half-cycle corrections to reduce the inaccuracy of calculation of expected costs and benefits that arise from having a long cycle length given the frequency of state transitions. The model, however, assumed that the proportion of people who undergo retransplantation in the very first cycle made a transition from the failed graft state to a functioning graft post-retransplantation state, as if the retransplant had occurred at the start of the period, so that they spent the whole cycle length (6 months owing to the half-cycle correction) with a functioning graft after retransplantation in the first cycle. This is wrong, as in a cohort of people with de novo kidney transplants, the discrete Markov process transition from a functioning first graft to a functioning retransplant requires two sequential intervening events to occur, that is, graft failure and retransplantation (i.e. a minimum of two cycles, one for each event, is required).

In summary, the main limitations of the Astellas economic analyses are:

Omission of CSA as a relevant comparator (without justification).
Patient survival estimates in the functioning graft state may have been underestimated, which works against treatments with low rates of AR, such as TAC. The underestimation is, in part, because of an error in using UK registry data on survival rates of both people with functioning and those with failed grafts to inform the survival rates for those in the model with a functioning graft.
The analyses comparing the TAC-PR regimens with other non-TAC regimens are invalid, as the two TAC regimens incorporate differences in treatment adherence and this is not accounted for in the other regimens.
Drug dosage levels for BEL and SRL were based on treatment guidelines, whereas for other regimens they were based on actual trial data.
The cost and HRQoL of dialysis were not included for recipients of second or subsequent transplantations.
The analysis does not account for the role of GRF in (1) long-term graft survival outcomes and (2) current costs and utilities.

Novartis’ submission

Novartis, the company that produces EVL, submitted a simulation model of an individual patient’s health experience for the lifetime remaining after renal transplantation in the English NHS. The following treatments were evaluated for a group of simulated people of mean age 45.7 years (SD 12.7 years), mean weight 70 kg (SD 10 kg), 68.5% of whom were male, and mean MDRD eGFR 9.03 ml/minute/1.73 m² (SD 7.9 ml/minute/1.73 m²):

EVL + reduced-dose CSA + steroids versus:
- TAC + MMF + CCSs
- Standard-dose CSA + MMF + steroids
EC-MPS + standard-dose CSA + steroids versus:
- Standard-dose CSA + MMF + steroids.

The model was specified as monthly transitions between six health states:

stable post-transplant state (functioning graft)
AR
graft failure
dialysis
retransplantation and
death (from CKD or other causes).

Moving between these states is associated with changes in direct health-care costs, whereas HRQoL (utility) changes are accounted for transitions between the states of having a functioning graft to a failed graft, and from any of these to the absorbing state of death. In addition, the model accounts for the changes in mortality risks, utilities and monitoring costs (outpatient specialist visit) with renal function. Although the costs associated with AEs emerging following transplantation were measured for six type of events [proteinuria, BK virus (BKV) infection, CMV infection, hyperlipidaemia, wound and hypertension], only for two of these was the loss of utility measured in the analysis (proteinuria and hypertension).

The model assumes that AR may happen up to 3 years after a transplant, and applies the same probabilities of this type of event to first and subsequent transplantations. The probability of chronic rejection (i.e. graft failure) is independent of renal function in the model. Once a patient’s graft fails, dialysis is started and given until the time a new transplant is received, which is determined by a random normal distribution process with mean of 36 months (SD 12 months). This feature of the model is what gives it its discrete event simulation nature.

The model allows different rates of change in renal function (eGFR) between the first year (during which they are specific to the immunosuppressive treatments) and the second, third and subsequent years, when the rate of eGFR change is common to all treatment arms in the model.

The model parameters for the EVL and MPA regimens were populated with efficacy and safety outcomes at 12 months from the study by Tedesco-Silva et al.,³⁵⁹ a multicountry trial that compared EVL 1.5 mg/day with mycophenolate acid 1.44 g/day in people receiving a primary kidney-only transplant in the period October 2005–October 2008. The values for the TAC regimen were obtained from a trial reported by Larson et al.,¹⁵⁴ which compared TAC with SRL in people receiving a kidney-only transplant (79% of whom were primary transplants in the TAC arm) in the period April 2001–January 2004 in the USA. The source of the efficacy and safety data for the MMF regimen was the multinational trial report by Vítko et al.,¹⁷⁷ which compared EVL with MMF in primary transplant patients who were recruited between August 1998 and August 1999.

The indirect nature of the relative efficacy data used as inputs to the cost-effectiveness model of the three comparisons submitted by Novartis presents some problems for valid estimation. In addition to the different dates when the trials were conducted and the type of transplant (primary only or mixed) for the EVL–TAC comparison, there were differences between the two studies in terms of the use of induction. Tedesco-Silva et al.³⁶⁰ reported that participants in their trial of EVL were administered two BAS 20-mg doses: one within 2 hours before transplantation and the other at 4 days post transplantation ‘or according to local practice’,³⁶⁰ whereas Larson et al.¹⁵⁴ reported that all people received thymoglobulin 1.5 mg/kg/day on days 0, 1, 2, 4 and 6 post transplant. The sample of TAC participants was also slightly older but more balanced in terms of sex, and had a higher proportion of living donor transplants. The major issue, however, is the fact that the actual amount of TAC use in the efficacy trial was different from the dose used to cost the same regimen in the model. Larson et al.¹⁵⁴ report that the TAC was started at a 3 mg twice daily. The estimated mean daily dosing at 1 year, separately reported for the first 59 people randomised to TAC, was 6.3 mg per day (SD 0.9 mg per day).³⁶¹ The model, however, applied costs to the TAC arm at a quoted BNF recommended dose of 0.25 mg/kg/day for a group of individuals of 70 kg mean weight, thus resulting in a mean daily dose of 17.5 mg, which is considerably higher than the actual drug use that corresponds to the efficacy outcomes used by the model. The dose behind the TAC drug acquisition costs used in the Novartis submission is also larger than the mean daily doses for immediate-release TAC reported by Tedesco-Silva et al.,³⁴⁹ which Astellas adopted in its submission, and which are consistent with the report of Dean et al.³⁶¹

In relation to the data sources for the comparison of EVL with the MMF + CSA regimen, the trial samples differ in terms of the period covered by the study and the country mix. The proportion of cadaveric donors transplant recipients was 46.6% in the EVL group compared with > 90% in the MMF + CSA regimen.¹⁵⁰ Moreover, the MMF regimen was given without induction therapy, in contrast with the trial that provided the outcome data for the EVL model arm.³⁵⁹ The same issues applied to the comparison of MPA with MMF + CSA, as the data source for MPA was the same trial as that for EVL.¹⁰⁷

Costs

Immunosuppressive costs of the MPS + EVL treatment regimens were based on the dosing protocols of the individual trial that was the source of efficacy data, whereas the costs of drug acquisition for the comparators, that is, the TAC and MMF + CSA regimen, were based on BNF-recommended starting dosages. Other health-care costs included the costs of monitoring GP visits, which increased with higher CKD state. The cost of an AR event was taken from that reported by McEwan et al.³¹⁰ The annual costs of dialysis, £22,877, were obtained from a 2011 NICE costing report³⁶² on organ donation for transplantation. Retransplantation involved an estimated cost of £17,736, a weighted average of NHS reference costs 2012/2013⁶⁴ for transplant procedures for varying ages and donor types.

Utilities

Estimates of utilities were derived from the study by Neri et al.,³⁶³ who reported EQ-5D health states measured in a cross-sectional study of people with kidney-only transplants in the UK, valued using UK tariffs, as a function of CKD states. As renal function deteriorated so did the HRQoL (utility) values experienced by the simulated patient in the model. The model accounted for negative impacts on HRQoL (disutilities) of two adverse effects, proteinuria (reduced utility by 0.043) and hypertension (reduced utility by 0.010).

Results

EVL + reduced CSA vs. TAC + MMF

Novartis reports a life expectancy at transplantation in a patient group of mean age 45.7 years (SD 12.7 years) of 25.71 life-years under the EVL immunosuppression compared with 23.39 life-years under TAC, and discounted QALYs of 8.86 and 7.37, respectively (Novartis’ submission, table 5.18, Base-case analysis – deterministic ICERs). Given the discounted costs per patient that result under these options, £135,358 for EVL and £140,972 for TAC, EVL was found to be the preferred option, as it is less costly and more effective than TAC.

Further results accounting for uncertainty in model inputs relating to uncertain parameters (ARRs, chronic rejection rates, rate of change in eGFR after 12 months post transplant, health-state utilities and event costs) confirmed that the probability of EVL being cost-effective was 100% at thresholds ranging from £0 to £200,000 per QALY.

EVL + reduced CSA vs. MMF + standard-dose CSA

The EVL regimen was found to produce 1.76 extra years of life over the MMF with CSA regimen in the base case of a cohort of mean age 45.7 years. This corresponded to 0.99 extra discounted QALYs (Novartis’ submission, table 5.18, Base-case analysis – deterministic ICERs). The EVL containing triple therapy was also associated with £59,354 extra discounted costs over the MMF + CSA regimen, and a practically identical ICER figure, given the 0.99 discounted QALY benefit with EVL.

In probabilistic sensitivity analysis (PSA) accounting for the uncertain parameters (as listed for the results of the EVL vs. TAC comparison), the EVL had a 0% probability of being cost-effective relative to MMF for cost-effectiveness thresholds ranging from £0 to approximately £86,000 willingness to pay per QALY, and was still < 15% at £200,000 per QALY.

The fact that the PSA yielded a willingness to pay per QALY threshold at which EVL had a 50% chance of being cost-effective (> £200,000 per QALY), which was more than three times its deterministic ICER of £59,354, indicates that the model has important non-linearities and that using the deterministic values for decision-making is incorrect. Although this warning would not have made any difference to a decision based on a £30,000 per QALY threshold (i.e. both determinist and probabilistic results led to the same conclusion) for this comparison or the previous one discussed (i.e. EVL vs. TAC), the distinction does matter for interpreting the results of the third comparison presented by Novartis – of EC-MPS vs. MMF, discussed next.

EC-MPS vs. MMF + standard-dose CSA

In the deterministic base-case analysis, the mycophenolate regimen was found to result in 25.48 life-years, and 8.69 discounted QALYs per patient (table 5.18, Base-case analysis – deterministic ICERs). Mycophenolate acid had an extra 1.31 life-years and 0.80 discounted QALYs per treated patient relative to MMF. Given its additional discounted costs of £10,588, EVL had an ICER of £13,209 per QALY relative to MMF with CSA.

In the PSA that accounted for the effect of uncertain parameter estimates (as listed in the results of EVL relative to TAC), mycophenolate acid had a 50% chance of being cost-effective at a threshold value of around £28,000 willingness to pay per QALY.

Although the deterministic ICER for MPA is below the lower cost-effectiveness threshold adopted by NICE (£20,000), the willingness-to-pay threshold corresponding to the 50% probability of mycophenolate acid being cost-effective in the PSA is ≈ £28,000) suggesting that EVL may be borderline cost-effective, in relation to the £30,000 maximum acceptable amount NICE is willing to pay for a QALY. This comparison shows that the deterministic results are potentially misleading for informing decisions or deriving model predictions about treatment outcomes in this model.

Critique

The Novartis model uses a patient simulation model of monthly cycles to calculate the costs and health outcomes of immunosuppressant regimens over the remaining lifetime (i.e. 50 years post transplantation). The main strength of the model is its account of the occurrence of clinical events that determine health status, that is, AR, and graft and patient survival, as well as the effect of renal function on costs and HRQoL.

The study failed to conduct adequate evidence synthesis, as their methods of identification of relevant evidence on efficacy was not systematic, as acknowledged by the authors. The model analyses were based on data from single trials, and their analyses were restricted to undertake pairwise indirect comparisons of the treatments investigated in each of those individual trials. This led to results that were at odds with findings from the systematic review of the clinical evidence undertaken by PenTAG (see Chapter 3, Summary of pairwise comparisons) which found no statistically significant improvement in efficacy outcomes (AR, graft failure, death) of EC-MPS compared with MMF, whereas the Novartis model-based analysis produced an extra 1.31 life-years for EC-MPS. Therefore, the results by Novartis are likely to be biased, and consideration of additional efficacy evidence from direct and indirect comparisons would have allowed the company to provide a more reliable technology assessment.

Some errors were identified in the calculation of unit costs of immunosuppression for the CSA component of the EVL regimen, which was common to two other comparators, but is not part of the current standard clinical practice in England. This had the effect of underestimating costs for the CSA-containing regimens.

The model accounted for some important AEs, but omitted one of the most important determinants of patient and graft survival: PTDM.

A major flaw in the model is the assumption that graft failure occurs independently of the GRF or the occurrence of AR. The probability of graft failure (labelled chronic rejection in the submission) is based on 12-month post-transplantation trial data for each regimen, which, given that this probability is constant over the 50-year time horizon of the model, casts serious doubt about the validity of the findings.

In summary, the main strength of the Novartis analysis is its account for the effect of differences in GRF between treatment arms on current costs and utilities. Its main limitations are:

The use of treatment effectiveness data from single selected RCTs, not systematic reviews or meta-analysis, and based on pairwise indirect comparisons of those trials. The estimated effectiveness of EC-MPS compared with MMF is therefore substantially greater than that estimated from the assessment group’s systematic review and meta-analysis.
The model structure contains the assumption that graft failure occurs independently of GRF or the occurrence of AR. Instead, the probability of graft failure is based on the trial-derived rates at 1 year post transplant, which are then assumed to remain constant throughout the modelled period.
Regimens involving CSA (including the EVL regimen) had incorrect unit costs for CSA; this would underestimate the cost of those regimens.
The estimate of the annual cost of dialysis is from an unusual source, and substantially lower than current costs as in the NHS reference costs.
The AE PTDM is not included in the model (despite others being included).

Bristol-Myers Squibb’s submission

The following regimens, all following BAS induction, were compared in the Bristol-Myers Squibb submission:

BEL (less-intensive dosing) + MMF + steroids vs. CSA + MMF + steroids
BEL (less-intensive dosing) + MMF + steroids vs. TAC (immediate release) + MMF + steroids.

Two patient populations were studied, namely standard criteria donor recipients, and the ECDs recipients of de novo renal transplants. In addition, the submission presented subgroup analyses for people of weight of > 90 kg.

In its review of the effectiveness evidence, the company justifies its exclusion of SRL from the analysis arguing that, in practice, its use ‘is generally restricted to treating renal transplant patients whose renal function is steadily declining on TAC or CSA, and in whom other measures (such as dose adjustment) have not been successful’ (Bristol-Myers Squibb’s submission, chapter 3, efficacy section). As for TAC-PR, the company argued that there was insufficient direct or indirect evidence to include it as a comparator. EVL was excluded from the analysis because it lacks UK marketing authorisation. As for MMF and MPS, the company states that they were not included as comparators because they are required to be given with CCSs as part of triple therapy containing BEL, TAC or CSA.

The evidence used to populate the efficacy and safety parameters in the model used in the Bristol-Myers Squibb analysis was derived from the BENEFIT⁵⁹) and BENEFIT-EXT¹⁴² trials, which compared BEL with CSA. The efficacy and safety parameter values for BEL relative to immediate-release TAC were obtained from indirect comparisons in a NMA of 32 studies, 29 of which compared TAC with CSA and three studies (including BENEFIT⁵⁹ and BENEFIT-EXT¹⁴²) of BEL compared with CSA.

In making the case for BEL the submission argues that the i.v. mode of administration is likely to result in increased adherence to treatment relative to TAC and CSA, which are administered orally and require routine monitoring to drug exposure and dose adjustment. The company claims that this would be expected to result in improved outcomes with BEL over the CNI comparators. Further, in setting the context of the economic evaluation (Bristol-Myers Squibb’s submission, chapter 6, Cost-effectiveness of BEL) the company states that the drivers of the evaluation were the acquisition cost of BEL, the number of years of functioning graft and the costs and utility (HRQoL) of dialysis following graft failure, which led it to perform subgroup analyses in those whose expected graft survival is short. Therefore, because ‘post-transplant renal function is a well-established predictor of graft survival this analysis focused on people with a post-transplant eGFR < 30ml/minute/1.73m² as these people represent those for whom improved post-transplant renal function is most likely to have significant health and cost benefits’.

The analysis is based on the 3-year outcomes from the pooled data from BENEFIT⁵⁹ and BENEFIT-EXT,¹⁴² including renal function (eGFR), and the cumulative incidence of NODAT, AR, PTLD, graft failure and death, where eGFR of < 15 ml/minute/1.73m² was assumed to identify people with graft failure. The Markov model developed by Levy et al.³³⁴ was then used to extrapolate these outcomes to the long term. To avoid repeating the description in Chapter 4 (see Identified studies), the main features of this model are summarised here.

The model represents annual transitions among the following health states:

functioning graft (including distinguishing four categories of renal function according to National Kidney Foundation Kidney Disease Outcomes Quality Initiative)
- GFR stage 2 (GFR2) = ≥ 60 ml/minute/1.73 m²
- GFR3a = 45 ml/minute/1.73 m² ≤ GFR < 60 ml/minute/1.73 m²
- GFR3b = 30 ml/minute/1.73 m² ≤ GFR < 45 ml/minute/1.73 m²
- GFR4 = 15 ml/minute/1.73 m² ≤ GFR < 30 ml/minute/1.73 m²
graft failure/dialysis defined as:
- GFR5 = GFR < 15 ml/minute/1.73 m²
functioning re-graft/retransplantation
death.

The probabilities of transitions between these states were populated by time to event models estimated by Levy et al.³³⁴ using US registry data. The survival models were the following:

Weibull time to event models for graft survival [two models: (1) graft failure 1–4 years after transplant and (2) graft failure > 4 years]
Weibull time to event model for patient survival [two models: (1) death with a functioning graft 1–4 years after transplant and (2) death with functioning graft (DWTG) of > 4 years]
exponential survival model of time from graft failure to retransplant
exponential survival model of time from retransplant to graft failure
exponential patient survival on dialysis (after graft failure)
exponential patient survival after retransplant.

The Weibull survival model adjusted for covariates including patient age, sex, baseline eGFR, weight, NODAT, AR events, PTLD, donor type and other, calendar year, and patient and donor characteristics.³³⁴ The conditioning of these models’ predictions on baseline eGFR allowed the derivation of separate survival curves for the different starting (i.e. at 3 years post transplant) renal functioning health states in the model. In order to assign costs and utilities for each starting eGFR group, the total time spent with a functioning graft predicted from the survival models (adjusted for death risks) was allocated to different eGFR categories by assuming that eGFR declined linearly over time from its starting level (the midpoint of the starting eGFR stage) until reaching graft failure, which was associated with an eGFR level of 15 ml/minute/1.73 m². Thus, for example, the group of people who entered the Markov model in GFR2 (at 3 years post transplant) at the midpoint GFR level of 67.5 ml/minute/1.73 m²; those in these groups who experienced graft failure, say on the fifth annual cycle (i.e. 8 years post transplant), would be assumed to have traversed from eGFR2 to eGFR5 at an annual rate of 10.5 ml/minute/1.73 m² [ = (67.5 – 15)/5 ml/minute/1.73 m²]. Thus, the members of this illustrative group of modelled people would have made a transition from GFR2 to GFR3a in the first year (at the end of which they would reach a GFR level of 57 ml/minute/1.73 m²), remain in eGFR during the second year (to finish it at a GFR level of 46.5 ml/minute/1.73 m²), then make a transition to, and end the third year in, GFR3b (at a GFR level of 36 ml/minute/1.73 m²), make a transition to GFR4 in the fourth year (to end the year at GFR level of 25.5 ml/minute/1.73 m²) and experience graft failure at the end of the fifth year (GFR level of 15 ml/minute/1.73 m²). In the model, some people die without graft failure, and they were assumed to have remained in the same eGFR stage as that in which they entered the model [on the basis of regression analysis of United States Renal Data System (USRDS) data on which the survival models were estimated].

After calculating expected costs and outcomes in the Markov model for each starting eGFR stage over 37 years (which, added to the initial 3-year period, amounts to the modelled horizon of 40 years adopted in the base case), the expected costs and outcomes for the whole population were calculated by a weighted average of the expected costs and QALYs across starting model stages. The proportions were the frequency distributions of people at 3 years post transplant across functioning graft stages (approximated by a normal distribution using mean and SD of eGFR values), dialysis stage and death. Finally, the expected costs and QALYs over the extrapolated Markov phase were added to costs and QALYs associated with the observed trial outcomes in the trial to calculate total QALYs and costs over 40 years for each trial arm in BENEFIT⁵⁹ and BENEFIT-EXT.¹⁴²

Efficacy parameter estimates

The main inputs for the model were those estimated from the NMA at 36 months. These are presented in Table 142, which reproduces table in the industry submission (Bristol-Myers Squibb’s submission, section 6.1, Model inputs, table 28). In the model, the effect of NODAT on graft and patient survival curves is accounted for by applying HRs from the literature.³⁶⁵ PTLD and CVD were accounted for in the model by assigning a 50% chance of death to each of them. The sources of these estimates were not given.

TABLE 142

Relative effect of TAC and BEL vs. CSA at 36 months

According to the Bristol-Myers Squibb submission, the distribution of the patient cohort at the start of the Markov model for each of the three regimens evaluated – BEL, TAC and CSA – was calculated from the pooled BENEFIT⁵⁹ and BENEFIT-EXT¹⁴² trial data on GFR outcomes at 36 months post transplant. They assumed that GFR level followed a normal distribution to derive the distribution across functioning graft states, and used the observed means of 38.6 and SD of 22.93 for CSA, 54.64 for BEL (from the BENEFIT⁵⁹ trials) and 44.8 for TAC (from NMA relative to CSA). But the assumption of normally distributed GFR is problematic, as it implies that in the CSA arm, 4.6% of people at the end of the trial phase (and therefore at the start the Markov model phase) have a negative GFR value. However, inspection of the model’s Excel spreadsheets revealed that these values were not used in the model, but rather a mean of 50.80 and SD of 21.80 for CSA, which implies that 0.9% of people have a negative GFR value at 3 years post transplant. The means for TAC and BEL were, in turn, 58.47 and 66.96, and they also applied the SD 21.80 for CSA (these imply negative GFR values for < 0.4% of people).

To validate the survival curves underpinning its Markov model, which were estimated from US data, the company compared the predictions from its Weibull survival models with UK data from the NHSBT 2013 report³⁵⁵ (these have been discussed in relation to the model submitted by Astellas, submission section 6.1). The predicted survival curves from the Bristol-Myers Squibb model by type of donor (DBD and DCD) are compared with the corresponding UK data points at year 1, 2 and 5 post transplant. Owing to the difficulty of visualising the chart presented by Bristol-Myers Squibb (Bristol-Myers Squibb’s submission, figure 22), the 5-year survival curves reported by the NHSBT 2013 report are reproduced in Figure 66, alongside the corresponding predictions in the survival model informing the Markov model in the company’s submission. It shows that the model predictions for the DBD graft survival (DBD predictions based on USRDS) converge towards actual UK data for the corresponding donor type. The model predictions based on the DCD patient population, however, appear to diverge from the trend observed in UK data for each donor type. This is of concern, as predictions from this model were used to extrapolate 3-year trial outcomes for 37 years.

FIGURE 66

Validation of first adult kidney-only graft survival predictions of the Bristol-Myers Squibb model (based on US data from the USRDS) with NHS data (NHSBT) by donor type.

Changes in eGFR stages were associated with changes in utilities and costs. Utilities were derived from a cross-sectional study of UK renal transplant patients.³³⁸ AEs including AR, NODAT and PTLD were given estimated annual utility losses of 0.50, 0.06 and 0.44, respectively, reported from the literature.

Costs

The submission provides actual data on estimated costs of clinical events following transplantation in standard practice at a single centre in Wales (Table 143). The analysis has been published as part of a multinational study report (described in Chamberlain et al.⁴² Assessment of cost-effectiveness, Results), which shows some common and divergent practice between this site and other European centres. Briefly, costs were estimated in a retrospective analysis of computerised records from the Cardiff Renal Transplant Database, related to all individuals aged ≥ 18 years who received a kidney-only transplant recorded between January 1998 and December 2005. They were followed up to 3 years, and the analysis included those in whom data were recorded for at least 12 months after transplant and whose data included their most recent transplant in the studied period.

TABLE 143

Costs and utilities by GFR in the Bristol-Myers Squibb model

The study provided evidence that was previously unavailable for the UK on actual costs of post-transplantation care and events stratified by GFR at 1 year post transplant. The sample for analysis included 370 people in whom a variety of treatment regimens were used. Of the 20 different treatments used in this period, triple therapy with TAC steroids and AZA was the most frequent (19%), followed by triple therapy with TAC, steroids and MMF (18%). The next most frequently used regimens were double therapy with TAC and AZA or TAC with MMF (9% each). By the second year the proportion of people on these TAC triple regimens had declined (to 14% and 12% of the sample), whereas the proportion of people on the double therapy TAC had increased (to 14% and 13%). The same observation was made from 24 months to the 24+ months’ follow-up point.

Another aspect of this data source is the observed number of TAC immunosuppressant doses used over the follow-up period in this sample. Although, the dose of TAC, given as part of triple therapy alongside MMF and steroids, was continually reduced over the first year from the mean of 10.31 mg at month 1 to 6.36 mg at month 12, and was 5.73 mg and 5.71 mg at month 24 and month 24+, respectively, the dose was kept at 11.23 mg throughout the observation period in the triple regimen that included AZA (Bristol-Myers Squibb’s submission, appendix 5, Preliminary report PORTRAIT database study Cardiff).

On the basis of the resource-use estimates from the PORTRAIT study report, the TAC drug regimen and the CSA regimen costs were estimated. Drug use was valued at BNF 67³⁶⁹ prices [for TAC, the average price of immediate-release TAC 1 mg of 50- and 100-capsule packs was used; for CSA, the average prices of Capimune^® (Mylan), Capsorin^® (Morningside Pharmaceuticals Ltd), Deximune^® (Dexcel Pharma Ltd) and Neoral^® (Novartis), 30-capsule packs, were used]. Administration costs included one laboratory test per outpatient appointment to determine CNI level, and accounted for the observed number of outpatient appointments in years 1, 2 and 2+. The costs of BEL administration included the costs of i.v. infusion, which were obtained from a previous HTA report on abatacept (from which BEL was derived, and that has the same method and frequency of administration). Thus, the annual drug acquisition and administration costs of the regimens in the first year of the model for a 75-kg patient were £13,472 for BEL, £3937 for TAC (immediate-release TAC) and £1972 for CSA. These costs were smaller in the second and subsequent years by about 30%, 25% and 15% in the BEL, TAC and CSA arms, respectively.

Results of Bristol-Myers Squibb’s analyses

In the base-case results for a cohort of people with a starting average age of 43 years, at 40 years post initial transplant 11% of people would be alive under BEL, whereas that would be 8.8% under TAC and 7.4% under CSA. By that point, in 75.6% of people the graft would have failed under BEL, whereas that would have happened in 73.8% of people under TAC and 76.9% under CSA. Correspondingly, 19.3% of people received retransplantation under BEL, 19.2% under TAC and 20.6% under CSA.

When comparing total discounted costs, BEL resulted in incremental costs of £91,001 over TAC and £92,216 over CSA. In turn, the incremental discounted QALYs were 0.62 relative to TAC and 0.97 relative to CSA. The incremental cost per additional QALY of BEL relative to TAC was £147,334, whereas that for TAC relative to CSA was £3375.

These results were driven by the higher costs of BEL immunosuppression, which, despite its associated savings in dialysis costs relative to the other regimens (£15,469 relative to CSA and £2248 relative to TAC), incurred seven and three times the cost of immunosuppression of the CSA (additional costs £109,402) and TAC (£95,159 difference) regimens, respectively. These results were confirmed by PSAs and deterministic sensitivity analyses, which showed the ICER to be insensitive to variation in uncertain parameters.

The submission presented additional analyses for a special group of people with a shorter expected graft survival than that for the overall patient population. This is referred to as ‘subgroup analysis’ by the company, and implemented by defining the group as those people with GFR of < 30 ml/minute/1.73 m² at 1 year post transplant. They implement a post-hoc adjustment to the model so that the effect of eGFR improvements within that range may be accounted for in the model, which originally was specified in discrete eGFR categories and thus restricted all people entering the model in the same category to having the same benefits. The company found that, in these people, BEL results in higher benefits (0.46 extra QALYs in both comparisons) and lower costs (–£1478 relative to CSA and –£4166 relative to TAC).

However, this analysis suffers from a logical flaw. It assumes that those people whom the company claims to have identified as able to benefit from their drug regimen may be identified with precision. In fact they may not. The meaningful definition of subgroup analyses in a setting where risk and uncertainty influence the outcomes of treatment such as this, so that the outputs of a decision model are mathematic expectations of cost and benefits, identifies a selected group of people for special management on the basis of observable characteristics defined at the outset. The defining characteristic of the selected group of people in the subgroup analysis by Bristol-Myers Squibb is an outcome of treatment, and thus not known at the time of transplant (which would be required for sound decision-making analysis about choice of maintenance treatment).

A subgroup analysis presented by Bristol-Myers Squibb finds that BEL may be cost-effective in people with body weight of approximately 90 kg and more. At this body weight, BEL use incurs minimal vial wastage, thus maximising effectiveness for the given cost.

Critique

The model captures all the most important clinical outcomes and AEs arising post transplantation, and accounts for the role of renal function as a prognostic factor for long-term graft survival and its contemporaneous effects on HRQoL and costs. It also accounts for the effect of short-term AR on longer-term graft and patient survival.

A major strength of the evidence presented by Bristol-Myers Squibb is the cost study used to populate the costs of immunosuppressant drug use and administration in the model and the costs associated with renal function. This evidence has been reported as part of a wider study⁴² in a peer-reviewed publication.

The major limitation of this study is the questionable generalisability of the values used to populate the transition probabilities of the model used to extrapolate short-term trial outcomes to 40 years. The survival models that inform the transition probabilities to the key events, that is, graft failure after transplant, time to retransplantation after graft failure, and possibly patient survival with a functioning graft, may reflect the experience of a patient population that does not correspond to that of the UK.

Another issue is the use of efficacy differences between regimens at 3 years post transplant to populate the entire initial 3 years, as if these differences had occurred from day 1 and remained constant until the end of the third year post transplantation, which we know was not the case, and bias the analysis in favour of BEL, the company’s drug. In fact, inspection of the model spreadsheet reveals that discounting was not applied to the first 3-year costs and benefits.

A methodological limitation is the assumed linear, constant decline in eGFR, which was the driver of the Markov model used to extrapolate outcomes beyond 3 years, in order to estimate quality of life over the graft survival period conditional on initial eGFR value. This, in turn, reflected the limited information available on renal function from registry data; studies using multicentre cohorts could potentially address this issue by measuring, rather than imputing, renal function periods of longer than 2–3 years, which are typically found in the experimental literature.

In summary, the Bristol-Myers Squibb model has numerous strengths, but has the following main limitations:

The use of US data to extrapolate the survival data for key transition probabilities to 40 years (graft failure, time-to-retransplantation after failure).
The use of efficacy differences between regimens at 3 years post transplant to invalidly calculate benefit differences throughout the first 3 years in the cost-effectiveness model, which favours the company’s drug, BEL.
Lack of accounting for the costs of concomitant regimens used in the triple-therapy regimens investigated by the RCTs, which served as the source of efficacy values in the model (discussed in Comparison between the model submissions).
Lack of discounting of costs and QALYs in the first 3 years of the analysis, which invalidly raises the benefits of BEL proportionally more than it increases its incremental costs.
The assumed linear decline in eGFR 3 years post transplant at a rate with no validation or sensitivity analysis of this assumption.
A ‘subgroup analysis’ based on people with poor GRF at 1 year, but who would not be identifiable at the time of starting maintenance immunosuppression (and therefore also outside the scope of this technology assessment)
Another subgroup analysis, of those with a body weight of 90 kg, should be disregarded, as this subgroup is based only on the cost differences that would be affected by the patient’s weight.

Comparison between the model submissions

Besides the treatment comparisons, the company submissions also differ in terms of the models used to evaluate those treatments (Table 144). Table 145 highlights how useful the evidence provided in each of the economic evaluations may be to inform the decision-making. Given the necessity to extrapolate short-term outcomes reported in trials with typical follow-ups of 1–3 years, the main differences between extrapolating models used by the three companies are reflected in the choice of surrogate outcome used to drive the disease course in people with renal transplantation and the duration of any relative effects of treatments.

TABLE 144

Summary of the economic analyses in company submissions

TABLE 145

Evers checklist: quality of published economic evaluation studies

The submission by Astellas uses a Markov structure to model the disease evolution and the effects of treatment in the relevant cohort of people. In this model the occurrence of BPAR in the first year post transplant (for the first transplant and any second transplant occurring in the first year of the model) affects the probability of graft failure in subsequent years. Renal function plays no role in this model. In contrast, differences in eGFR changes between the triple-therapy regimens in the first year drive the modelled outcomes of subsequent years in the model by Novartis. Although the risk, costs and HRQoL consequences associated with ARs are accounted for in this model, these events do not affect graft survival. Graft failure is thus as likely to occur while individuals are at CKD stages 1 and 2 as when they are at CKD stage 5, and any state in between those two extremes for that matter. The model by Bristol-Myers Squibb, unlike that by Novartis, assumes that eGFR at the end of year 1 determines graft survival. However, unlike Astellas and similarly to Novartis, the Bristol-Myers Squibb model allows for the costs and consequences of changes in eGFR over time in the functioning graft state and for the effect of eGRF on the probability of patient death. An additional advantage of the Bristol-Myers Squibb analysis over that of Novartis is its allowance for the effects of AR in the first year post transplant to affect patient and graft survival thereafter, as the analysis by Astellas does for the graft survival only.

The figures adopted by the Novartis submission seem to underestimate the costs of twice-daily immediate-release TAC doses. Their cost per mg for TAC is £0.82, whereas the weighted average figure for the market share of the different presentations used by Astellas is £1.618. On the other hand, the mean daily dose at 70 kg body weight for TAC in the Novartis submission is 17.5 mg, whereas the average daily dose for the first year used by Astellas is 7.17 mg. This results in an average maintenance monthly cost of TAC that is 24% higher in the model by Novartis than in the model by Astellas (i.e. £438 vs. £353 per month).

Other differences were found in terms of the unit costs of the MMF therapy. Novartis used a £9.65 price per pack of 50 tablets of 500 mg each, obtained from market data [Commercial Medicines Unit (CMU) Electronic Market Information Tool (eMit) 2014³⁷⁰], whereas Astellas used a price almost 10 times higher: £82.26 per pack of 50 capsules of 500 mg, citing BNF 2014.⁵⁶ The effect of the chosen MMF price is also different across the submitted analyses, as MMF is a concomitant medication across all immunosuppressive regimens analysed in the evaluation by Astellas, whereas in the Novartis analysis MMF is not part of the regimens involving the company’s own therapies (i.e. EVL and EC-MPS). Thus, although across submissions the treatment regimens that include the companies’ drugs may be associated with increased effectiveness, a higher MMF price has different implications across the submissions: it makes it less attractive for the NHS to adopt such a regimen (as people live longer and incur higher drug costs) in the Astellas analysis, whereas the opposite occurs in the Novartis case (as only the cost of comparator regimens increases).

Although the three models submitted to NICE for this assessment varied in terms of the way the health course of an individual evolved and the use of immunosuppression affected such path, accounting of costs was similar in some aspects once the cycle length of models was taken into account. Table 146 presents the most important costs for those elements that were common across the models.

TABLE 146

Major cost elements (£) in the model submissions

Although the acquisition costs of TAC are comparable across the three industry submissions, only the one by Bristol-Myers Squibb reports any estimates of drug administration, which have the merit of being based on observed data as opposed to assumptions about compliance with dosing guidelines or protocols. With respect to immunosuppression costs, it may be noted that Bristol-Myers Squibb did not account for costs of other concomitant drugs that are part of triple-therapy immunosuppression (e.g. MMF + CCSs, which were given in BENEFIT⁵⁹ and BENEFIT-EXT¹⁴²).

More importantly for the results is the observation that Bristol-Myers Squibb used an estimate of dialysis costs³⁷² that was twice the size of the estimate adopted by Novartis (NICE costing guideline 2011³⁷⁴) and almost 13% higher than that of Astellas.³⁷¹ Given the driving influence of dialysis costs for cost-effectiveness and an issue to be discussed next in relation to the time spent on dialysis in the models, the quality of evidence gained by the Bristol-Myers Squibb model in estimating immunosuppression-related costs and event costs may have been partly offset by an overestimation of the cost savings to be obtained from reducing the time for which people experienced dialysis.

In Table 147, the key features of the effectiveness elements of the analyses performed by the companies are presented. A salient aspect of the comparison model specifications is the longer expected time to retransplantation at the time dialysis starts for those people whose graft fails in the Bristol-Myers Squibb model. It is noted that this estimate was derived from an exponential survival model from an older patient sample in the USA (Medicare-covered transplant-only people). This model has a hazard (instantaneous probability) of receiving a transplant that is constant over time and that is predicted according to donor and patient characteristics (Levy et al.³³⁴). In the Bristol-Myers Squibb model these characteristics are fixed over time and result in the constant annual probability of 4% of receiving a transplant while on dialysis. This means that the expected waiting time for a retransplant in a US sample with the Bristol-Myers Squibb’s model characteristics (which match the BENEFIT⁵⁹ and BENEFIT-EXT¹⁴² sample characteristics), as detailed in the Bristol-Myers Squibb submission, is 16.5 years at the start of dialysis. This waiting time is clearly longer than the waiting time currently expected in the UK, which may be closer to the values adopted by Astellas and Novartis in their models.

TABLE 147

Key features of effectiveness analysis in industry models

In any case, the median time to retransplant may also be unrealistic for the USA, even after considering issues about socioeconomic barriers to access and related features of that system. After inspection of the estimated coefficients of the exponential model reported by Levy et al.³³⁴ (supplementary material, file 1, and reproduced by the Bristol-Myers Squibb submission as appendix 4, table 1), the age covariate (which remains fixed at 40.3 years throughout the 40 annual cycles of the Markov model, so that those proportions of the cohort who experience graft failure early in the model have the same probability of receiving a retransplant in any given cycle as that people who experience graft failure in the latter part of the modelled time horizon) is positively associated with the probability of retransplant, which means that those who start dialysis at older ages have shorter expected waits for a retransplant and suggests that the model was estimated in a cohort of much older people than the Bristol-Myers Squibb’s modelled age of 40 years (e.g. for graft failure at age 70 years the model yields an expected wait of approximately 10 years to receive a retransplant).

The overestimation of time to retransplant in the Bristol-Myers Squibb model that was just described has the implication of overestimating the time on dialysis with its associated costs and loss in quality of life. This, in turn, means that the model is likely to overestimate the benefits of any advantages in terms of graft survival that BEL has over its comparators, TAC and CSA. Likewise, this probably exaggerates the costs savings and quality-of-life gains of TAC over CSA, which suggest that its ICER (£3375; this was not stated in the Bristol-Myers Squibb submission but implicit in their numbers and calculated from them by PenTAG) is an underestimate. Table 148 presents a summary of model outputs for the three industry model submissions.

TABLE 148

Results of model-based analyses submitted by the companies

Copyright © Queen’s Printer and Controller of HMSO 2016. This work was produced by Jones-Hughes et al. under the terms of a commissioning contract issued by the Secretary of State for Health. This issue may be freely reproduced for the purposes of private research and study and extracts (or indeed, the full report) may be included in professional journals provided that suitable acknowledgement is made and the reproduction is not associated with any form of advertising. Applications for commercial reproduction should be addressed to: NIHR Journals Library, National Institute for Health Research, Evaluation, Trials and Studies Coordinating Centre, Alpha House, University of Southampton Science Park, Southampton SO16 7NS, UK.

Included under terms of UK Non-commercial Government License.

Bookshelf ID: NBK379777

Contents

< Prev Next >

PubReader
Print View
Cite this Page
Jones-Hughes T, Snowsill T, Haasova M, et al. Immunosuppressive therapy for kidney transplantation in adults: a systematic review and economic model. Southampton (UK): NIHR Journals Library; 2016 Aug. (Health Technology Assessment, No. 20.62.) Chapter 5, Critical appraisal of company submissions.
PDF version of this title (6.7M)

Critical appraisal of company submissions - Immunosuppressive therapy for kidney...
Critical appraisal of company submissions - Immunosuppressive therapy for kidney transplantation in adults: a systematic review and economic model
The HubBLe Trial: haemorrhoidal artery ligation (HAL) versus rubber band ligatio...
The HubBLe Trial: haemorrhoidal artery ligation (HAL) versus rubber band ligation (RBL) for symptomatic second- and third-degree haemorrhoids: a multicentre randomised controlled trial and health-economic evaluation
Factors considered while rating the quality of the evidence - WHO Guidelines on ...
Factors considered while rating the quality of the evidence - WHO Guidelines on the Management of Health Complications from Female Genital Mutilation
Lapatinib and Trastuzumab in Combination with an Aromatase Inhibitor for the Fir...
Lapatinib and Trastuzumab in Combination with an Aromatase Inhibitor for the First-Line Treatment of Metastatic Hormone Receptor-Positive Breast Cancer which Over-Expresses Human Epidermal Growth Factor 2 (HER2): A Systematic Review and Economic Analysis

Your browsing activity is empty.

Activity recording is turned off.

Turn recording back on

Immunosuppressive therapy for kidney transplantation in adults: a systematic review and economic model.

Chapter 5Critical appraisal of company submissions

Astellas’ submission

Overview

Efficacy and effectiveness evidence

Review of economic models and their results in the submission

Economic evaluation by the company

Efficacy data used in the model

Adverse events

Unit costs

Results

Critical appraisal

Novartis’ submission

Costs

Utilities

Results

EVL + reduced CSA vs. TAC + MMF

EVL + reduced CSA vs. MMF + standard-dose CSA

EC-MPS vs. MMF + standard-dose CSA

Critique

Bristol-Myers Squibb’s submission

Efficacy parameter estimates

Costs

Results of Bristol-Myers Squibb’s analyses

Critique

Comparison between the model submissions

Views

In this Page

Other titles in this collection

Recent Activity