U.S. flag

An official website of the United States government

NCBI Bookshelf. A service of the National Library of Medicine, National Institutes of Health.

Walton M, Wade R, Claxton L, et al. Selective internal radiation therapies for unresectable early-, intermediate- or advanced-stage hepatocellular carcinoma: systematic review, network meta-analysis and economic evaluation. Southampton (UK): NIHR Journals Library; 2020 Sep. (Health Technology Assessment, No. 24.48.)

Cover of Selective internal radiation therapies for unresectable early-, intermediate- or advanced-stage hepatocellular carcinoma: systematic review, network meta-analysis and economic evaluation

Selective internal radiation therapies for unresectable early-, intermediate- or advanced-stage hepatocellular carcinoma: systematic review, network meta-analysis and economic evaluation.

Show details

Chapter 3Assessment of clinical effectiveness

Methods for reviewing clinical effectiveness

A systematic review of the clinical effectiveness evidence on SIRT was undertaken following the general principles outlined in the Centre for Reviews and Dissemination (CRD)’s guidance on undertaking systematic reviews15 and reported in accordance with the general principles of the Preferred Reporting Items for Systematic Reviews and Meta-Analyses (PRISMA) statement.16 The research protocol is registered on PROSPERO, the international prospective register of systematic reviews in health and social care (registration number CRD42019128383).

Search strategy

A comprehensive search was undertaken to systematically identify clinical effectiveness and cost-effectiveness literature relating to TheraSphere, SIR-Spheres and QuiremSpheres for HCC. In addition, a search for randomised controlled trials (RCTs) of comparator therapies was undertaken to strengthen the network of evidence on SIRT.

Search strategy for selective internal radiation therapy studies

A search strategy was developed in Ovid MEDLINE by an information specialist (MH), with input from the review team. The strategy consisted of a set of terms for HCC combined with terms for SIRT, and was limited to studies from 2000 onwards. The 2000 date limit was applied as scoping searches had identified controlled studies of SIR-Spheres and TheraSphere published after the year 2000; earlier studies were preliminary uncontrolled studies so have limited value for addressing the decision problem. In addition, clinical advice confirmed that the treatment environment for patients with HCC was different prior to 2000 in terms of comparator treatment options. The searches were not limited by language or study design. The MEDLINE strategy was adapted for use in all other resources searched.

The following databases were searched on 28 January 2019:

  • MEDLINE (all) (via Ovid)
  • EMBASE (via Ovid)
  • Cumulative Index to Nursing and Allied Health Literature (CINAHL) Plus
  • Science Citation Index (via Web of Science)
  • Cochrane Central Register of Controlled Trials (CENTRAL) (via Wiley)
  • Cochrane Database of Systematic Reviews (via Wiley)
  • Database of Abstracts of Reviews of Effects (via CRD databases)
  • Health Technology Assessment (HTA) database (via CRD databases)
  • NHS Economic Evaluation Database (NHS EED) (via CRD databases)
  • EconLit (via Ovid).

In addition, information on studies in progress, unpublished research or research reported in grey literature was sought by searching a range of relevant resources:

  • ClinicalTrials.gov
  • World Health Organization (WHO) International Clinical Trials Registry portal
  • European Union Clinical Trials Register
  • PROSPERO
  • Conference Proceedings Citation Index – Science (via Web of Science)
  • ProQuest Dissertations & Theses A&I (via ProQuest).

A search of the NICE website and NHS Evidence for relevant guidelines was undertaken on 8 May 2019.

Company submissions and relevant systematic reviews were also hand-searched to identify further relevant studies. Clinical advisors were consulted for any additional studies.

Search results were imported into EndNote X9 (Clarivate Analytics, Philadelphia, PA, USA) and de-duplicated. Full search strategies can be found in Appendix 1.

Search strategy for comparator therapies

A search for RCTs of comparator therapies was undertaken to strengthen the network of evidence on SIRT. In view of time and resource limitations, it was decided to identify RCTs of CTTs (i.e. TAE, TACE and DEB-TACE) by searching existing relevant systematic reviews and meta-analyses and undertaking update searches if necessary.

Evidence on systemic therapies for HCC was identified from the recent NICE single technology appraisals of sorafenib,11 lenvatinib12 and regorafenib.13

The search strategy for systematic reviews and meta-analyses of CTTs was developed in Ovid MEDLINE by an information specialist (MH), with input from the review team. The strategy consisted of a set of terms for HCC combined with terms for embolisation or chemoembolisation, and was limited to studies from 2010 onwards to identify the most recent reviews. A search strategy to limit retrieval to systematic reviews or meta-analyses was added in MEDLINE and EMBASE.17 The MEDLINE strategy was adapted for use in all resources searched.

The following databases were searched on 7 May 2019:

  • MEDLINE (all) (via Ovid)
  • EMBASE (via Ovid)
  • Cochrane Database of Systematic Reviews (via Wiley)
  • Database of Abstracts of Reviews of Effects (via CRD databases)
  • HTA database (via CRD databases).

In addition, PROSPERO was searched to identify any unpublished or ongoing systematic reviews or meta-analyses.

Search results were imported into EndNote X9 and de-duplicated. Full search strategies can be found in Appendix 2.

Inclusion criteria

Inclusion criteria were defined in line with the final scope provided by NICE and are outlined below. Studies were initially assessed for relevance using titles and abstracts. One reviewer examined titles and abstracts, with a second reviewer checking 10% of records. Full manuscripts of any titles/abstracts that appeared to be relevant were obtained if possible and the relevance of each study was assessed independently by two reviewers in accordance with the criteria outlined in the following sections. Any discrepancies were resolved through consensus and, if necessary, a third reviewer was consulted. Relevant foreign-language studies were translated and assessed for inclusion in the review. Studies available only as abstracts were included and attempts were made to contact authors for further data.

Study design

Randomised controlled trials were eligible for inclusion in the clinical effectiveness review. However, where RCT evidence was insufficient to address the decision problem, non-randomised comparative studies (including retrospective studies) and non-comparative studies of SIRT were considered for inclusion. The evidence was scoped before deciding what level of evidence would be included for data extraction and quality assessment.

Participants

Studies of people with early-stage HCC in whom curative treatment is contraindicated (BCLC stage A), and with intermediate-stage (BCLC stage B) or advanced-stage (BCLC stage C) HCC, with or without PVT/portal vein invasion (PVI), were included in the review. Studies of people with secondary liver metastases or other types of liver cancer (such as cholangiocarcinoma) were not included unless they also included people with primary HCC, and results were reported separately for people with HCC.

Interventions

The interventions under consideration were the selective internal radiation therapies TheraSphere, SIR-Spheres and QuiremSpheres. Studies in which more than one type of SIRT was used were included only if results were reported separately for the different types of SIRT. Where studies did not state which type of SIRT or radioembolisation technology was used, authors were contacted to identify the specific technology used.

Evidence on combined treatments (e.g. SIRT plus sorafenib) was also considered for inclusion, and evidence was scoped before deciding which trials would be included for data extraction and quality assessment.

Comparators

Relevant comparators were:

  • alternative SIRT interventions (i.e. TheraSphere, SIR-Spheres and QuiremSpheres)
  • conventional transarterial therapies (i.e. TAE, TACE and DEB-TACE)
  • established clinical management without SIRT, such as systemic therapy (i.e. sorafenib, lenvatinib and regorafenib) or BSC, for people for whom any TAE therapies are inappropriate.

To strengthen the network of evidence on SIRT, we considered undertaking comparisons of CTTs (i.e. TAE, TACE and DEB-TACE), systemic therapies (i.e. sorafenib, lenvatinib and regorafenib) and BSC, using RCT evidence. The evidence was scoped and criteria for inclusion were developed. Relevant RCTs were assessed for quality and key outcome data were extracted, based on requirements for the model.

Outcomes

The outcome measures to be considered included:

  • overall survival (OS)
  • progression-free survival (PFS)
  • time to progression (TTP)
  • response rates
  • rates of liver transplant or surgical resection
  • adverse effects of treatment
  • health-related quality of life (HRQoL)
  • time on treatment/number of treatments provided.

Data extraction

Data were extracted by one reviewer using a standardised data extraction form and independently checked for accuracy by a second reviewer. Disagreements were resolved through consensus and, if necessary, a third reviewer was consulted. If multiple publications of the same study were identified, data were extracted and reported as a single study.

Critical appraisal

The methodological quality of the included studies was assessed using criteria relevant to the study design. RCTs were assessed using the most recent version of the Cochrane risk-of-bias tool.18 Quality-assessment tools for other study designs were developed using relevant criteria, such as those outlined in the CRD’s guidance on undertaking systematic reviews.15 Quality assessment was undertaken by one reviewer and independently checked by a second reviewer. Any disagreements were resolved through consensus and, if necessary, a third reviewer was consulted. Details of the quality of the included studies are presented in descriptive tables and their impact on the reliability of results is discussed.

Methods of data analysis/synthesis

Characteristics of the included SIRT studies (such as participant and intervention characteristics, results and trial quality) were tabulated and described in a narrative synthesis. Where sufficient clinically and statistically homogenous data were available, data were pooled using appropriate meta-analytic techniques using WinBUGS software (Medical Research Council Biostatistics Unit, Cambridge, UK). Clinical, methodological and statistical heterogeneity were investigated, with sensitivity or subgroup analyses undertaken where appropriate and where available data permitted.

Where the data allowed, a network meta-analysis (NMA) using Bayesian statistical methods with WinBUGS software was undertaken to estimate the relative effectiveness of the different treatments. Results are summarised using point estimates and 95% credible intervals (CrIs) of the effect of each treatment relative to the reference treatment. Where possible, consistency between direct and indirect estimates of treatment effect in the NMA was assessed. The results of the NMA are described in Chapter 4 of this report and were used in the economic model described in Chapter 7.

Clinical effectiveness results

Quantity and quality of research available

Studies of selective internal radiation therapy

The electronic searches for clinical effectiveness evidence on SIRT interventions (i.e. TheraSphere, SIR-Spheres and QuiremSpheres) identified a total of 4755 records (after de-duplication between databases). The 4755 records were inserted into an EndNote library. Reviewer 1 (RW) screened 2615 titles and abstracts, and reviewer 2 (SS) screened 2617 titles and abstracts. A total of 477 records (10% of the library) were double-screened; discrepancies were resolved through consensus or in consultation with a third reviewer (AE).

Of the 4755 records in the library, 3670 were excluded from the clinical effectiveness review after title and abstract screening as they did not include patients with unresectable HCC, did not assess TheraSphere, SIR-Spheres or QuiremSpheres, did not report relevant patient outcomes or were not a primary study. A total of 1085 records appeared to meet the study selection criteria based on title and abstract (where an abstract was available).

In view of the large number of potentially eligible records, the evidence was scoped before deciding which studies to order for full-paper screening. Records were coded, using titles and abstracts (where available), in terms of the intervention (type of SIRT and whether the study focused on the delivery of SIRT or the work-up procedure), the study design (prospective or retrospective, comparative or not) and the number of HCC patients included in the study. A large number of records were conference/meeting abstracts (n = 603) rather than full publications (n = 482); reviewer 1 (RW) coded the full publications and reviewer 2 (SS) coded the conference/meeting abstracts. Studies marked as a ‘RCT’ (n = 47; 43 full publications and four conference/meeting abstracts) or as ‘prospective comparative’ (n = 26; 18 full publications and eight conference/meeting abstracts) or ‘retrospective comparative’ (n = 103; 61 full publications and 42 conference/meeting abstracts) studies were ordered for full-paper screening as comparative studies (total n = 176) were prioritised over non-comparative studies. However, it was clear that there were no comparative studies of QuiremSpheres; therefore, all studies considered to relate to QuiremSpheres (referring to holmium as the intervention) were ordered for full-paper screening (n = 11). In addition, large non-comparative studies that included > 500 patients were also ordered for full-paper screening (n = 6). One additional non-comparative study, in which BCLC subgroups and subsequent treatments were reported and which was considered to be particularly relevant for the economic model, was ordered. Therefore, a total of 194 records were ordered for full-paper screening.

Of the 194 records ordered, 130 were excluded based on full-paper screening and 64 were considered to be potentially relevant records to be included in the clinical effectiveness review and/or NMA (55 studies plus nine associated publications).

A total of 130 records were coded at the title and abstract stage as systematic reviews. Reviewer 1 (RW) screened systematic reviews from 2015 onwards for relevance; there were 25 relevant systematic reviews (plus one associated erratum). The reference lists of these systematic reviews were screened to check for additional potentially relevant studies; no additional studies were identified.

Separate searches of guideline databases (the NICE website and NHS Evidence), conducted in May 2019, identified a total of 23 records after de-duplication against the original library, none of which were considered relevant for inclusion in the systematic review. The reference lists of relevant guidelines were screened to check for additional potentially relevant studies; no additional studies were identified.

Clinical advisors were not aware of any additional studies other than those already identified from electronic searches.

A PRISMA flow diagram is presented in Figure 1. In total, 27 of the 55 studies were prioritised for data extraction, as they were considered to be the most relevant for the assessment of clinical effectiveness and/or the proposed NMAs; these studies are summarised in Table 3. One non-comparative study was included in the clinical effectiveness review because this was the only study of QuiremSpheres;51 the other 26 studies were comparative studies.

FIGURE 1. The PRISMA flow diagram of the study selection process for the clinical effectiveness review.

FIGURE 1

The PRISMA flow diagram of the study selection process for the clinical effectiveness review.

TABLE 3

TABLE 3

Studies included in the systematic review of clinical effectiveness or considered for the NMA (n = 27)

The 28 lower-priority studies are summarised in Appendix 7 along with the reason for not including them in the systematic review of clinical effectiveness or the proposed NMAs (e.g. consultation with clinical advisors confirmed that the comparators used were not applicable to current UK practice).5255

Thirty-four records were coded at the title and abstract stage as potentially relevant economic studies (seven of which were also coded as includes for the clinical effectiveness review). A separate flow diagram of the study selection process for these economic studies is presented (see Figure 7).

Studies of comparator therapies

Randomised controlled trials of comparator therapies were sought to strengthen the network of evidence on SIRT (see Chapter 5). The search for systematic reviews and meta-analyses of CTTs (TAE, TACE and DEB-TACE) identified 989 records. The records were inserted into an EndNote library and one reviewer (RW) screened the titles and abstracts. Records were put in reverse date order and screened started at the year 2019 and worked backwards until no new relevant RCTs were identified from the reviews and meta-analyses. A total of 319 records were screened, published between 2017 and 2019. Twenty-four of the 319 records were relevant systematic reviews or meta-analyses; full papers were obtained and reference lists were checked for RCTs comparing TAE, TACE or DEB-TACE with each other. Eleven relevant RCTs (reported in 12 publications) were identified, which are summarised in Table 4. In view of the recency of the relevant systematic reviews and meta-analyses and the age of the RCTs of CTTs (published between 1992 and 2016), it was decided that update searches were not necessary.

TABLE 4

TABLE 4

The RCTs of CTTs (n = 11)

Evidence on systemic therapies for HCC was identified from the recent NICE single technology appraisals of sorafenib,11 lenvatinib12 and regorafenib.13

Assessment of clinical effectiveness

This section describes the seven RCTs and seven prospective comparative studies of SIR-Spheres and TheraSphere, the five retrospective comparative studies comparing SIR-Spheres with TheraSphere and the non-comparative case series of QuiremSpheres. The additional seven retrospective comparative studies of SIR-Spheres or TheraSphere (see Table 3) and studies of comparator therapies (see Table 4) that were selected, as they were considered to be potentially relevant for the NMAs, are described in Chapter 5.

Risk of bias

Results of the risk-of-bias judgements are presented in Appendix 5.

The SorAfenib versus Radioembolization in Advanced Hepatocellular Carcinoma (SARAH) and Selective Internal Radiation Therapy Versus Sorafenib in Locally Advanced Hepatocellular Carcinoma (SIRveNIB) RCTs were both rated as having a low overall risk of bias.1921 There were some concerns regarding bias for the trials undertaken by Pitton et al.23 and Kulik et al.28 Concerns related to the randomisation process for the study by Pitton et al.23 There were concerns related to the randomisation process, potential deviations from the intended interventions and measurement of the outcome for the study by Kulik et al.28 The SIRTACE,22 SORAMIC24 and Prospective Randomized study of chEmoeMbolization versus radIoEmbolization for the tReatment of hEpatocellular carcinoma (PREMIERE)2527 trials were all rated as being at a high overall risk of bias; the SIRTACE trial was rated as being at a high risk of bias arising from the randomisation process, missing outcome data and measurement of the outcome,22 the SORAMIC trial was rated as being at a high risk of bias in relation to deviations from the intended interventions as well as some concerns arising from the randomisation process,24 and the PREMIERE trial was rated as being at a high risk of bias arising from the randomisation process and concerns arising from deviations from the intended interventions.2527

The prospective comparative studies were all rated as being at a high risk of bias.3137 In particular, allocation to treatment groups was either inadequately described or inappropriate, resulting in differences in prognostic factors between treatment groups at baseline. Outcome assessors do not appear to have been blinded in any of the prospective comparative studies.

Four of the retrospective comparative studies were rated as being at a high risk of bias.3840,42 The two studies by Biederman et al.38,39 appear to have included many of the same patients, although one of the studies was reported only as a conference abstract, with very limited study details.38 Each of the studies rated as being at a high risk of bias appeared to include patients with different prognostic characteristics at baseline in the two different treatment groups. It was unclear whether or not outcome assessors were blinded in any of the studies. The study by Bhangoo et al.41 was rated as being at an unclear risk of bias; it was unclear whether or not treatment groups were similar at baseline, whether or not outcome assessors were blinded and whether or not missing outcome data were balanced across treatment groups.

The small case series undertaken by Radosa et al.51 should be considered to be at a high risk of bias; it is unclear whether or not patients were representative of all those who would be eligible for SIRT in clinical practice, outcome assessors were not blinded to the participants’ intervention and outcome measures were not consistently assessed.

Efficacy and safety of SIR-Spheres

As discussed in Study design, RCTs were eligible for inclusion in the clinical effectiveness review, with non-randomised comparative studies and non-comparative studies considered for inclusion, in the absence of sufficient RCT evidence. Five RCTs of SIR-Spheres were identified, comparing SIR-Spheres with established therapies available to patients with intermediate (TACE/DEB-TACE) and advanced (sorafenib) HCC. Other studies of SIR-Spheres identified also compared with sorafenib or TACE (see Table 3); therefore, they were not included in the review.

This section focuses on the two large good-quality RCTs (SARAH and SIRveNIB) and also presents a brief summary of the three lower-quality RCTs of SIR-Spheres.

The SARAH and SIRveNIB randomised controlled trials

Two large RCTs compared SIR-Spheres with sorafenib in patients who were not suitable for curative treatments: the SARAH trial was conducted in France19,20 and the SIRveNIB trial was conducted in the Asia-Pacific region.21 Both trials were considered to have a low overall risk of bias (see Appendix 5). Further details of these trials are presented in Table 5.

TABLE 5

TABLE 5

Details of the SARAH and SIRveNIB RCTs

As shown in Table 5, there were methodological differences between the SARAH and the SIRveNIB trials. In the SIRveNIB trial, patients could receive only one SIRT delivery, whereas in the SARAH trial patients could receive more than one delivery of SIRT; 69 out of 184 (37.5%) patients who received SIRT received more than one delivery to either the ipsilateral or the contralateral lobe.

The SARAH trial was conducted in France and the SIRveNIB trial was conducted in the Asia-Pacific region. This has implications for the generalisability of the SIRveNIB trial results to the UK population. HCC in European patients is more likely to be caused by alcohol or hepatitis C, whereas in Asia it is more likely to be caused by hepatitis B. The natural history of these diseases is different. Treatment options are also different, as hepatitis B-related liver disease is often less advanced than in alcohol-related or hepatitis C-related disease; therefore, patients may have had more treatment prior to receiving systemic therapy.

The Sirtex submission stated that patient selection in the SARAH trial did not reflect UK clinical practice, as the trial included patients with a poor survival prognosis who would be considered for only systemic therapy or BSC [e.g. because of a high tumour burden, main PVT or impaired liver function (Child–Pugh class B)]. Therefore, this has implications for the generalisability of the SARAH trial results to the UK population who would be eligible for SIRT in clinical practice.

In both trials, patients were assessed for suitability of SIRT after randomisation. In the SARAH trial, 53 out of 237 (22.4%) patients allocated to SIR-Spheres did not receive SIRT, 26 of whom were treated with sorafenib. In the SIRveNIB trial, 52 out of 182 (28.6%) patients allocated to SIR-Spheres did not receive SIRT, three of whom were treated with sorafenib (where reported; subsequent treatments were not reported for 31/52 patients). Results were presented for both the intention-to-treat (ITT) and the per-protocol populations; patients who did not receive their allocated treatment were excluded from the per-protocol analysis (those who received sorafenib instead of SIRT were not included in the sorafenib arm in the per-protocol analysis).

The SARAH and SIRveNIB trial publications reported baseline characteristics for both the ITT and the per-protocol populations.19,21 The SIR-Spheres and sorafenib groups were generally similar at baseline in the ITT populations (see Table 5). However, in the per-protocol population, patients in the sorafenib arm appeared to have slightly worse disease characteristics than those in the SIR-Spheres arm in the SARAH trial (BCLC stage C: 69.4% vs. 65.5%; Child–Pugh class B7: 14.6% vs. 11.5%; median tumour burden: 20% vs. 12.5%, respectively) and in the SIRveNIB trial (BCLC stage C: 45.1% vs. 38.5%; PVT: 29.6% vs. 23.1%; tumour size > 50% of liver: 21.6% vs. 17.7%, respectively).

Overall survival

Neither trial found a statistically significant difference in OS between SIR-Spheres and sorafenib in either the ITT or the per-protocol analyses, as shown in Table 5.

Both trials undertook subgroup analyses according to baseline characteristics. The SIRveNIB trial reported a statistically significant difference in OS favouring SIR-Spheres in the subgroup of patients with BCLC stage C disease in the per-protocol analysis [median 9.2 vs. 5.8 months, hazard ratio (HR) 0.67, 95% confidence interval (CI) 0.4 to 1.0; p = 0.0475]. The SARAH trial demonstrated a statistically significant difference in OS favouring sorafenib in the subgroup of patients with complete occlusion in the main portal vein in the per-protocol analysis (HR 2.44, 95% CI 1.01 to 5.88); however, the number of patients included in this subgroup analysis was very small, so the result should be interpreted with caution.

Progression-free survival

In the SARAH trial, PFS was defined as the time from the closest date of radiological examination before first administration of study treatment to disease progression, in accordance with Response Evaluation Criteria in Solid Tumours (RECIST) 1.1 criteria,68 or death. In the SIRveNIB trial, PFS was defined as the time from the date of randomisation to tumour progression at any site in the body, or death, whichever is earlier. Tumour progression was assessed in accordance with RECIST 1.1 criteria.68

Progression-free survival was not statistically significantly different between treatment groups in the ITT analyses of either the SARAH or the SIRveNIB trials. However, in the SIRveNIB trial, PFS was statistically significantly improved with SIR-Spheres in the per-protocol analysis (HR 0.73, 95% CI 0.6 to 0.9; p = 0.0128).

Tumour response rate

Tumour response was statistically significantly greater in the SIR-Spheres arm than in the sorafenib arm in both the SARAH and the SIRveNIB trials (SARAH: 19% vs. 12%, p = 0.0421; SIRveNIB: 16.5% vs. 1.7%, p < 0.001). However, in the SARAH trial, only 190 SIR-Spheres patients and 198 sorafenib patients were evaluable and included in the analysis.

Rate of liver transplantation or resection

A very small proportion of patients in both treatment arms of the SARAH and the SIRveNIB trials went on to have subsequent liver transplantation (< 1%), liver surgery (0.6–1.3%) or tumour ablation (0.5–2.5%).

Quality of life

The SARAH trial reported statistically significantly better HRQoL in the SIR-Spheres treatment group than in the sorafenib group for both the ITT and the per-protocol populations, assessed using the European Organisation for Research and Treatment of Cancer Quality of Life Questionnaire (EORTC QLQ)-C30. However, the proportion of patients who completed questionnaires was 71% in the SIR-Spheres group (169/237) and 84% (186/222) in the sorafenib group at baseline, reducing with time to only 29% (26/90 patients at risk) in the SIR-Spheres group and 32% (29/92 patients at risk) in the sorafenib group at 12-month follow-up. There was no statistically significant difference in HRQoL between the treatment groups in the SIRveNIB trial, assessed using the EuroQol-5 Dimensions (EQ-5D) index.

Adverse events

The proportion of patients reporting at least one treatment-related adverse event (TRAE) and the proportion reporting at least one grade ≥ 3 adverse event (AE) was higher in the sorafenib group than in the SIR-Spheres group in both trials, as shown in Table 5.

In the SARAH trial, the most frequent grade ≥ 3 AEs were fatigue (SIR-Spheres 9% vs. sorafenib 19%), liver dysfunction (11% vs. 13%), increased laboratory liver values (9% vs. 7%), haematological abnormalities (10% vs. 14%), diarrhoea (1% vs. 14%), abdominal pain (3% vs. 6%), increased creatinine (2% vs. 6%) and hand–foot skin reaction (< 1% vs. 6%).

In the SIRveNIB trial, the most frequent grade ≥ 3 AEs of interest were anaemia (SIR-Spheres 0% vs. sorafenib 2.5%), fatigue (0% vs. 3.7%), diarrhoea (0% vs. 3.7%), abdominal pain (2.3% vs. 1.2%), ascites (3.8% vs. 2.5%), hypertension (0% vs. 1.2%), upper gastrointestinal haemorrhage (0.8% vs. 1.9%), jaundice (0.8% vs. 1.2%), radiation hepatitis (1.5% vs. 0%) and hand–foot skin reaction (0% vs. 16.7%).

The AE profiles of SIRT and sorafenib are very different. Sorafenib is a continuous treatment, whereas most patients receive only one delivery of SIRT [37.5% patients in the SARAH trial received more than one delivery, either to the ipsilateral or to the contralateral lobe (primarily because of bilobar tumours or a large central tumour requiring bilateral treatment), whereas in the SIRveNIB trial patients received only one delivery]. AE rates were not reported separately for patients who received more than one delivery of SIRT; therefore, it is not possible to compare AE rates for patients who received one delivery with those who received more than one delivery. In the SARAH trial, patients with bilobar tumours received the first treatment in the hemiliver with the greatest tumour burden, and treatment of the contralateral hemiliver was scheduled 30–60 days after the first treatment. No patient had a whole-liver treatment approach in one session. Clinical advisors confirmed that this is reflective of their experience; patients would not receive whole-liver treatment in one session to reduce the risk of radioembolisation-induced liver disease (REILD). However, the Sirtex submission states that SIR-Spheres can be administered to both lobes of the liver during the same procedure [based on observational data in which 95.9% patients in the European Network on Radioembolisation with Yttrium-90 Resin Microspheres (ENRY) register received whole-liver treatments in a single session69]; neither the SARAH trial nor the SIRveNIB trial administered SIR-Spheres to both lobes during the same procedure. This variance is probably because of the clinical indication for SIRT; the ENRY register is likely to include a majority of patients with colorectal cancer liver metastases, who do not have underlying cirrhosis, whereas in HCC patients the cirrhotic liver is likely to be more susceptible to REILD.

A relatively large proportion of patients who undergo work-up for SIRT, to assess their suitability for the procedure, are unable to receive SIRT (e.g. owing to liver-to-lung shunting or unfavourable hepatic arterial anatomy) [42/226 (18.6%) in SARAH and 37/182 (20.3%) in SIRveNIB]. The work-up of patients who are unable to undergo SIRT delivery has cost implications.

The SARAH randomised controlled trial subgroup analysis (low tumour burden/low albumin–bilirubin grade)

The Sirtex company submission selected a subgroup of patients from the SARAH trial with ≤ 25% tumour burden and albumin–bilirubin (ALBI) 1 for their base-case analysis in the economic model; the company stated that these patients are considered the most appropriate candidates for SIR-Spheres in clinical practice, as they are the most likely to benefit from SIRT. This is not a clinically recognised subgroup and was based on a post hoc analysis; therefore, these results should be prospectively validated before being considered relevant for clinical practice.

This subgroup included 37 (16%) patients in the SIRT group and 48 (22%) patients in the sorafenib group; 92% of those allocated to SIRT received treatment after work-up. Baseline characteristics were relatively well balanced between treatment groups, although more patients in the SIRT arm had BCLC stage B disease, single tumours and received previous TACE (these patients generally have a better prognosis than patients who are diagnosed at a later stage and are not eligible for TACE) than in the sorafenib arm. More patients in the sorafenib arm had an ECOG performance status of 0 and unilobar liver involvement. Table 6 presents the baseline characteristics and results for the full ITT population and the low tumour burden/low ALBI grade subgroup of the SARAH trial.

TABLE 6

TABLE 6

Details of the ITT population and the low tumour burden/low ALBI grade subgroup of SARAH

As shown in Table 6, median OS and PFS appeared to be better in the SIR-Spheres arm than in the sorafenib arm in the post hoc subgroup analysis, although the difference between treatment groups was not statistically significant. The proportion of patients who went on to have potentially curative therapy was higher in the SIR-Spheres arm than in the sorafenib arm, although numbers were very low (five and one patients, respectively). Tumour response rate, HRQoL and AEs were not reported separately for the low tumour burden/low ALBI grade subgroup.

Prespecified and post hoc subgroup analysis results were presented in the SARAH trial publication for OS.19 Tumour burden was included as a post hoc subgroup. However, neither the ALBI grade nor the combination of low tumour burden and low ALBI grade was presented.

The SIRveNIB trial did not report subgroup analysis results for the subgroup of low tumour burden/low ALBI grade patients. However, ALBI grade was included in the OS subgroup analysis. Results favoured SIR-Spheres in the subgroup of ALBI 1 patients (HR 0.89, 95% CI 0.6 to 1.4; p = 0.58), whereas results favoured sorafenib for the subgroup of patients with ALBI 2/3 (HR 1.24, 95% CI 0.9 to 1.7; p = 0.14).

Other randomised controlled trials of SIR-Spheres

The SIRTACE is a small RCT rated as being at a high risk of bias that compared SIR-Spheres (n = 13) with TACE (n = 15) in patients with unresectable HCC without portal vein occlusion.22 A higher proportion of patients in the SIRT group had BCLC stage A disease (38.5% vs. 26.7%) and Child–Pugh liver function class A (92.3% vs. 86.7%) than in the TACE group. The average number of tumour nodules was higher in the TACE group (5.0 vs. 3.5). Therefore, patients in the SIR-Spheres treatment arm had a better prognosis than those in the TACE arm.

At 6 months, 69.2% of SIRT patients and 86.7% of TACE patients were still alive. At 12 months, 46.2% of SIRT patients and 66.7% of TACE patients were still alive. PFS, disease control rate and the proportion of patients who went on to have potentially curative therapy were similar between treatment groups. The proportion of patients with a partial response was higher in the SIRT group than in the TACE group (30.8% vs. 13.3%), although patient numbers were very small.

There were no statistically significant differences between treatment groups in HRQoL by week 12, despite Functional Assessment of Cancer Therapy Hepatobiliary-Pancreatic Symptom Index (FACT-Hep) scores being lower in the SIRT group at baseline (indicating lower quality of life). However, 10 out of 28 patients had missing baseline data and were excluded from HRQoL analyses. The proportion of patients reporting TRAEs was higher in the TACE group than in the SIRT group (33.3% vs. 23.1%), although the proportion of patients reporting at least one AE was higher in the SIRT group (92.3% vs. 66.7%), as was the number of patients with grade ≥ 3 AEs (three vs. two patients) and serious AEs requiring hospitalisation (seven vs. five patients).

A small RCT by Pitton et al.,23 with some concerns regarding bias, compared SIR-Spheres (n = 12) with DEB-TACE (n = 12) in patients with unresectable intermediate (BCLC stage B) HCC with preserved liver function (Child–Pugh class A–B7). Treatment groups appeared reasonably similar at baseline, although more patients in the SIRT group had received prior local ablation (four vs. one) and more patients in the DEB-TACE group had received prior resection (five vs. three). Median OS and PFS were longer in the DEB-TACE arm than in the SIR-Spheres arm (788 days vs. 592 days and 216 days vs. 180 days, respectively), although the difference between groups was not statistically significant. Median TTP was 371 days in the SIRT arm and 336 days in the DEB-TACE arm. AEs were not reported.

The SORAMIC RCT compared SIR-Spheres followed by sorafenib with sorafenib alone in patients with unresectable intermediate or advanced (BCLC stage B or C) HCC with preserved liver function (Child–Pugh class ≤ B7) and ECOG performance status of < 2, who were poor candidates for TACE. Only safety and tolerability data for the first 40 patients have been published to date, rated as being at a high risk of bias.24 More patients in the sorafenib-alone group had PVT (35% vs. 15%) and BCLC stage C disease (70% vs. 60%), indicating poorer prognosis in this group. There were 196 treatment-emergent AEs reported in the SIRT plus sorafenib arm and 222 events in the sorafenib-alone arm, of which 21.9% and 21.2%, respectively, were considered to be grade ≥ 3. The most common grade 3 or 4 AEs (hypertension, hand–foot skin reaction and diarrhoea) were reported in a similar number of patients in both treatment arms. Grade 3 or 4 fatigue appeared more common in patients receiving SIRT plus sorafenib (20% vs. 10%). Grade 3 or 4 infection and anorexia appeared more common in patients receiving sorafenib alone (20% vs. 5% and 0% vs. 10%, respectively). Grade 3 or 4 laboratory-related events were more common in patients receiving sorafenib alone (elevated gamma-glutamyltransferase level 45% vs. 30%, elevated aspartate aminotransferase level 15% vs. 0% and elevated alanine aminotransferase level 10% vs. 0%). One patient experienced a grade 3 gastric ulcer that was probably (but not proven to be) related to SIRT microspheres deposition.

Further details of each of these trials are presented in Appendix 6.

Ongoing studies

There are three ongoing studies of SIR-Spheres including patients with HCC: the Austrian Cardiovascular and Interventional Radiological Society of Europe Registry for SIR-Spheres Therapy (CIRT),70 the RESIN tumour registry in the USA71 and the RESIN tumour registry in Taiwan.72 The CIRT study was completed in January 2020, the RESIN tumour registry study in the USA is due to be completed in August 2022 and the RESIN tumour registry study in Taiwan was due to be completed in December 2019.

There is also an ongoing individual patient data prospective meta-analysis of patients from the SIRveNIB and SARAH trials: VESPRO.73

Efficacy and safety of TheraSphere

As discussed in Study design, RCTs were eligible for inclusion in the clinical effectiveness review. Non-randomised comparative studies (including retrospective studies) and non-comparative studies were considered for inclusion in the absence of sufficient RCT evidence. Only two small RCTs of TheraSphere were identified. Therefore, prospective non-randomised comparative studies were also included in the clinical effectiveness review; seven non-RCTs were included, most of which compared TheraSphere with TACE/DEB-TACE. The retrospective comparative studies of TheraSphere that were identified also compared against TACE/DEB-TACE (see Table 3); therefore, they were not included in the review as they were considered to be lower quality than the prospective comparative studies.

One small RCT rated as being at a high risk of bias (PREMIERE) compared TheraSphere (n = 24) with TACE (n = 21) as a bridge to transplant in patients with BCLC stage A or B unresectable HCC with no vascular invasion and Child–Pugh liver function class A or B.2527 The proportion of patients with Child–Pugh class A was much higher in the TACE arm than in the TheraSphere arm (71% vs. 50%) and the proportion of patients with portal hypertension was much lower in the TACE arm (52% vs. 83%), suggesting better prognosis in the TACE arm. OS was slightly longer in the TheraSphere arm (18.6 months vs. 17.7 months) and the rate of liver transplant/resection was also higher in the TheraSphere arm (87% vs. 70% of ‘listed patients’), although time to transplant/resection was slightly longer in the TheraSphere arm (8.8 months vs. 7.6 months). TTP was significantly longer in the TheraSphere arm: overall median TTP was not reached in the TheraSphere arm (> 26 months) and was 6.8 months in the TACE arm (HR 0.112, 95% CI 0.027 to 0.557; p = 0.007); TTP in the non-transplanted patients was also significantly longer in the TheraSphere arm (median > 26 months vs. 4.8 months). AEs and HRQoL were not reported.

One small RCT by Kulik et al.,2830 which caused some concerns regarding bias, compared TheraSphere plus sorafenib (n = 10) with sorafenib alone (n = 10) as a bridge to transplant in patients with Child–Pugh liver function class ≤ B8 HCC who were potential candidates for liver transplant. A higher proportion of patients in the TheraSphere plus sorafenib arm were male (80% vs. 50%) and had BCLC stage A disease (70% vs. 50%), with more patients in the TheraSphere-alone arm having BCLC stage C disease (40% vs. 20%). More patients in the TheraSphere plus sorafenib arm had ECOG performance status 0 (80% vs. 60%) and Child–Pugh liver function class A (80% vs. 60%). Three patients died in the TheraSphere arm, compared with two patients in the TheraSphere plus sorafenib arm. The proportion of patients receiving liver transplant or resection was 90% in each treatment arm. Most AEs were more common in the TheraSphere-alone arm (fatigue 90% vs. 40%, diarrhoea 20% vs. 10%, pain 50% vs. 0%, nausea 70% vs. 20% and vomiting 20% vs. 0%), although grade ≥ 3 hand–foot skin reaction was more common in the TheraSphere plus sorafenib arm (20% vs. 0%).

Five prospective comparative studies, all rated as being at a high risk of bias, compared TheraSphere with TACE/DEB-TACE in patients with HCC.3135 Two studies assessed OS. In one small study (n = 86), OS appeared slightly longer with TACE than with TheraSphere in patients with intermediate-stage disease (median 18 months vs. 16.4 months).32 In a much larger study (n = 765) in which survival outcomes were stratified by BCLC stage and Child–Pugh liver function class, survival was longer in the TACE arm for patients with early- and intermediate-stage disease but longer in the TheraSphere arm for patients with advanced-stage disease.35 Two small studies (n = 86 and n = 96) assessed TTP, which was longer with TheraSphere than with TACE (median 13.3 months vs. 6.8 months and median 13.3 months vs. 8.4 months).32,34 Two small studies (n = 67 and n = 86) assessed complete or partial response rate; results were conflicting, with one study31 favouring TACE (2.3% vs. 0%, using RECIST criteria68) and the other32 favouring TheraSphere (75% vs. 50%, using modified RECIST criteria68). Two small studies (n = 67 and n = 56) assessed HRQoL, both favouring TheraSphere.31,33 Only one study (n = 86) reported AEs; the most commonly reported AE (unspecific abdominal pain) was more frequent in TACE patients than in SIRT patients (83% vs. 5%).32

One small prospective matched case–control study by Maccauro et al.,36 rated as being at a high risk of bias, compared TheraSphere plus sorafenib (n = 15) with TheraSphere alone (n = 30) in patients with predominantly BCLC stage C (due to PVT) unresectable HCC with Child–Pugh liver function class A. The study was published only as a conference abstract; therefore, very limited data are available. Results were similar between treatment groups for OS (median 10 months in each treatment arm), PFS (median 6 months vs. 7 months in the TheraSphere plus sorafenib and TheraSphere-alone arms, respectively) and response rate, using modified RECIST criteria68 (45.5% vs. 42.8%). However, response rate using EASL criteria1 was better in the TheraSphere-alone arm (40% vs. 10%).

One small prospective comparative study by Woodall et al.,37 rated as being at a high risk of bias, compared TheraSphere in HCC patients without PVT (n = 20) with TheraSphere in HCC patients with PVT (n = 15) and a no-treatment control (BSC) in HCC patients who were not eligible for SIRT owing to substantial extrahepatic disease or hepatopulmonary shunt or underlying liver insufficiency (n = 17). OS was significantly longer in patients without PVT who received TheraSphere (median 13.9 months) than in patients with PVT who received TheraSphere (median 3.2 months) and patients who received BSC (median 5.2 months). AEs were more common in TheraSphere patients who had PVT than in those who did not have PVT (33% vs. 25%). No other outcomes were reported.

Further details of each of these studies are presented in Appendix 6.

Ongoing studies

There is one ongoing RCT of TheraSphere in patients with HCC: STOP-HCC, which has an estimated study completion date of February 2020; final results are not anticipated before at least December 2020.74

The BTG submission presents 12 additional ongoing or planned studies of TheraSphere.

Efficacy and safety of QuiremSpheres

Only one study of QuiremSpheres has been completed in patients with HCC: a small case series undertaken by Radosa et al.51 Nine patients with HCC were retrospectively identified from a prospectively maintained database of patients who received QuiremSpheres between March 2017 and April 2018 at a single centre. It is unclear whether or not patients were representative of all those who would be eligible for SIRT in clinical practice. The available data are too limited to draw any conclusions about the safety or efficacy of QuiremSpheres. Study details are presented in Appendix 6.

Ongoing studies

There are three ongoing studies of QuiremSpheres including patients with HCC: HEPAR Primary,75 HORA EST HCC76 and Hope166.77 All three studies are currently recruiting patients.

Direct comparisons of different selective internal radiation therapies

Five small retrospective comparative studies, all rated as being at a high or unclear risk of bias, compared SIR-Spheres with TheraSphere. No studies were identified that directly compared QuiremSpheres with either SIR-Spheres or TheraSphere. Further details of each of the five studies are presented in Appendix 6. The two studies by Biederman et al. (n = 9738 and n = 9039) included patients who all had PVT and appear to have included some of the same patients, although one of the studies was published only as a conference abstract,38 so it is unclear how much overlap there was. The study by d’Abadie et al.42 (n = 58 procedures) aimed to investigate the difference in efficacy per Gy of resin versus glass spheres and whether or not the difference could result from the different degrees of heterogeneity in sphere distribution; limited patient outcomes were reported.

Overall survival was reported in four studies [n = 97,38 n = 90 (possibly with some overlap),39 n = 7740 and n = 1741]. OS was longer in the TheraSphere arm in three of the studies,38,39,41 two of which included patients who all had PVT.38,39 Median OS in the SIR-Spheres arm ranged from 3.7 to 7.7 months. Median OS in the TheraSphere arm ranged from 7.0 to 15 months.

Progression-free survival was reported in only one study (n = 77), in which it was longer in the SIR-Spheres arm (6.1 months vs. 5.0 months).40 However, TTP was reported for the two treatment arms separately in one other study (n = 90 patients with PVT), in which it was longer in the TheraSphere arm (5.9 months vs. 2.8 months).39

Tumour response rate was reported for the two treatment arms separately in only one study (n = 90 patients with PVT), in which a higher proportion of evaluable patients had a complete (8.8% vs. 0%) or partial (31.6% vs. 13.3%) response in the TheraSphere arm.39

None of the studies reported HRQoL outcomes.

Adverse events were reported separately for the two treatment arms in two studies. The study by Biederman et al.39 (n = 90 patients with PVT) reported no significant difference in pain (41.2% vs. 30.8%), fatigue (17.6% vs. 18.5%), nausea (17.6% vs. 3.1%) or anorexia (0% vs. 9.2%) between the SIR-Spheres and TheraSphere arms, respectively. In the very small study by Bhangoo et al.41 (n = 17), all clinical toxicities reported were more frequent in the SIR-Spheres arm than in the TheraSphere arm (fatigue 67% vs. 45%, abdominal pain 33% vs. 27%, nausea/vomiting 67% vs. 55%, anorexia/weight loss 33% vs. 9%, diarrhoea 17% vs. 0% and gastric ulcer 17% vs. 0%).

An addendum, in the form of an academic-in-confidence manuscript, was received from Terumo Europe NV (hereafter Terumo) in August 2019. The manuscript described a retrospective pilot study of (confidential information has been removed) patients treated with QuiremSpheres, TheraSphere or SIR-Spheres at two centres in Germany and the Netherlands. OS and response were assessed at 6 months for all three interventions and at 12 months for QuiremSpheres and SIR-Spheres. Median OS was similar between the treatment groups at 6 months (confidential information has been removed) and 12 months (confidential information has been removed). The most commonly reported AEs were (confidential information has been removed) abdominal pain, fatigue and nausea; other AEs were rarely reported. This was a very small pilot study with unclear patient selection; patients in the TheraSphere group had poorer prognosis at baseline than did the other two treatment groups. The authors acknowledge that the study carries several methodological limitations.78

Clinical effectiveness summary and conclusions

SIR-Spheres

There are two large good-quality RCTs comparing SIR-Spheres with sorafenib (SARAH19,20 and SIRveNIB21).

There was no statistically significant difference in OS (HR 1.15, 95% CI 0.94 to 1.41 SARAH, and HR 1.12, 95% CI 0.9 to 1.4 SIRveNIB) or PFS (HR 1.03, 95% CI 0.85 to 1.25 SARAH, and HR 0.89, 95% CI 0.7 to 1.1 SIRveNIB) in the SARAH or SIRveNIB trials in the ITT populations. However, tumour response rate was significantly greater in the SIR-Spheres arm than in the sorafenib arm in both trials (of patients who were evaluable and included in the analyses). The SARAH trial reported significantly better HRQoL in the SIR-Spheres arm than in the sorafenib arm, assessed using the EORTC QLQ-C30, although the proportion of patients who completed the questionnaires was low, particularly at later time points. The SIRveNIB trial found no significant difference in HRQoL assessed using the EQ-5D index. The AE profiles of SIR-Spheres and sorafenib are very different, although the most common AEs generally occurred more frequently in the sorafenib arm in both trials.

There are some concerns regarding the generalisability of the SARAH and SIRveNIB trials to patients who would be eligible for SIRT in UK practice. The SIRveNIB trial was conducted in the Asia-Pacific region, where the aetiology of HCC differs from that in European patients; HCC is predominantly caused by hepatitis B in Asia, whereas it is predominantly caused by alcohol or hepatitis C in Europe. The SARAH trial included patients with a poorer prognosis than those who would be considered for SIRT in UK practice (e.g. high tumour burden, main PVT or impaired liver function).

Around one-fifth of patients in the SARAH and SIRveNIB trials were not suitable for SIRT after work-up (e.g. due to liver-to-lung shunting or unfavourable hepatic arterial anatomy); a proportion of patients assessed for suitability for SIRT in clinical practice would also be considered unsuitable, with associated cost implications.

Patients with bilobar disease may require more than one administration of SIRT. In the SARAH trial, patients with bilobar tumours received the first treatment in the hemiliver with the greatest tumour burden, and treatment of the contralateral hemiliver was scheduled 30–60 days after the first treatment. However, the Sirtex submission states that SIR-Spheres can be administered to both lobes of the liver during the same procedure; neither the SARAH trial nor the SIRveNIB trial administered SIR-Spheres to both lobes during the same procedure. Clinical advisors confirmed that this is reflective of their experience, in which patients would not receive whole-liver treatment in one session to reduce the risk of REILD.

The Sirtex company submission selected a subgroup of patients from the SARAH trial with ≤ 25% tumour burden and ALBI 1 for its base-case analysis in the economic model; the company stated that these patients are considered the most appropriate candidates for SIR-Spheres in clinical practice, as they are the most likely to benefit from SIRT. This is not a clinically recognised subgroup and was based on a post hoc analysis; therefore, these results should be prospectively validated before being considered relevant to clinical practice. Median OS (HR 0.73, 95% CI 0.44 to 1.21) and PFS (HR 0.65, 95% CI 0.41 to 1.02) appeared better in the SIR-Spheres arm than in the sorafenib arm in the subgroup analysis, although the difference between treatment groups was not statistically significant. The proportion of patients who went on to have potentially curative therapy was higher in the SIR-Spheres arm than in the sorafenib arm, although numbers were very low (five and one patients, respectively).

Three very small poorer-quality RCTs compared SIR-Spheres with TACE,22 DEB-TACE23 or SIR-Spheres plus sorafenib versus sorafenib alone.24 The trials comparing SIR-Spheres with TACE or DEB-TACE appeared to favour the chemoembolisation procedure over SIRT in terms of survival outcomes.22,23 The addition of SIR-Spheres to sorafenib did not appear to increase the number of treatment-emergent AEs.24

TheraSphere

Two small RCTs2530 and seven prospective comparative studies3137 of TheraSphere were included in the clinical effectiveness review; one of the RCTs (PREMIERE) and all of the non-RCT studies were rated as being at a high risk of bias, and the other RCT caused some concerns regarding bias. Therefore, all of these results should be interpreted with caution.

Both RCTs assessed TheraSphere as a bridge to transplant. The PREMIERE RCT reported longer TTP, a higher proportion of patients undergoing transplant and slightly longer OS in the TheraSphere arm than in the TACE arm.2527 Kulik et al.2830 reported similar survival and transplant/resection rates between patients receiving TheraSphere plus sorafenib or sorafenib alone.

Five prospective comparative studies compared TheraSphere with TACE or DEB-TACE; OS appeared better with TheraSphere in patients with early- and intermediate-stage disease.32,35 TTP was longer with TheraSphere than with TACE.32,34 Results relating to response rates were conflicting.31,32 HRQoL appeared better with TheraSphere.31,33 One study reported that the most common AE was more frequent with TACE than with SIRT.32

One prospective comparative study compared TheraSphere plus sorafenib with TheraSphere alone, with similar results between treatment groups.36 The other study compared TheraSphere in patients with or without PVT with no treatment in patients unsuitable for TheraSphere; OS was significantly longer in patients without PVT who received TheraSphere compared with those with PVT who received TheraSphere and those who received only BSC.37

QuiremSpheres

Only one study of QuiremSpheres has been completed in patients with HCC: a small case series undertaken by Radosa et al.51 The available data are too limited to draw any conclusions about the safety or efficacy of QuiremSpheres.

Direct comparison of different selective internal radiation therapies

Five small retrospective comparative studies, all rated as being at a high or unclear risk of bias, compared SIR-Spheres with TheraSphere. Two of the studies included patients who all had PVT and appear to have included some of the same patients.38,39 OS was reported in four studies, including the two studies of patients with PVT; OS was longer in the TheraSphere arm in three of the studies.38,39,41 One study assessed PFS, which was longer with SIR-Spheres,40 and another study assessed TTP, which was longer with TheraSphere (in patients with PVT).39 Tumour response rate was higher in the TheraSphere arm than in the SIR-Spheres arm in patients with PVT.39 One very small study reported more frequent clinical toxicities in the SIR-Spheres arm than in the TheraSphere arm.41 In patients with PVT, there was no difference in the frequency of fatigue, but pain and nausea appeared more frequent with SIR-Spheres, and anorexia appeared more frequent with TheraSphere.39

No studies were identified that directly compared QuiremSpheres with either SIR-Spheres or TheraSphere.

The BTG submission described a systematic review by Kallini et al.,79 supported by funding from BTG, which aimed to compare the AE profiles of TheraSphere and SIR-Spheres for the treatment of unresectable HCC. Twenty-two observational studies of TheraSphere and nine observational studies of SIR-Spheres were included in the review and the number of AEs and number of patients across studies were summed to calculate the proportion of patients experiencing each AE. No studies directly comparing TheraSphere with SIR-Spheres were included in the review. AE reporting appears to have been variable between studies, with many AEs being reported by very few of the included studies (e.g. hepatobiliary and respiratory AEs). Baseline characteristics of patients were poorly reported in many of the included studies. Gastric ulcers were reported more frequently with SIR-Spheres than with TheraSphere [3.1% (six studies) vs. 0.1% (nine studies)], but the proportion of patients reporting ascites was higher with TheraSphere than with SIR-Spheres [9.2% (10 studies) vs. 4.7% (5 studies)]. Nausea (13 studies in total), fatigue (16 studies in total) and abdominal pain (18 studies in total) occurred in similar proportions of patients for both interventions.79

An addendum, in the form of an academic-in-confidence manuscript, was received from Terumo in August 2019. OS and response were similar between the treatment groups. The most commonly reported AEs were (confidential information has been removed) abdominal pain, fatigue and nausea; other AEs were rarely reported. This was a very small pilot study with several methodological limitations.78

Conclusions

There is a large body of evidence on the clinical effectiveness and safety of SIRT compared with sorafenib or TACE. Only two studies were considered to have a low risk of bias: SARAH19,20 and SIRveNIB,21 which both compared SIR-Spheres with sorafenib. However, there are some concerns regarding the generalisability of the results of these two RCTs to the UK HCC population, particularly the SIRveNIB trial, which was conducted in the Asia-Pacific region where the aetiology of HCC differs from that in Europe.

Both RCTs found no significant difference in OS or PFS between SIR-Spheres and sorafenib, despite statistically significantly greater tumour response rate in the SIR-Spheres arm of both trials. The SARAH trial reported a significant difference between groups in HRQoL, favouring SIR-Spheres; however, the proportion of patients who completed the questionnaires was low. AEs, particularly grade ≥ 3 events, were more frequent in the sorafenib group in both trials.

The Sirtex company submission selected a subgroup of patients from the SARAH trial with ≤ 25% tumour burden and ALBI 1 for its base-case analysis in the economic model. Although results appeared more promising in this subgroup of patients with a better prognosis, these post hoc subgroup analysis results should be prospectively validated before being considered relevant to clinical practice.

In studies comparing the different SIRT, patients with PVT appeared to have better survival outcomes with TheraSphere than with SIR-Spheres; however, this result was from a small retrospective comparative study rated as being at a high risk of bias and, therefore, may not be reliable. Other studies comparing TheraSphere with SIR-Spheres that did not include only patients with PVT had conflicting results. The only study that compared QuiremSpheres with SIR-Spheres and TheraSphere was provided by Terumo as an addendum in August 2019. Clinical outcomes appeared to be similar between treatment groups; however, this was a very small pilot study with several methodological limitations.

Image 17-109-19-fig7
Copyright © Queen’s Printer and Controller of HMSO 2020. This work was produced by Walton et al. under the terms of a commissioning contract issued by the Secretary of State for Health and Social Care. This issue may be freely reproduced for the purposes of private research and study and extracts (or indeed, the full report) may be included in professional journals provided that suitable acknowledgement is made and the reproduction is not associated with any form of advertising. Applications for commercial reproduction should be addressed to: NIHR Journals Library, National Institute for Health Research, Evaluation, Trials and Studies Coordinating Centre, Alpha House, University of Southampton Science Park, Southampton SO16 7NS, UK.
Bookshelf ID: NBK562671

Views

  • PubReader
  • Print View
  • Cite this Page
  • PDF version of this title (3.2M)

Other titles in this collection

Recent Activity

Your browsing activity is empty.

Activity recording is turned off.

Turn recording back on

See more...