U.S. flag

An official website of the United States government

NCBI Bookshelf. A service of the National Library of Medicine, National Institutes of Health.

Chou R, Buckley D, Fu R, et al. Emerging Approaches to Diagnosis and Treatment of Non–Muscle-Invasive Bladder Cancer [Internet]. Rockville (MD): Agency for Healthcare Research and Quality (US); 2015 Oct. (Comparative Effectiveness Reviews, No. 153.)

Cover of Emerging Approaches to Diagnosis and Treatment of Non–Muscle-Invasive Bladder Cancer

Emerging Approaches to Diagnosis and Treatment of Non–Muscle-Invasive Bladder Cancer [Internet].

Show details

Executive Summary

Background

Bladder cancer is the 4th most commonly diagnosed cancer in men and the 10th most commonly diagnosed cancer in women in the United States.1 The American Cancer Society estimated in 2014 that there would be 74,690 new cases of bladder cancer in the United States that year and about 15,580 deaths due to bladder cancer.1 Bladder cancer occurs primarily in men age 60 and older, and roughly twice as frequently in white compared with black men.2 Bladder cancer is an important health problem, with no improvement in associated mortality since 1975.3 Economic analyses have shown bladder cancer to be the costliest cancer to treat on a per capita basis.4 The most common risk factor for bladder cancer is cigarette smoking; other risk factors include occupational exposures and family history.

Bladder cancer is staged based on the extent of penetration or invasion into the bladder wall and adjacent structures.5 Bladder cancers that have not invaded the bladder smooth muscle layer—stage classifications Tis (carcinoma in situ), Ta (noninvasive papillary carcinoma), and T1 (cancer that invades the subepithelial connective tissue) —are broadly grouped as non–muscle-invasive bladder cancer (NMIBC). Stage T2 cancers are muscle invasive, and higher stage cancers invade beyond the muscle layer into surrounding fat (stage classification T3 bladder cancer) or beyond the fat into nearby organs or structures (stage classification T4 bladder cancer). Approximately 75 percent of newly diagnosed bladder cancers are NMIBC.6 Individuals with NMIBC generally have a good prognosis, with 5-year survival rates higher than 88 percent.7 However, as many as 70 percent of NMIBC tumors recur after initial treatment, with a 10- to 20-percent risk of progression to invasive bladder cancer.6 Prognosis is poorer for patients with muscle-invasive bladder cancers (5-year survival rates from 63% to 15%).7

A number of tests are available for screening, diagnosis, and staging of bladder cancer. Standard methods for identification of bladder cancer include urine dipstick and microscopic urinalysis (to detect hematuria) and urine cytology (to detect abnormal or cancerous cells in the urine), followed by imaging tests and cystoscopy.8 Urine-based biomarkers have been developed as potential diagnostic alternatives or supplements to cytology, imaging, and cystoscopy.9 A number of biomarkers have been evaluated in conjunction with cytology for diagnosis of bladder cancer, potentially reducing the need for cystoscopy. In addition to being performed for initial diagnosis and staging, diagnostic surveillance with cystoscopy and cytology is performed following treatment to identify patients with recurrence or progression of cancer. Urine-based biomarker tests may also be used to help identify recurrence and need for cystoscopy during surveillance.

The large number of available tests and testing strategies, and potential tradeoffs in diagnostic accuracy, risks, and patient preferences pose significant challenges in determining optimal testing and monitoring strategies. Tests with high false-positive rates could lead to unnecessary invasive procedures for further evaluation, and tests with high false-negative rates could lead to missed diagnoses.

Once bladder cancer has been diagnosed, a number of factors affect prognosis and treatment options. These include the stage of the cancer, tumor grade (higher grade tumors are more likely to recur and progress), whether the tumor is an initial tumor or a recurrence, number and size of tumors, and patient's age and general health. The main treatment for NMIBC is local resection with transurethral resection of the bladder tumor (TURBT), often with adjuvant intravesical therapy to destroy residual tumor cells using chemotherapeutic agents (e.g., mitomycin C [MMC], apaziquone, paclitaxel, gemcitabine, thiotepa, valrubicin, doxorubicin, epirubicin), bacillus Calmette-Guérin (BCG), or interferon immunotherapy.10 Clinical trials of electromotive drug administration to enhance the effectiveness of intravesical chemotherapy are underway in the United States.

The purpose of this report is to review the currently available evidence on the comparative effectiveness of diagnostic tests and treatments for NMIBC. Although updated guidelines for the treatment and followup of NMIBC from the European Association of Urology were published in 2013,11 the literature continues to evolve, with much of the new evidence focusing on diagnostic techniques such as fluorescent cystoscopy or urine-based biomarkers and treatments with intravesical therapy alternatives to MMC and BCG. A systematic evidence review that includes recently published research may provide a better understanding of the comparative effectiveness of currently available approaches to diagnosis, treatment, and post-treatment surveillance for NMIBC. The systematic review may be used to update existing clinical recommendations that are several years old or may be out of date because of the development of new technologies and therapies.

Scope of Review and Key Questions

This topic was nominated for review by the American Urological Association and focuses on diagnosis of bladder cancer and treatment of NMIBC. The Key Questions and analytic framework used to guide this report are shown below. The analytic framework (Figure A) shows the scope of this review, including the target population, interventions, comparisons, and health outcomes we examined.

Figure A is an analytic framework that depicts the populations, interventions, outcomes, and adverse effects of interest for emerging approaches to diagnosis and treatment of non-muscle-invasive bladder cancer. The far left of the framework describes the target population for utilizing diagnostic tests for bladder cancer, patients with signs or symptoms of bladder cancer. To the right is diagnostic testing, utilizing one or more of the following urinary biomarkers: BTAstat® for bladder tumor-associated antigen (BTA); Alere NMP22® and BladderChek® for nuclear matrix protein 22 (NMP22); UroVysion® fluorescence in situ hybridization (FISH) assay; Cxbladder™, and ImmunoCyt™. Below the diagnostic testing is an oval for adverse effects of diagnostic testing. To the right of the diagnostic testing are four possible results of testing. Diagnostic workup negative, Muscle-invasive bladder cancer (Stage T2, T3, or T4a; N0, M0), and Metastatic bladder cancer (Stage T4b; N0; M0, or Any T; N1-3; M0, or Any T; Any N; M1) are not included in this systematic review. The third outcome is diagnosis of non-muscle invasive bladder cancer with stage Ta, Tis, or T1; N0; M0. Continuing to the right from the box with non-muscle invasive bladder cancer, are treatment of bladder cancer with chemotherapeutic and immunotherapeutic agents of interest, including mitomycin; apaziquone; paclitaxel; gemcitabine; thiotepa; valrubicin; doxorubicin; epirubicin; bacillus Calmette-Guérin (BCG); and interferon. To the right of treatment is surveillance for recurrence or progression of bladder cancer. Below the treatment and surveillance boxes is an oval for the adverse effects of treatment and/or surveillance. To the right of surveillance is a box for the health outcomes of treating and using surveillance for bladder cancer, mortality, need for cystectomy, recurrence of cancer, progression of cancer, and quality of life.

Figure A

Analytic framework. KQ = Key Question. Cancer stages shown are the TNM (tumor, node, metastasis) classification. a Urinary biomarkers of interest are restricted to tests that are approved for diagnosis of bladder cancer by the U.S. Food and Drug Administration (more...)

Key Question 1. What is the diagnostic accuracy of various urinary biomarkers compared with other urinary biomarkers or standard diagnostic methods (cystoscopy, cytology, and imaging) in (1) people with signs or symptoms warranting evaluation for possible bladder cancer or (2) people undergoing surveillance for previously treated bladder cancer?

  1. Does the diagnostic accuracy differ according to patient characteristics (e.g., age, sex, race/ethnicity), or according to the nature of the presenting signs or symptoms?

Key Question 2. For patients with non–muscle-invasive bladder cancer, does the use of a formal risk-adapted assessment approach to treatment decisions (e.g., based on Guidelines of the European Association of Urology or on urinary biomarker tests) decrease mortality or improve other outcomes (e.g., recurrence, progression, need for cystectomy, quality of life) compared with treatment not guided by a formal assessed risk-adapted approach?

Key Question 3. For patients with non–muscle-invasive bladder cancer treated with transurethral resection of bladder tumor, what is the effectiveness of various intravesical chemotherapeutic or immunotherapeutic agents for decreasing mortality or improving other outcomes (e.g., recurrence, progression, need for cystectomy, quality of life) compared with TURBT alone?

  1. What is the comparative effectiveness of various chemotherapeutic or immunotherapeutic agents, as monotherapy or in combination?
  2. Does the comparative effectiveness differ according to tumor characteristics, such as stage, grade, size, multiplicity, whether the tumor is primary or recurrent, or molecular/genetic markers?
  3. Does the comparative effectiveness differ according to patient characteristics, such as age, sex, race/ethnicity, performance status, or medical comorbidities?
  4. Does the comparative effectiveness of various chemotherapeutic or immunotherapeutic agents differ according to dosing frequency, duration of treatment, and/or the timing of administration relative to TURBT?

Key Question 4. For patients with high-risk non–muscle-invasive bladder cancer treated with TURBT, what is the effectiveness of external beam radiation therapy (either alone or with systemic chemotherapy/immunotherapy) for decreasing mortality or improving other outcomes compared with intravesical chemotherapy/immunotherapy alone or cystectomy?

Key Question 5. In surveillance of patients treated for non–muscle-invasive bladder cancer, what is the effectiveness of various urinary biomarkers to decrease mortality or improve other outcomes compared with other urinary biomarkers or standard diagnostic methods (cystoscopy, cytology, and imaging)?

  1. Does the comparative effectiveness differ according to tumor characteristics, such as histology, stage, grade, size, or molecular/genetic markers?
  2. Does the comparative effectiveness differ according to the treatment used (i.e., specific chemotherapeutic or immunotherapeutic agents and/or TURBT)?
  3. Does the comparative effectiveness differ according to the length of surveillance intervals?
  4. Does the comparative effectiveness differ according to patient characteristics, such as age, sex, or race/ethnicity?

Key Question 6. For initial diagnosis or surveillance of patients treated for non–muscle-invasive bladder cancer, what is the effectiveness of blue light or other methods of augmented cystoscopy compared with standard cystoscopy for recurrence rates, progression of bladder cancer, mortality, or other clinical outcomes?

Key Question 7. What are the comparative adverse effects of various tests for diagnosis and post-treatment surveillance of bladder cancer, including urinary biomarkers, cytology, and cystoscopy?

Key Question 8. What are the comparative adverse effects of various treatments for non–muscle-invasive bladder cancer, including intravesical chemotherapeutic or immunotherapeutic agents and TURBT?

  1. How do adverse effects of treatment vary by patient characteristics, such as age, sex, ethnicity, performance status, or medical comorbidities such as chronic kidney disease?

Methods

This Comparative Effectiveness Review (CER) follows the methods suggested in the Agency for Healthcare Research and Quality (AHRQ) “Methods Guide for Effectiveness and Comparative Effectiveness Reviews” (AHRQ Methods Guide)12 and the AHRQ “Methods Guide for Medical Test Reviews.”13 All methods were determined a priori.

Searching for the Evidence

A research librarian experienced in conducting literature searches for CERs searched in Ovid MEDLINE® (January 1990–October 2014), Cochrane Central Register of Controlled Trials (through September 2014), Cochrane Database of Systematic Reviews (through September 2014), Health Technology Assessment (through Third Quarter 2014), National Health Sciences Economic Evaluation Database (through Third Quarter 2014), and Database of Abstracts of Reviews of Effects (through Third Quarter 2014) to capture both published and gray literature. We searched for unpublished studies in clinical trial registries (ClinicalTrials.gov, Current Controlled Trials, ClinicalStudyResults.org, and the World Health Organization International Clinical Trials Registry Platform) and regulatory documents ( vog.ADF@sgurD and U.S. Food and Drug Administration (FDA) Medical Devices Registration and Listing). Reference lists of relevant studies and previous systematic reviews were hand-searched for additional studies. Scientific information packets were solicited from drug and device manufacturers, and a notice published in the Federal Register invited interested parties to submit relevant published and unpublished studies.

Study Selection

We developed criteria for inclusion and exclusion of studies based on the Key Questions and the defined population, interventions, comparators, outcomes, timing, and settings (PICOTS) and study designs. Inclusion and exclusion criteria are summarized below. Abstracts were reviewed by two investigators, and all citations deemed appropriate for inclusion by at least one of the reviewers were retrieved. Two investigators independently reviewed all full-text articles for inclusion. Discrepancies were resolved by discussion and consensus.

Population and Condition of Interest

For Key Questions related to diagnosis, we included studies of adults with signs or symptoms of possible bladder cancer (e.g., macroscopic or microscopic hematuria, irritative voiding symptoms) or undergoing surveillance following treatment for bladder cancer. For Key Questions related to treatment, we included adults with NMIBC who were undergoing treatment.

Interventions, Comparisons, and Study Designs of Interest

We included studies of urinary biomarkers for the diagnosis of bladder cancer approved by the FDA or available in the United States and classified as a Laboratory Developed Test by the FDA (CxBladder™). We excluded studies of diagnostic accuracy of other biomarkers and studies of included biomarkers that did not evaluate diagnostic accuracy of biomarkers against standard diagnostic methods (cystoscopy and histopathology). For cystoscopic methods, we included studies of fluorescent cystoscopy following intravesical instillation of a photosensitizing agent and other methods of augmented cystoscopy (e.g., narrow band imaging) for the initial diagnosis or surveillance of bladder cancer compared with standard (white light) cystoscopy.

For treatments, we included studies of intravesical therapies (MMC, apaziquone, paclitaxel, gemcitabine, thiotepa, valrubicin, doxorubicin, epirubicin, BCG, and interferon) and external beam radiation therapy with or without systemic chemotherapy or immunotherapy versus TURBT, other intravesical therapies, or cystectomy. We included studies that compared different dosing regimens, different surveillance intervals, and risk-adapted approaches versus other approaches. We also included studies on the effects of patient and tumor characteristics on estimates of effectiveness.

For all Key Questions, we included randomized trials and, when randomized trials were not available, cohort studies with concurrent controls. For diagnostic accuracy, we also included cross-sectional studies. We excluded uncontrolled observational studies, case-control studies, case series, and case reports, as these studies are less informative than studies with a control group.

Outcomes of Interest

For diagnostic accuracy of urinary biomarkers, we evaluated sensitivity, specificity, predictive values, and likelihood ratios, using cystoscopy with biopsy as the reference standard. Clinical outcomes for trials of diagnostic methods and treatments were mortality, need for cystectomy, progression to muscle-invasive bladder cancer, bladder cancer recurrence, and quality of life. We also evaluated adverse effects of diagnostic testing (e.g., false-positives, labeling, anxiety, complications of cystoscopy) and adverse effects of treatment (e.g., cystitis, urinary urgency, urinary frequency, incontinence, hematuria, pain, urosepsis, myelosuppression).

Timing and Settings of Interest

For all Key Questions, we included studies conducted in inpatient or outpatient settings with any duration of followup.

Data Extraction and Data Management

For treatment studies, we extracted the following information into evidence tables: study design; setting; inclusion and exclusion criteria; dose and duration of treatment for experimental and control groups; duration of followup; number of subjects screened, eligible, and enrolled; population characteristics (including age, race/ethnicity, sex, stage of disease, and functional status); results; adverse events; withdrawals due to adverse events; and sources of funding. We calculated relative risks (RRs) and associated 95% confidence intervals (CIs) based on the information provided (sample sizes and incidence of outcomes in each intervention group). We noted discrepancies between calculated and reported results when present.

For diagnostic accuracy studies, we abstracted the following information: setting, screening test or tests, method of data collection, reference standard, inclusion criteria, population characteristics (including age, sex, race/ethnicity, smoking status, signs or symptoms, and prior bladder cancer stage or grade), proportion of individuals with bladder cancer, bladder cancer stage and grade, definition of a positive screening exam, proportion of individuals unexaminable by the screening test, proportion who did not undergo reference standard test, results, and sources of funding. We attempted to create two-by-two tables from information provided (sample size, prevalence, sensitivity, and specificity) and compared calculated measures of diagnostic accuracy based on the two-by-two tables with reported results. We noted discrepancies between calculated and reported results when present. When reported, we also recorded the area under the receiver operating characteristic curve.14,15

Data extraction for each study was completed by one investigator and independently reviewed for accuracy and completeness by a second investigator.

Assessment of the Risk of Bias of Individual Studies

We assessed the risk of bias for randomized trials and observational studies using criteria adapted from those developed by the U.S. Preventive Services Task Force.16 Studies of diagnostic accuracy were rated using criteria adapted from QUADAS-2, a revised tool for Quality Assessment of Diagnostic Accuracy Studies.17 These criteria were applied in conjunction with the approaches recommended for medical interventions in the AHRQ Methods Guide12 and in the AHRQ “Methods Guide for Medical Test Reviews.”13

Two investigators independently assessed the risk of bias of each study. Discrepancies were resolved through discussion and consensus. Each study was rated as low, medium, or high risk of bias.12

Studies rated low risk of bias were considered to have no more than very minor methodological shortcomings, and their results are likely to be valid. Studies rated moderate risk of bias have some methodological shortcomings, but no flaw or combination of flaws judged likely to cause major bias. The category of moderate risk of bias is broad, and studies with this rating vary in their strengths and weaknesses; the results of some studies assessed to have moderate risk of bias are likely to be valid, while others may be only possibly valid. Studies rated high risk of bias have significant flaws that may invalidate the results. They have a serious or “fatal” flaw or combination of flaws in design, analysis, or reporting; large amounts of missing information; or serious discrepancies in reporting. We did not exclude studies rated as having high risk of bias a priori, but they were considered the least reliable when synthesizing the evidence, particularly when discrepancies between studies were present.

Assessing Applicability

We recorded factors important for understanding the applicability of studies, such as whether the publication adequately described the study sample, the country in which the study was conducted, the characteristics of the patient sample (e.g., age, sex, race/ethnicity, risk factors for bladder cancer, presenting symptoms, and medical comorbidities), tumor characteristics (e.g., stage and grade, primary or recurrent, unifocal or multifocal lesions), the characteristics of the diagnostic tests (e.g., specific test evaluated and cutoffs used) and interventions (e.g., treatment dose, duration, and interval) used, and the magnitude of effects on clinical outcomes.12 There is no generally accepted universal rating system for applicability, which depends in part on context. Therefore, a rating of applicability (such as high or low) was not assigned because applicability may differ based on the user of this report.

Data Synthesis

For studies on diagnostic accuracy of urinary biomarkers, we performed meta-analyses to help summarize data and obtain more precise estimates.18 We used a bivariate logistic mixed-effects model19 to analyze sensitivity and specificity, incorporating the correlation between sensitivity and specificity. We assumed random effects across studies with a bivariate normal distribution for sensitivity and specificity, and heterogeneity among the studies was measured based on the random-effect variance (τ2). When few studies were available for an analysis, we used the moment estimates of correlation between sensitivity and specificity in the bivariate model. We calculated positive likelihood ratio and negative likelihood ratio using the summarized sensitivity and specificity.20,21 For head-to-head comparisons, we used the same bivariate logistic mixed-effects model as described above but added an indicator variable for imaging modalities (equivalent to a meta-regression approach).

All quantitative analyses were conducted using SAS® 10.0 (SAS Institute Inc., Cary, NC).22 We assessed the presence of statistical heterogeneity among the studies using the standard Cochran's chi-square test, and the magnitude of heterogeneity by using the I2 statistic.23 When statistical heterogeneity was present, we performed sensitivity analyses by conducting meta-analysis using the profile likelihood method.24 We also performed sensitivity and subgroup analyses based on ratings for risk of bias, dose of intravesical therapy, inclusion of high-risk patients, and duration of followup. We stratified trials according to the type of instillation regimen, classified as single instillation, induction therapy (treatment for 4 to 8 weeks), maintenance therapy (treatment for longer than 8 weeks), or other. We calculated pooled RRs for the dichotomous outcomes for bladder cancer recurrence, bladder cancer progression, all-cause mortality, bladder cancer mortality, and local and systemic adverse events. Similar analyses were performed for trials of augmented cystoscopy (fluorescent light or narrow band imaging) versus white light cystoscopy.

Grading the Strength of Evidence for Each Key Question

We assessed the strength of evidence (SOE) for each Key Question and outcome using the approach described in the AHRQ Methods Guide,12 based on the overall quality of each body of evidence; the risk of bias (graded low, moderate, or high); the consistency of results across studies (graded consistent, inconsistent, or unable to determine when only 1 study was available); the directness of the evidence linking the intervention and health outcomes (graded direct or indirect); the precision of the estimate of effect, based on the number and size of studies and CIs for the estimates (graded precise or imprecise); and reporting bias (suspected or undetected)

Assessments of reporting bias were based on whether studies defined and reported primary outcomes, identification of relevant unpublished studies, and when available, by comparing published results with results reported in trial registries.

We graded the SOE for each Key Question using the four categories recommended in the AHRQ Methods Guide.12 A high grade indicates high confidence that the evidence reflects the true effect and that further research is very unlikely to change our confidence in the estimate of effect. A moderate grade indicates moderate confidence that the evidence reflects the true effect; further research may change our confidence in the estimate of effect and may change the estimate. A low grade indicates low confidence that the evidence reflects the true effect; further research is likely to change the confidence in the estimate of effect and is likely to change the estimate. A grade of insufficient indicates that evidence either is unavailable or is too limited to permit any conclusion because of the availability of only poor-quality studies, extreme inconsistency, or extreme imprecision.

Results

Database searches resulted in 4,071 potentially relevant articles. After dual review of abstracts and titles, 643 articles were selected for full-text dual review, and 149 studies (in 192 publications) were determined to meet inclusion criteria and were included in this review.

Key Question 1. Diagnostic Accuracy: Comparison of Urinary Biomarkers

For this Key Question, we included 57 studies (in 60 publications) that evaluated the diagnostic accuracy of urinary biomarkers for diagnosis of bladder cancer.

  • Quantitative nuclear matrix protein 22 (NMP22): Sensitivity was 0.69 (95% CI, 0.62 to 0.75) and specificity 0.77 (95% CI, 0.70 to 0.83), based on 19 studies, for a positive likelihood ratio of 3.05 (95% CI, 2.28 to 4.10) and negative likelihood ratio of 0.40 (95% CI, 0.32 to 0.50) (SOE: moderate)
    • For evaluation of symptoms: Sensitivity was 0.67 (95% CI, 0.55 to 0.77; 9 studies) and specificity 0.84 (95% CI, 0.75 to 0.90; 7 studies).
    • For surveillance: Sensitivity was 0.61 (95% CI, 0.49 to 0.71; 10 studies) and specificity 0.71 (95% CI, 0.60 to 0.81; 8 studies).
  • Qualitative NMP22: Sensitivity was 0.58 (95% CI, 0.39 to 0.75) and specificity 0.88 (95% CI, 0.78 to 0.94), based on four studies, for a positive likelihood ratio of 4.89 (95% CI, 3.23 to 7.40) and negative likelihood ratio of 0.48 (95% CI, 0.33 to 0.71) (SOE: low).
    • For evaluation of symptoms: Sensitivity was 0.47 (95% CI, 0.33 to 0.61) and specificity 0.93 (95% CI, 0.81 to 0.97), based on two studies.
    • For surveillance: Sensitivity was 0.70 (95% CI, 0.40 to 0.89) and specificity 0.83 (95% CI, 0.75 to 0.89), based on two studies.
  • Qualitative bladder tumor antigen (BTA): Sensitivity was 0.64 (95% CI, 0.58 to 0.69; 22 studies) and specificity 0.77 (95% CI, 0.73 to 0.81; 21 studies), for a positive likelihood ratio of 2.80 (95% CI, 2.31 to 3.39) and negative likelihood ratio of 0.47 (95% CI, 0.30 to 0.55) (SOE: moderate).
    • For evaluation of symptoms: Sensitivity was 0.76 (95% CI, 0.67 to 0.83; 8 studies), and specificity 0.78 (95% CI, 0.66 to 0.87; 6 studies).
    • For surveillance: Sensitivity was 0.60 (95% CI, 0.55 to 0.65; 11 studies) and specificity 0.76 (95% CI, 0.69 to 0.83; 8 studies).
  • Quantitative BTA: Sensitivity was 0.65 (95% CI, 0.54 to 0.75) and specificity 0.74 (95% CI, 0.64 to 0.82), based on four studies, for a positive likelihood ratio of 2.52 (95% CI, 1.86 to 3.41) and negative likelihood ratio of 0.47 (95% CI, 0.37 to 0.61) (SOE: low).
    • For evaluation of symptoms: Sensitivity was 0.76 (95% CI, 0.61 to 0.87) and specificity 0.53 (95% CI, 0.38 to 0.68), based on one study.
    • For surveillance: Sensitivity was 0.58 (95% CI, 0.46 to 0.69) and specificity 0.79 (95% CI, 0.72 to 0.85), based on two studies.
  • Fluorescence in situ hybridization (FISH): Sensitivity was 0.63 (95% CI, 0.50 to 0.75) and specificity 0.87 (95% CI, 0.79 to 0.93), based on 11 studies, for a positive likelihood ratio of 5.02 (95% CI, 2.93 to 8.60) and negative likelihood ratio of 0.42 (95% CI, 0.30 to 0.59) (SOE: moderate).
    • For evaluation of symptoms: Sensitivity was 0.73 (95% CI, 0.50 to 0.88) and specificity 0.95 (95% CI, 0.87 to 0.98), based on two studies, for a positive likelihood ratio of 14.2 (95% CI, 5.2 to 39) and negative likelihood ratio of 0.29 (95% CI, 0.14 to 0.60).
    • For surveillance: Sensitivity was 0.55 (95% CI, 0.36 to 0.72; 7 studies) and specificity was 0.80 (95% CI, 0.66 to 0.89; 6 studies).
  • ImmunoCyt: Sensitivity was 0.78 (95% CI, 0.68 to 0.85) and specificity 0.78 (95% CI, 0.72 to 0.82), based on 14 studies, for a positive likelihood ratio of 3.49 (95% CI, 2.82 to 4.32) and negative likelihood ratio of 0.29 (95% CI, 0.20 to 0.41) (SOE: moderate).
    • For evaluation of symptoms: Sensitivity was 0.85 (95% CI, 0.78 to 0.90; 6 studies) and specificity 0.83 (95% CI, 0.77 to 0.87; 7 studies).
    • For surveillance: Sensitivity was 0.75 (95% CI, 0.64 to 0.83; 7 studies) and specificity 0.76 (95% CI, 0.70 to 0.81; 8 studies).
  • CxBladder: Sensitivity was 0.82 (95% CI, 0.70 to 0.90) and specificity 0.85 (95% CI, 0.81 to 0.88) for evaluation of symptoms, based on one study, for a positive likelihood ratio of 5.53 (95% CI, 4.28 to 7.15) and negative likelihood ratio of 0.21 (95% CI, 0.13 to 0.36) (SOE: low).
  • Direct (within-study) comparisons:
    • There was no difference between quantitative NMP22 (cutoff >10 U/mL) versus qualitative BTA in sensitivity (0.69 [95% CI, 0.62 to 0.76] vs. 0.66 [95% CI, 0.59 to 0.73], for a difference of 0.03 [95% CI, -0.04 to 0.10]) or specificity (0.73 [95% CI, 0.62 to 0.82] vs. 0.76 [95% CI, 0.66 to 0.84], for a difference of 0.03 [95% CI, -0.08 to 0.01]), based on seven studies (SOE: moderate).
    • ImmunoCyt was associated with higher sensitivity than FISH (0.71 [95% CI, 0.54 to 0.84] vs. 0.61 [95% CI, 0.43 to 0.76], for a difference of 0.11 [95% CI, 0.001 to 0.21]) but lower specificity (0.71 [95% CI, 0.62 to 0.79] vs. 0.79 [95% CI, 0.71 to 0.85], for a difference of -0.08 [95% CI, -0.15 to -0.001]), based on three studies (SOE: low).
    • Evidence for other head-to-head comparisons of urinary biomarkers was based on small numbers of studies with imprecise estimates and methodological shortcomings, precluding reliable conclusions regarding comparative test performance (SOE: insufficient).
    • Sixteen studies found sensitivity of various urinary biomarkers plus cytology to be associated with higher sensitivity than the urinary biomarker alone (0.8 [95% CI, 0.75 to 0.86] vs. 0.69 [95% CI, 0.61 to 0.76], for a difference of 0.13 [95% CI, 0.08 to 0.17]), with no difference in specificity (SOE: moderate).

Key Question 1a. Diagnostic Accuracy: Patient Characteristics or Presenting Signs or Symptoms

For this Key Question, we included 42 studies that evaluated diagnostic accuracy according to patient characteristics or the nature of the presenting signs or symptoms.

  • Effects of tumor stage: Across urinary biomarkers, sensitivity increased with higher tumor stage. Evidence was most robust for quantitative NMP22 (11 studies), qualitative BTA (18 studies), and FISH (8 studies); the association between higher tumor stage and increased sensitivity was least pronounced for ImmunoCyt (10 studies). Sensitivity for carcinoma in situ (CIS) tumors was generally similar to or slightly lower than for T1 tumors (SOE: high).
  • Effects of tumor grade: Across urinary biomarkers, sensitivity increased with higher tumor grade. Evidence was most robust for quantitative NMP22 (12 studies), ImmunoCyt (10 studies), qualitative BTA (18 studies), and FISH (9 studies) (SOE: high).
  • Effects of tumor size: Two studies found that sensitivity was higher for larger (>1 cm or >2 cm) versus smaller tumors (SOE: low).
  • Evidence on the effects of patient characteristics, such as age, sex, smoking status, and presence of other clinical conditions, on diagnostic accuracy of urinary biomarkers was limited and did not clearly or consistently indicate effects on sensitivity or specificity (SOE: low).

Key Question 2. Use of Formal Risk-Adapted Assessment Approach

This Key Question addresses the issue of whether use of a formal risk-adapted assessment approach to treatment decisions decreases mortality or improves other outcomes compared with treatment not guided by a formal risk-adapted assessment approach.

  • No study compared clinical outcomes associated with use of a formal risk-adapted approach to guide treatment of NMIBC versus treatment not guided by a risk-adapted approach (SOE: insufficient).

Key Question 3. Effect of TURBT Plus Intravesical Therapy Versus TURBT Alone

This Key Question addresses the issue of whether the use of various intravesical chemotherapeutic or immunotherapeutic agents in addition to TURBT decreases mortality or improves other outcomes compared with TURBT alone. We included 37 studies (in 46 publications) that evaluated intravesical therapy versus no intravesical therapy.

  • BCG was associated with decreased risk of bladder cancer recurrence (3 trials; RR, 0.56; 95% CI, 0.43 to 0.71; I2 = 0%) and progression (4 trials; RR, 0.39; 95% CI, 0.24 to 0.64; I2 = 40%) versus no intravesical therapy. No trial evaluated effects of BCG versus no intravesical therapy on risk of all-cause mortality. One trial found BCG to be associated with decreased risk of bladder cancer mortality, but the difference was not statistically significant (RR, 0.62; 95% CI, 0.32 to 1.19) (SOE: insufficient for all-cause and bladder cancer mortality; low for recurrence and progression).
  • MMC was associated with decreased risk of bladder cancer recurrence versus no intravesical therapy (8 trials; RR, 0.71; 95% CI, 0.57 to 0.89; I2 = 72%), but there was no difference in risk of all cause-mortality (1 trial; hazard ratio [HR], 1.17; 95% CI, 0.89 to 1.53), and effects on bladder cancer mortality (1 trial; HR, 0.71; 95% CI, 0.34 to 1.46) and bladder cancer progression (5 trials; RR, 0.68; 95% CI, 0.39 to 1.20, I2 = 0%) were not statistically significant (SOE: moderate for recurrence; low for progression, all-cause mortality, and bladder cancer–specific mortality).
  • Doxorubicin was associated with decreased risk of bladder cancer recurrence versus no intravesical therapy (10 trials; RR, 0.80; 95% CI, 0.72 to 0.88; I2 = 46%), no difference in risk of bladder cancer progression (5 trials; RR, 1.03; 95% CI, 0.72 to 1.46; I2 = 0%), and no clear effects on all-cause mortality (2 trials) or bladder cancer–specific mortality (1 trial) (SOE: moderate for recurrence; low for progression, all-cause mortality, and bladder cancer–specific mortality).
  • Epirubicin was associated with decreased risk of bladder cancer recurrence (9 trials; RR, 0.63; 95% CI, 0.53 to 0.75; I2 = 64%) (SOE: moderate), but the effect on bladder cancer progression was not statistically significant (8 trials; RR, 0.79; 95% CI, 0.84 to 1.30; I2 = 27%) (SOE: low).
  • Gemcitabine was examined in one trial that found no difference between single-instillation gemcitabine versus no intravesical therapy in risk of bladder cancer recurrence (RR, 0.98; 95% CI, 0.70 to 1.36); estimates for progression (RR, 3.00; 95% CI, 0.32 to 28.4), all-cause mortality (RR, 0.50; 95% CI, 0.13 to 2.00), and bladder cancer–specific mortality (RR, 1.00; 95% CI, 0.06 to 15.81) were very imprecise (SOE: low for bladder cancer recurrence; insufficient for all-cause and bladder cancer–specific mortality and progression).
  • Interferon alpha was associated with decreased risk of bladder cancer recurrence versus no intravesical therapy that was not statistically significant (3 trials; RR, 0.75; 95% CI, 0.53 to 1.06; I2 = 50%), decreased risk of bladder cancer progression (2 trials; RR, 0.33; 95% CI, 0.14 to 0.76; I2 = 0%), and no difference in risk of bladder cancer–specific mortality (1 trial; RR, 1.00; 95% CI, 0.15 to 6.75) (SOE: low).
  • Interferon gamma was associated with decreased risk of bladder cancer recurrence versus no intravesical therapy (1 trial; RR, 0.72; 95% CI, 0.51 to 1.01), with no difference in risk of bladder cancer progression (1 trial; RR, 1.08; 95% CI, 0.07 to 16.4) (SOE: low).
  • Thiotepa was associated with decreased risk of bladder cancer recurrence versus no intravesical therapy that was not statistically significant (5 trials; RR, 0.78; 95% CI, 0.58 to 1.06; I2 = 69%), with insufficient evidence to determine effects on progression or mortality (SOE: low for recurrence, insufficient for all-cause and bladder cancer mortality and progression).

Key Question 3a. Comparative Effectiveness: Chemotherapeutic or Immunotherapeutic Agents as Monotherapy or in Combination

For this Key Question, we included 54 studies in 66 publications.

BCG Versus MMC

  • There were no differences between BCG and MMC in risk of bladder cancer recurrence (10 trials; RR, 0.95; 95% CI, 0.81 to 1.11; I2 = 67%), but BCG was associated with decreased risk in the subgroup of trials that evaluated maintenance regimens (5 trials; RR, 0.79; 95% CI, 0.71 to 0.87; I2 = 0%). There was no difference in risk of all-cause (7 trials; RR, 0.94; 95% CI, 0.83 to 1.06; I2 = 0%) or bladder cancer–specific mortality (5 trials; RR, 0.77; 95% CI, 0.54 to 1.10; I2 = 0%) or progression (7 trials; RR, 0.88; 95% CI, 0.66 to 1.17; I2 = 18%) (SOE: moderate for all-cause mortality, bladder cancer–specific mortality, and progression; low for recurrence).
  • There were no differences between BCG alone and BCG plus MMC given sequentially in risk of all-cause (1 trial; RR, 1.57; 95% CI, 0.67 to 3.71) or bladder cancer–specific mortality (2 trials; RR, 1.10; 95% CI, 0.50 to 2.38; I2 = 17%), bladder cancer recurrence (4 trials; RR, 1.03; 95% CI, 0.70 to 1.52; I2 = 75%), progression (3 trials; RR, 0.87; 95% CI, 0.40 to 1.91; I2 = 22%), or cystectomy (4 trials; RR, 0.87; 95% CI, 0.41 to 1.84; I2 = 0%) (SOE: low).
  • There were no differences between BCG plus MMC administered sequentially and MMC alone in risk of all-cause (2 trials; RR, 1.53; 95% CI, 0.72 to 1.74 and RR 0.95; 95% CI, 0.71 to 1.30) or bladder cancer–specific mortality (2 trials; RR, 0.64; 95% CI, 0.22 to BCG 1.88 and RR, 0.95; 95% CI, 0.45 to 1.56), bladder cancer recurrence (2 trials; RR, 0.88; 95% CI, 0.75 to 1.03; I2 = 0%), or progression (2 trials; RR, 0.82; 95% CI, 0.40 to 1.68 and RR, 1.28; 95% CI, 0.35 to 4.61) (SOE: low).

BCG Versus Doxorubicin

  • BCG was associated with decreased risk of bladder cancer recurrence versus doxorubicin (2 trials; RR, 0.31; 95% CI, 0.16 to 0.61 and RR, 0.75; 95% CI, 0.64 to 0.88), but there were no differences in risk of all-cause mortality (2 trials; RR, 0.40; 95% CI, 0.01 to 12 and RR, 1.00; 95% CI, 0.71 to 1.37) or bladder cancer progression (1 trial; RR, 0.20; 95% CI, 0.02 to 1.72) (SOE: low).

BCG Versus Epirubicin

  • BCG was associated with reduced risk of bladder cancer recurrence versus epirubicin, but statistical heterogeneity was high (5 trials; RR, 0.54; 95% CI, 0.40 to 0.74; I2 = 76%). Estimates favored BCG for all-cause (3 trials; RR, 0.72; 95% CI, 0.44 to 1.19; I2 = 87%) and bladder cancer–specific mortality (3 trials; RR, 0.72; 95% CI, 0.25 to 2.08; I2 = 80%) and bladder cancer progression (5 trials; RR, 0.60; 95% CI, 0.36 to 1.01; I2 = 47%), but differences were not statistically significant (SOE: moderate for recurrence; low for all-cause mortality, bladder cancer–specific mortality, and progression).
  • There was no difference between BCG alone and BCG plus epirubicin administered sequentially in risk of bladder cancer recurrence (3 trials; RR, 1.25; 95% CI, 0.92 to 1.69; I2 = 0%). BCG alone was associated with increased risk of bladder cancer progression (3 trials; RR, 1.92; 95% CI, 0.73 to 5.07; I2 = 0%), but the difference was not statistically significant (SOE: low).
  • One trial found no differences between BCG alone and epirubicin plus interferon alpha-2b in risk of bladder cancer–specific mortality (RR, 0.79; 95% CI, 0.32 to 1.63) or progression-free survival, although BCG was associated with decreased risk of bladder cancer recurrence (RR, 0.66; 95% CI, 0.51 to 0.85) (SOE: low).

BCG Versus Gemcitabine

  • There were no differences between BCG and gemcitabine in risk of all-cause mortality (1 trial; RR, 1.20; 95% CI, 0.04 to 34), progression (2 trials; RR, 1.11; 95% CI, 0.53 to 2.34 and RR, 0.52; 95% CI, 0.13 to 2.06), or quality of life (1 trial) (SOE: low).
  • Evidence from three trials was insufficient to determine effects of BCG versus gemcitabine on risk of bladder cancer recurrence because of clinical heterogeneity and inconsistent findings (RR, 1.67 [95% CI, 1.21 to 2.29]; RR, 0.53 [95% CI, 0.28 to 1.01]; and RR, 0.76 [95% CI, 0.44 to 1.90]) (SOE: insufficient).
  • There were no differences between BCG alone and BCG plus gemcitabine administered sequentially in risk of bladder cancer recurrence (1 trial; RR, 0.86; 95% CI, 0.49 to 1.51) or progression (1 trial; RR, 1.18; 95% CI, 0.30 to 4.61) (SOE: low).

BCG Versus Interferon

  • BCG was associated with reduced risk of bladder cancer recurrence versus interferon alpha-2a (1 trial; RR, 0.57; 95% CI, 0.39 to 0.82), but the difference in risk of bladder cancer progression was not statistically significant (1 trial; RR, 0.69; 95% CI, 0.25 to 1.92) (SOE: low).
  • In patients pretreated with MMC, BCG alone was associated with reduced risk of bladder cancer recurrence versus alternating BCG plus interferon alpha-2b (1 trial; RR, 0.42; 95% CI, 0.30 to 0.59) (SOE: low).
  • Differences between BCG alone and coadministration of BCG and interferon alpha-2b in risk of bladder cancer recurrence (1 trial; RR, 0.88; 95% CI, 0.71 to 1.08) or progression (1 trial; RR, 0.76; 95% CI, 0.17 to 3.30) did not reach statistical significance (SOE: low).

BCG Versus Thiotepa

  • Two trials found that, for maintenance therapy, BCG was associated with decreased risk of recurrence versus thiotepa (RR, 0.38 [95% CI, 0.19 to 0.76] and RR, 0.04 [95% CI, 0.00 to 0.63]), but estimates for other outcomes were too imprecise to evaluate effects (SOE: low for recurrence; insufficient for progression, death, and cystectomy).

MMC Versus Doxorubicin

  • There was no difference between MMC and doxorubicin in risk of bladder cancer recurrence (6 trials; RR, 1.00; 95% CI, 0.82 to 1.22; I2 = 44%), but MMC was associated with a non–statistically significant trend toward decreased risk of bladder cancer progression (4 trials; RR, 0.63; 95% CI, 0.37 to 1.08; I2 = 21%) (SOE: low).

MMC Versus Epirubicin

  • There was no difference between MMC and epirubicin in risk of bladder cancer recurrence in one trial (RR, 1.16; 95% CI, 0.52 to 2.58) (SOE: low).

MMC Versus Gemcitabine

  • In one trial, MMC was associated with no difference in risk of bladder cancer progression compared with gemcitabine (p = 0.29). MMC was associated with increased risk of recurrence, but the difference was not statistically significant (RR, 1.64; 95% CI, 0.64 to 4.19) (SOE: low).

MMC Versus Interferon Alpha

  • One trial found no difference between MMC and interferon alpha in risk of bladder cancer recurrence (RR, 0.77; 95% CI, 0.58 to 1.01) or bladder cancer progression (RR, 1.38; 95% CI, 0.49 to 3.88) (SOE: low).

MMC Versus Interferon Gamma

  • MMC was associated with increased risk of bladder cancer recurrence versus interferon gamma in one trial (RR, 1.61; 95% CI, 0.97 to 2.67) (SOE: low).

MMC Versus Thiotepa

  • Two trials found no difference between MMC and thiotepa in risk of recurrence (RR, 1.76 [95% CI, 0.36 to 8.70] and RR, 1.14 [95% CI, 0.60 to 2.16]) (SOE: low).

Doxorubicin Versus Epirubicin

  • Doxorubicin was associated with increased risk of bladder cancer recurrence versus epirubicin (3 trials; RR, 1.56; 95% CI, 1.08 to 2.22; I2 = 0%); the difference in risk of progression was not statistically significant (1 trial; RR, 1.32; 95% CI, 0.50 to 3.47) (SOE: low).

Doxorubicin Versus Thiotepa

  • There was no statistically significant difference between doxorubicin and thiotepa in risk of bladder cancer recurrence (RR, 1.22; 95% CI, 0.76 to 1.94). Estimates from one trial for progression (RR, 2.11; 95% CI, 0.40 to 11.06), noncancer mortality (RR, 0.35; 95% CI, 0.01 to 8.45), and cancer-specific mortality (RR, 3.17; 95% CI, 0.13 to 76.1) were very imprecise (SOE: low for recurrence; insufficient for progression, noncancer mortality, and cancer-specific mortality).

Epirubicin Versus Interferon Alpha

  • Epirubicin was associated with decreased risk of bladder cancer recurrence versus interferon alpha in one trial (RR, 0.67; 95% CI, 0.49 to 0.91) (SOE: low).

Key Question 3b. Comparative Effectiveness: Tumor Characteristics

For this Key Question, we included 29 studies.

  • There were no clear differences in estimates of effectiveness of intravesical therapies in subgroups defined by tumor stage, grade, size, multiplicity, recurrence status, or DNA ploidy (SOE: low).

Key Question 3c. Comparative Effectiveness: Patient Characteristics

  • No trial evaluated how estimates of effectiveness of intravesical therapy vary in subgroups defined by patient characteristics, such as age, sex, race/ethnicity, performance status, and comorbidities (SOE: insufficient).
  • In patients with recurrence or progression following prior BCG therapy, one trial found maintenance therapy with gemcitabine to be associated with decreased risk of recurrence versus repeat treatment with BCG, and one trial found MMC maintenance therapy to be associated with lower likelihood of disease-free survival than gemcitabine; estimates for progression were imprecise (SOE: low).

Key Question 3d. Comparative Effectiveness: Dosing Frequency, Treatment Duration, Timing

For this Key Question, we included 53 studies (in 57 publications) that compared different doses or instillation regimens of the same drug or different BCG strains.

BCG

  • Six trials found no clear differences between standard and lower doses of BCG in risk of recurrence, progression, or bladder cancer–specific mortality, including in patients with higher risk NMIBC, although there was some inconsistency between trials. Standard therapy was associated with increased risk of local and systemic adverse events versus lower dose BCG in most trials (SOE: low).
  • Three trials of responders to BCG induction therapy found no clear differences between maintenance therapy versus no maintenance therapy in risk of all-cause mortality (3 trials; RR, 0.90; 95% CI, 0.72 to 1.11) or bladder cancer–specific mortality (2 trials; RR, 1.14; 95% CI, 0.24 to 5.40), although maintenance therapy was associated with decreased risk of recurrence (RR, 0.76 [95% CI, 0.65 to 0.88] and RR, 0.16 [95% CI, 0.02 to 1.21]) (SOE: low).
  • Two of three trials found that more prolonged courses of BCG were associated with decreased risk of bladder cancer recurrence versus induction therapy in patients with higher risk NMIBC, but increased risk of adverse events (SOE: low).
  • One trial found OncoTICE® strain BCG to be associated with lower likelihood of 5-year recurrence-free survival than BCG Connaught (48% vs. 74%; p = 0.01), and one trial found OncoTICE strain BCG to be associated with lower likelihood of 5-year recurrence-free survival than RIVM strain BCG (36% vs. 54%; p = 0.07). Four trials that compared non-OncoTICE BCG strains found no differences (SOE: low).

MMC

  • One trial of patients with NMIBC (not selected for being at higher risk) found no clear differences between MMC 40 mg single instillation and MMC 40 mg five instillations in risk of recurrence, progression, or mortality. The single instillation was associated with lower risk of local adverse events (SOE: low).
  • One trial of patients with higher risk NMIBC found that MMC 20 mg induction therapy for 6 weeks was associated with higher risk of recurrence than maintenance therapy. There were no clear differences in risk of adverse events (SOE: low).
  • Two trials of MMC maintenance regimens in patients with NMIBC not selected for being at higher risk found some evidence that a higher total number of instillations and increased frequency during initial therapy were associated with lower risk of recurrence and progression, and might be associated with lower risk of local adverse events (SOE: low).
  • One trial found no difference between “optimized” (through alkalinization of urine) versus nonoptimized administration of intravesical MMC in risk of recurrence in patients with low-risk NMIBC, but one trial of patients with higher risk NMIBC found optimized administration to be associated with lower risk of recurrence and increased risk of local adverse events (SOE: low).

Doxorubicin

  • Two trials of patients with NMIBC not selected for being at higher risk found no differences between doxorubicin 30 mg and 20 mg given as short (8 weeks) or long (2 years) regimens in risk of recurrence or progression, with no differences in adverse events (SOE: low).
  • Two trials of patients with NMIBC not selected for being at higher risk found no clear differences between doxorubicin induction therapy alone and induction plus maintenance in risk of recurrence, progression, or mortality, with no differences in adverse events (SOE: low).
  • Two trials of doxorubicin found no clear benefits associated with administration prior to TURBT or multiple instillations immediately after TURBT, with some evidence of increased adverse events with multiple immediate post-TURBT instillations (SOE: low).

Epirubicin

  • Three trials of epirubicin found no clear evidence that higher doses are associated with reduced risk of recurrence or progression versus lower doses, with no differences in adverse events (SOE: moderate).
  • Three trials found no clear difference between single-instillation epirubicin and multiple instillations in patients with low- or high-risk NMIBC in risk of recurrence, progression, or bladder cancer–specific mortality, with some evidence of lower risk of local adverse events with single instillation (SOE: moderate).
  • Two trials, including one trial of patients with higher risk NMIBC, found no clear differences between epirubicin maintenance therapy and induction without maintenance in risk of recurrence or progression. There were no differences in risk of local adverse events (SOE: moderate).
  • Five trials that evaluated different epirubicin regimens that included maintenance therapy found some evidence that more intensive therapy is associated with decreased risk of recurrence, but results were inconsistent. There was no difference in risk of adverse events (SOE: low).

Thiotepa

  • Two trials found no clear differences between thiotepa 30 mg and 60 mg for maintenance or for treatment of incompletely resected NMIBC or CIS (SOE: low).

Interferon Alpha-2b

  • Four trials found that higher doses of interferon alpha-2b were associated with improved outcomes related to recurrence, progression, or resolution of bladder cancer marker lesions versus lower doses, but most estimates were imprecise and did not reach statistical significance. There were no clear differences in risk of local or systemic adverse events (SOE: low).

Multiple Drugs

  • One trial found no difference between initiation of intravesical therapy (9 instillations over 6 months) with MMC or doxorubicin 50 mg on the day of TURBT versus 1 to 2 weeks after TURBT in risk of recurrence, progression, or mortality, or between maintenance beyond 6 months versus no additional maintenance therapy. There were no clear differences in local or systemic adverse events (SOE: low).

Key Question 4. For TURBT Patients, Effectiveness of Radiation Therapy Versus Intravesical Therapy or Cystectomy

This Key Question addressed the effectiveness of external beam radiation therapy for decreasing mortality or improving other outcomes compared with intravesical chemotherapy/immunotherapy alone or cystectomy in patients treated with TURBT. One randomized trial (rated moderate risk of bias) compared external beam radiation therapy with no radiation therapy in patients with NMIBC.

  • One randomized trial of patients with T1 Grade (G) 3G3 bladder cancer found no effects of radiation therapy versus no radiotherapy (for unifocal disease and no CIS) or radiation therapy versus intravesical therapy (for multifocal disease or CIS) in recurrence-free survival (HR, 0.94; 95% CI, 0.67 to 1.30), progression-free interval (HR, 1.07; 95% CI, 0.65 to 1.74), progression-free survival (HR, 1.35; 95% CI, 0.92 to 1.98), or overall survival (HR, 1.32; 95% CI, 0.86 to 2.04) after 5 years (SOE: low).

Key Question 5. Effectiveness of Urinary Biomarkers Versus Other Urinary Biomarkers or Standard Diagnostic Methods for Surveillance

  • No study evaluated the effectiveness of urinary biomarkers to decrease mortality or improve other outcomes compared with standard diagnostic methods or other urinary biomarkers in surveillance of patients treated for NMIBC.

Key Question 5a. Comparative Effectiveness: Tumor Characteristics

  • No evidence was found (SOE: insufficient).

Key Question 5b. Comparative Effectiveness: Treatment Used

This Key Question addressed the issue of whether comparative effectiveness differs according to the treatment used (i.e., specific chemotherapeutic or immunotherapeutic agents and/or TURBT).

  • No evidence was found (SOE: insufficient).

Key Question 5c. Comparative Effectiveness: Surveillance Intervals

  • No evidence was found (SOE: insufficient).

Key Question 5d. Comparative Effectiveness: Patient Characteristics

  • No evidence was found (SOE: insufficient).

Key Question 6. Effectiveness of Augmented Versus Standard Cystoscopy

This Key Question addresses the effectiveness of blue light or other methods of augmented cystoscopy compared with standard cystoscopy for recurrence rates, progression of bladder cancer, mortality, or other clinical outcomes in initial diagnosis or surveillance of patients treated for NMIBC. We included 14 trials (in 19 publications) that evaluated clinical outcomes of augmented (fluorescent or narrow band imaging) cystoscopy versus standard white light cystoscopy.

  • There was no difference between fluorescent versus white light cystoscopy in risk of mortality (3 trials; RR, 1.28; 95% CI, 0.55 to 2.95; I2 = 41%) (SOE: low).
  • Fluorescent cystoscopy with 5-aminolevulinic acid (5-ALA) or hexaminolevulinate (HAL) was associated with decreased risk of bladder cancer recurrence versus white light cystoscopy at short-term (<3 months; 9 trials; RR, 0.58; 95% CI, 0.36 to 0.94, I2 = 75%), intermediate-term (3 months to <1 year; 5 trials; RR, 0.67; 95% CI, 0.51 to 0.88; I2 = 35%), and long-term followup (≥1 year; 11 trials; RR, 0.81; 95% CI, 0.68 to 0.98; I2 = 64%), but findings were inconsistent and potentially susceptible to performance bias (because of failure to blind the initial cystoscopy) and publication bias (SOE: low).
  • There was no difference between fluorescent and white light cystoscopy in risk of progression to muscle-invasive bladder cancer (9 trials; RR, 0.78; 95% CI, 0.55 to 1.12; I2 = 0%) (SOE: moderate).
  • Narrow band imaging was associated with lower risk of bladder cancer recurrence at 3 months (3.9% vs. 17%; odds ratio [OR], 0.62; 95% CI, 0.41 to 0.92) and at 12 months (OR, 0.24; 95% CI, 0.07 to 0.81) compared with white light cystoscopy in one trial (SOE: low).

Key Question 7. Adverse Effects: Tests

We included seven studies that evaluated adverse effects of various tests for diagnosis and post-treatment surveillance of bladder cancer.

  • Urinary biomarkers miss 23 to 42 percent of patients with bladder cancer and are incorrectly positive in 11 to 28 percent of patients without bladder cancer, but no study directly measured effects of inaccurate diagnosis on clinical outcomes (SOE: insufficient).
  • There were no clear differences between fluorescent cystoscopy and white light cystoscopy in the risk of false-positives in two trials (SOE: low).
  • There were no clear differences between fluorescent cystoscopy and white light cystoscopy in the risk of renal and genitourinary adverse events in two trials (SOE: low).

Key Question 8. Adverse Effects: Treatments

This Key Question addressed adverse effects of various treatments, including intravesical chemotherapeutic or immunotherapeutic agents and TURBT. We included 22 studies of intravesical therapies that reported harms.

Intravesical Therapy Versus No Intravesical Therapy

  • Four trials of BCG versus no intravesical therapy reported granulomatous cystitis or irritative symptoms in 27 to 84 percent of patients treated with BCG, macroscopic hematuria in 21 to 72 percent, and fever in 27 to 44 percent. Harms were not reported in patients who did not receive intravesical therapy (SOE: low).
  • Evidence on harms associated with non-BCG intravesical therapies versus no intravesical therapy was very limited, although some trials reported an increased risk of local adverse events with intravesical therapies. Evidence was insufficient to determine effects of non-BCG intravesical therapies versus no intravesical therapy on risk of systemic adverse events (SOE: low for local adverse events; insufficient for systemic adverse events).

BCG Versus MMC

  • BCG was associated with increased risk of any local adverse event (2 trials; RR, 2.01; 95% CI, 1.59 to 2.54; I2 = 0%), granulomatous cystitis (5 trials; RR, 1.71; 95% CI, 1.22 to 2.41; I2 = 58%), dysuria (3 trials; 48% vs. 32%; RR, 1.23; 95% CI, 1.03 to 1.46; I2 = 34%), and hematuria (6 trials; RR, 1.78; 95% CI, 1.24 to 2.56; I2 = 62%) versus MMC (SOE: low for local adverse events and dysuria; moderate for granulomatous cystitis and hematuria).
  • BCG was associated with increased risk of any systemic adverse event (2 trials; RR, 2.01; 95% CI, 1.59 to 2.54; I2 = 0%) and fever (4 trials; RR, 4.51; 95% CI, 2.31 to 8.82; I2 = 25%) versus MMC (SOE: low).
  • There was no difference between BCG and MMC in risk of discontinuation of instillations (4 trials; RR, 1.26; 95% CI, 0.39 to 4.01; I2 = 70%) (SOE: low).
  • BCG alone was associated with increased risk of discontinuation of instillations versus BCG plus MMC given sequentially (1 trial; RR, 4.06; 95% CI, 2.09 to 7.86) (SOE: low).

BCG Plus MMC Versus MMC

  • There was no difference between sequentially administered BCG plus MMC and MMC alone in local adverse events (1 trial; RR, 1.36; 95% CI, 0.60 to 3.08) or risk of granulomatous cystitis (1 trial; RR, 1.30; 95% CI, 0.88 to 1.93) (SOE: low).
  • There was no difference between BCG and MMC given sequentially and MMC used alone in systemic adverse events (1 trial; RR, 1.07; 95% CI, 0.63 to 1.84), but BCG plus MMC was associated with increased risk of fever (1 trial; 12% vs. 3%; RR, 3.75; 95% CI, 1.08 to 13) (SOE: low).
  • There was no difference between alternating BCG plus MMC and MMC alone in risk of discontinuation of instillations in patients with CIS (1 trial; RR, 0.54; 95% CI, 0.16 to 1.84) or in patients with Ta or T1 tumors (1 trial; RR, 0.93; 95% CI, 0.52 to 1.65) (SOE: low).

BCG Versus Doxorubicin

  • BCG was associated with increased risk of cystitis versus doxorubicin (1 trial; RR, 17; 95% CI, 1 to 289), but there was insufficient evidence to determine effects on dysuria (3 trials; data not pooled) and hematuria (2 trials; data not pooled) because of small numbers of trials with inconsistent results (SOE: low for cystitis; insufficient for dysuria and hematuria).

BCG Versus Epirubicin

  • BCG was associated with increased risk of local side effects (1 trial; RR, 3.28; 95% CI, 1.26 to 8.53), granulomatous cystitis (4 trials; RR, 1.86; 95% CI, 1.35 to 2.56; I2 = 65%), dysuria (1 trial; RR, 2.43; 95% CI, 1.13 to 5.24), hematuria (4 trials; RR, 1.77; 95% CI, 1.41 to 2.22; I2 = 0%), and fever (2 trials; RR, 9.73; 95% CI, 2.72 to 35; I2 = 0%) versus epirubicin alone, but results were mixed for discontinuation of intravesical therapy (2 trials; data not pooled) (SOE: low for local side effects, dysuria, granulomatous cystitis, hematuria, and fever; insufficient for discontinuation of instillations).
  • BCG alone was associated with increased risk of systemic adverse events (1 trial; RR, 5.97; 95% CI, 2.18 to 16), granulomatous cystitis (1 trial; RR, 2.28; 95% CI, 1.46 to 3.54), and discontinuation of instillations (1 trial; RR, 4.56; 95% CI, 1.35 to 15) versus sequentially administered BCG and epirubicin, but there was no difference in risk of dysuria (1 trial; RR, 1.22; 95% CI, 0.56 to 2.66), hematuria (2 trials; RR, 2.27; 95% CI, 0.86 to 6.00; I2 = 0%), or fever (2 trials; RR, 2.09; 95% CI, 0.48 to 9.02; I2 = 0%) (SOE: low).

BCG Versus Gemcitabine

  • There were no differences between BCG and gemcitabine in risk of local adverse events requiring postponement or discontinuation of intravesical therapy (1 trial; RR, 1.33; 95% CI, 0.32 to 5.49), systemic adverse events (1 trial; RR, 0.50; 95% CI, 0.10 to 2.5), dysuria (2 trials; RR, 1.51; 95% CI, 0.92 to 2.50; I2 = 0%), or hematuria (2 trials; RR, 4.62; 95% CI, 0.78 to 27; I2 = 29%), but BCG was associated with increased risk of fever (2 trials; RR, 6.24; 95% CI, 1.03 to 38; I2 = 5%) (SOE: low).
  • One trial found no difference between BCG alone and BCG plus gemcitabine given sequentially in risk of dysuria (RR, 0.92; 95% CI, 0.52 to 1.65) or hematuria (RR, 0.30; 95% CI, 0.08 to 1.09) (SOE: low).

BCG Versus Interferon

  • BCG was associated with increased risk of dysuria versus interferon alpha-2a (1 trial; RR, 84; 95% CI, 5.29 to 1,319) but no difference in risk of fever (1 trial; RR, 4.82; 95% CI, 0.25 to 94) (SOE: low).
  • BCG alone was associated with increased risk of constitutional symptoms (1 trial; RR, 1.63; 95% CI, 1.12 to 2.38) and fever (1 trial; RR, 2.26; 95% CI, 1.30 to 3.95) versus coadministration of BCG and interferon alpha-2b (SOE: low).

BCG Versus Thiotepa

  • BCG was associated with increased risk of bladder irritability (1 trial; RR, 2.93; 95% CI, 1.45 to 5.90), cystitis (1 trial; RR, 18; 95% CI, 1.11 to 306), and fever (1 trial; RR, 8.36; 95% CI, 0.47 to 150) versus thiotepa (SOE).

MMC Versus Doxorubicin

  • Evidence was insufficient to determine effects of MMC versus doxorubicin on risk of local adverse events, based on inconsistent results from six trials (SOE: insufficient).

MMC Versus Epirubicin

  • One small trial found no difference between MMC and epirubicin 80 mg in risk of urinary symptoms (SOE: low).

MMC Versus Interferon Alpha

  • One trial found MMC to be associated with greater risk of hematuria versus interferon alpha (RR, 2.00; 95% CI, 1.09 to 3.65), decreased risk of fever (RR, 0.13; 95% CI, 0.03 to 0.55), and no difference in risk of dysuria or urinary frequency (SOE: low).

MMC Versus Gemcitabine

  • One trial found MMC to be associated with increased risk of chemical cystitis versus gemcitabine (RR, 3.93; 95% CI, 1.17 to 13.14), with no difference in risk of dysuria or hematuria (SOE: low).

Doxorubicin Versus Epirubicin

  • Doxorubicin was associated with increased risk of chemical cystitis versus epirubicin (1 trial; RR, 1.85; 95% CI, 1.13 to 3.03), with no clear difference in risk of dysuria or urinary frequency (2 trials) or hematuria (3 trials; RR, 1.53; 95% CI, 0.50 to 4.66; I2 = 0%) (SOE: low).

Doxorubicin Versus Thiotepa

  • One trial found no difference between doxorubicin and thiotepa in risk of bladder irritability (RR, 0.92; 95% CI, 0.36 to 2.37) (SOE: low).

Epirubicin Versus Interferon Alpha

  • One trial found no difference between epirubicin and interferon alpha in risk of dysuria or fever (SOE: low).

Key Question 8a. Adverse Effects of Treatments: Patient Characteristics

  • No study evaluated how harms of treatment vary in subgroups defined by patient characteristic, such as age, sex, race/ethnicity, performance status, or medical comorbidities (SOE: insufficient).

Discussion

Key Findings and Strength of Evidence

The key findings of this review are described in the summary-of-evidence table (Table A).

Table A. Summary of the strength of evidence.

Table A

Summary of the strength of evidence.

Urinary biomarkers were associated with sensitivity for bladder cancer that ranged from 0.58 to 0.77 and specificity that ranged from 0.72 to 0.89, for positive likelihood ratios that ranged from 2.18 to 6.10 and negative likelihood ratios that ranged from 0.21 to 0.48. Findings were robust in sensitivity and stratified analyses, although evidence was strongest for quantitative NMP22 and qualitative BTA (SOE: moderate) and relatively sparse for other biomarkers (SOE: low). Across urinary biomarkers, sensitivity was greater for higher stage and higher grade tumors (SOE: high). For qualitative BTA, sensitivity was somewhat higher for evaluation of patients with signs or symptoms of bladder cancer than for surveillance of patients previously treated for bladder cancer, but for quantitative NMP22 there was no clear difference in diagnostic accuracy based on reason for testing. Studies that directly compared the accuracy of quantitative NMP22 and qualitative BTA found no differences in diagnostic accuracy (SOE: moderate). There were too few head-to-head comparisons of other urinary biomarkers to reach firm conclusions regarding comparative accuracy. Sensitivity was increased when urinary biomarkers were used in conjunction with urine cytology (SOE: moderate). No study evaluated clinical outcomes associated with use of urinary biomarkers for diagnosis or surveillance of bladder cancer (SOE: insufficient). Urinary biomarkers miss 23 to 42 percent of patients with bladder cancer and are incorrectly positive in 11 to 28 percent of patients without bladder cancer, which could result in delayed diagnosis or unnecessary cystoscopies and other diagnostic procedures, but no study directly measured effects of inaccurate diagnosis on clinical outcomes (SOE: insufficient).

Most trials found that fluorescent cystoscopy was associated with decreased risk of subsequent bladder recurrence versus white light cystoscopy, but there was no difference in risk of progression or mortality, although data for these outcomes were relatively sparse (SOE: low). In addition, evidence on effects on risk of recurrence was inconsistent, and the only trial25 designed to minimize performance bias (by blinding the cystoscopist to instillation of photosensitizer vs. placebo) found no difference in risk of bladder cancer recurrence.

Intravesical therapy was effective for reducing risk of bladder cancer recurrence versus no intravesical therapy. Compared with no intravesical therapy, BCG was associated with decreased risk of bladder cancer recurrence (RR, 0.63; 95% CI, 0.50 to 0.79) as well as progression (RR, 0.50; 95% CI, 0.32 to 0.77) (SOE: moderate). MMC, doxorubicin, and epirubicin were also associated with decreased risk of bladder cancer recurrence versus no intravesical therapy (RR, 0.66 to 0.80), but effects on bladder cancer progression were not statistically significant (MMC and epirubicin) or showed no effect (doxorubicin). Although trials varied with respect to doses, instillation regimens, and patient populations evaluated, findings were generally robust in sensitivity and subgroup analyses. No intravesical agent, including BCG, was associated with decreased risk of all-cause or bladder cancer–specific mortality versus no intravesical therapy. Evidence on gemcitabine, interferon alpha, and thiotepa was sparse, and we found no randomized trials of valrubicin, paclitaxel, or apaziquone.

Head-to-head trials of intravesical therapy using different drugs showed few clear differences. For BCG versus MMC, the most well-studied comparison, there was no difference on any outcome, including bladder cancer recurrence, progression, or mortality (SOE: moderate). However, BCG was associated with decreased risk of bladder cancer recurrence in the subgroup of trials that evaluated maintenance regimens (SOE: low). Other head-to-head comparisons were evaluated in fewer trials, and in general showed few differences. A possible exception was for BCG versus epirubicin, for which there was some evidence that BCG might be associated with decreased risk of bladder cancer recurrence and progression versus epirubicin (SOE: low). Although doxorubicin was associated with increased risk of bladder cancer recurrence versus epirubicin (RR, 1.56; 95% CI, 1.08 to 2.22), this finding was based on only three trials (SOE: low).26-28 Evidence to determine the effects of tumor characteristics on estimates of effectiveness of intravesical therapies was limited but indicated no differences in risk estimates based on factors such as tumor stage, grade, multiplicity, recurrence status, and size (SOE: low). However, even if relative estimates of effectiveness are similar, absolute effects will vary depending on the underlying incidence of recurrence, progression, mortality, or other outcomes. Therefore, patients with higher stage, higher grade, multiple, recurrent, or larger tumors would be expected to experience greater absolute benefits. Evidence to determine the effects of patient characteristics, such as age, sex, race/ethnicity, performance status, or comorbidities, on estimates of effectiveness of intravesical therapies was not available.

Results from trials that compared effects of intravesical therapy using different doses or instillation regimens for the same agent were difficult to interpret because of variability in the patient populations, doses, instillation regimens, and other factors. For BCG, there were no clear differences between standard and lower doses in risk of bladder cancer recurrence, progression, or mortality, including in patients with higher risk NMIBC, but there was some inconsistency between trials (SOE: low). Limited evidence suggested that BCG maintenance regimens (>6 weeks) are more effective than induction regimens (≤6 weeks) at reducing risk of bladder cancer recurrence in patients with higher risk tumors (SOE: low). Trials on the effects of dose and duration of other intravesical agents on outcomes reported inconsistent results and were clinically heterogeneous, making it difficult to draw strong conclusions (SOE: insufficient to low). However, there is no evidence that prolonging therapy for more than 1 year is more effective than shorter regimens.

Evidence on harms associated with intravesical therapies was more limited than evidence on benefits. Trials of BCG versus no intravesical therapy found that local and systemic adverse events were relatively common (chemical cystitis or irritative symptoms in 27% to 84% of patients, macroscopic hematuria in 21% to 72%, and fever in 27% to 44%) (SOE: low). BCG was also associated with an increased risk of local adverse events and fever versus MMC (SOE: low to moderate). Standard-dose BCG was associated with increased risk of local and systemic adverse events versus lower dose BCG. Few trials reported harms of intravesical agents other than BCG versus no intravesical therapy or versus another intravesical agent.

The only randomized trial of radiation therapy found no effects on recurrence, progression, or survival in patients with T1 Grade (G) 3 cancers when compared with no radiotherapy (for unifocal cancers and no CIS) or against intravesical therapy (for multifocal disease or CIS) (SOE: low).29

Findings in Relationship to What Is Already Known

Our findings on diagnostic accuracy were generally consistent with prior systematic reviews that found urinary biomarkers insufficiently accurate to replace cystoscopy.30-32 Estimates for sensitivity and specificity were generally similar in our review and prior reviews, even though we excluded case-control studies and included more recently published studies. In addition, prior reviews did not evaluate potential differences in diagnostic accuracy for testing performed for evaluation of signs and symptoms of bladder cancer versus for surveillance.

Prior systematic reviews33,34 found fluorescent cystoscopy to be associated with decreased risk of recurrent bladder cancer versus white light cystoscopy, but they were published prior to a recent trial that was the only one to blind the cystoscopist to instillation of the photosensitizer and found no effect.25 Like our report, prior reviews found no effect of fluorescent cystoscopy on risk of progression or mortality. Although prior reviews also found that fluorescent cystoscopy detected more bladder cancers on initial cystoscopy, this was not an assessed outcome for our review.

Our findings regarding the comparative effectiveness and harms of intravesical therapies are generally consistent with prior reviews that found intravesical therapy to be associated with decreased risk of bladder cancer recurrence versus no intravesical therapy35,36 and found BCG to be associated with decreased risk of bladder cancer progression. Prior systematic reviews that focused on immediate single-instillation therapy also found intravesical therapy to be more effective than no intravesical therapy in reducing risk of bladder cancer recurrence, a conclusion consistent with our finding of no clear difference in risk estimates based on the type of instillation regimen.37-39 Like our review, a prior systematic review found that maintenance therapy with BCG was associated with decreased risk of bladder cancer versus MMC, despite some differences in the trials that were included, definitions of maintenance therapy, and use of individual patient data in the prior review.40 Our findings are also consistent with prior systematic reviews that found BCG to be associated with decreased risk of bladder cancer versus epirubicin,41 that the evidence on intravesical gemcitabine is limited,42 and that the optimal dose and duration of intravesical therapy cannot be determined based on the available evidence.43

Applicability

Some issues could impact the applicability of our findings. Some studies of diagnostic accuracy did not report results separately for patients undergoing evaluation of signs and symptoms of bladder cancer and those undergoing surveillance, although there is some evidence that diagnostic accuracy may vary based on the indication for testing. Studies of intravesical therapy varied in the doses used; the timing, number, frequency, and duration of instillations; and other factors (e.g., the BCG strain), making it difficult to reach conclusions that are widely generalizable. In addition, trials varied with regard to tumor characteristics in the patient populations evaluated. Another factor that potentially impacts applicability is that most studies focused on effects of intravesical therapy on recurrence of bladder cancer. Fewer trials evaluated more potentially serious distal outcomes, such as progression or mortality. A number of studies were conducted in Japan, where management of bladder cancer may differ from that in the United States. Treatment studies tended to exclude patients with significant comorbidities or poor general performance status, which could limit applicability to these populations. Very little information was available to determine whether diagnostic accuracy or treatment effects vary according to patient factors, such as age, sex, race/ethnicity, performance status, or comorbidities.

Implications for Clinical and Policy Decisionmaking

Our review has implications for clinical and policy decisionmaking. As there are no studies evaluating effects of using urinary biomarkers for diagnosis or surveillance of bladder cancer on clinical outcomes, decisions regarding their use must necessarily be made on the basis of diagnostic test performance. Table B shows estimated probabilities for bladder cancer following use of urinary biomarkers, based on likelihood ratios calculated from pooled sensitivities and specificities. In populations with a pretest probability of 5 percent, the post-test probability increased to 16 to 24 percent following a positive result and decreased to 1.8 to 2.5 percent following a negative result. In settings with a pretest probability of 20 percent, the post-test probability increased to 37 to 60 percent following positive results and decreased to 8.0 to 11 percent following a negative result. Whether urinary biomarkers are sufficiently accurate to rule out bladder cancer and thereby reduce the need for cystoscopy depends on the ability of clinicians to estimate the pretest probability of disease and the acceptable threshold for a missed or delayed diagnosis. Use of urinary biomarkers in combination with urinary cytology increases the sensitivity for bladder cancer, but still misses about 10 percent of cases. Regarding fluorescent cystoscopy, studies have not shown an effect on progression or mortality, and trials that found reduced risk of recurrence may have been affected by performance bias. These findings might inform decisions regarding widespread adoption of fluorescent cystoscopy.

Table B. Post-test probability of bladder cancer using different biomarkers.

Table B

Post-test probability of bladder cancer using different biomarkers.

Our findings also have implications for use of intravesical therapy. Although intravesical therapy was associated with decreased risk of bladder cancer recurrence, there were no clear effects on bladder cancer–specific or all-cause mortality, and intravesical therapies were associated with local and systemic adverse events. Our findings are consistent with guidelines that recommend BCG as first-line therapy.10,44 As no intravesical agent was more effective than BCG at reducing risk of bladder cancer recurrence, BCG is the only intravesical agent associated with decreased risk of bladder cancer progression versus no intravesical therapy, and some evidence indicates that BCG is associated with decreased risk of bladder cancer recurrence versus other intravesical agents. However, BCG is also associated with a high risk of adverse events. Some evidence indicates that using lower than standard doses of BCG maintains effectiveness while reducing harms. Other evidence suggests that longer courses of therapy may be necessary for optimal effects, particularly in higher risk patients. Therefore, decisions to use intravesical therapy and regarding the intravesical agent, doses, and regimen selected should take into account the tradeoffs between potential benefits and harms. Benefits are likely to be higher in patients at higher risk for disease progression and harms.

Limitations of the Review Process

Substantial statistical heterogeneity was present in most pooled analyses of diagnostic accuracy; this situation is common in meta-analyses of diagnostic accuracy.45-47 As noted in the “Cochrane Handbook for Systematic Reviews of Diagnostic Test Accuracy,” “heterogeneity is to be expected in meta-analyses of diagnostic test accuracy.”47 To address the anticipated heterogeneity, we used random-effects models to pool studies and stratified studies according to the reason that imaging was performed and the unit of analysis used. We also performed additional stratified and sensitivity analyses based on the reference standard used, study characteristics (such as country in which the study was conducted, factors related to risk of bias), patient characteristics, and technical factors related to the imaging tests under investigation. Results were generally robust in sensitivity analyses, despite the heterogeneity. We also focused on evaluations of comparative test performance based on within-study comparisons of imaging modalities, which tended to be associated with less heterogeneity than pooled across-study estimates. A limitation of our analysis of within-group comparisons is that we had to treat the two compared groups as independent because we had aggregated data only. Individual patient-level data would be required to take into account the paired nature of the comparisons. Such correlations are generally positive and would be expected to result in more narrow CIs. Although it is possible that this could have caused us not to detect statistically significant differences, the point estimates indicated very little difference between tests.

We did not construct summary receiver operating characteristic curves. Almost all studies of a specific urinary biomarker used the same definition for a positive test, including tests based on a quantitative threshold. Estimates of sensitivity and specificity at different thresholds are needed to construct informative receiver operating characteristic curves.48

Statistical heterogeneity was also present in some analyses of intravesical therapies and fluorescent cystoscopy. To address this, we used the Dersimonian-Laird random-effects model to pool studies. The Dersimonian-Laird random-effects model may result in CIs that are too narrow when heterogeneity is present, particularly when the number of studies is small.24 Therefore, we repeated analyses using the profile likelihood method, which resulted in similar findings. Regardless of the method used, meta-analyses based on small numbers of trials can underestimate statistical heterogeneity and must be interpreted with caution.24 We also stratified trials according to factors such as risk-of-bias rating, dose, number of instillations, duration of followup, enrollment of patients with high-risk NMIBC, and other factors. Although statistical heterogeneity remained present in some analyses, with some unexplained outlier trials, results were generally robust.

We excluded non–English-language articles and did not search for studies published only as abstracts. Because of small numbers of trials for meta-analyses involving intravesical therapies, we did not formally assess for publication bias using statistical or graphical methods for assessing sample size effects, as research indicates that such methods can be seriously misleading in such situations.49,50 For fluorescent cystoscopy, we found one relatively large trial that showed no effect on risk of recurrence versus white light cystoscopy, suggesting that publication bias could have impacted results.51

Limitations of the Evidence Base

Several limitations of the evidence base limited our ability to reach strong conclusions with regard to several aspects of diagnosis and treatment of NMIBC. Other than quantitative NMP22 and qualitative BTA, urinary biomarkers were assessed in small numbers of studies (6 or fewer), resulting in less precise estimates. In addition, most of the evidence on comparative accuracy was indirect, as few studies directly compared the accuracy of two or more biomarkers against cystoscopy and histopathology.

For fluorescent cystoscopy, a limitation of the evidence base is that few trials reported effects on progression or mortality, and instead mostly focused on evaluating effects on recurrence. In addition, only one trial of fluorescent cystoscopy blinded the cystoscopist to whether the photosensitizer had been instilled, which may have an impact on assessments of recurrence because of performance bias related to knowledge of the type of initial cystoscopy performed.

A limitation of the evidence for all Key Questions addressed in our review is that very few trials were assessed as low risk of bias. Methodological shortcomings included failure to adequately describe randomization and allocation concealment methods, and unblended design. Findings would be stronger if more high-quality trials were available.

Other limitations include the lack of evidence on how use of urinary biomarkers impacts clinical outcomes (including harms), the evidence from only a single randomized trial on effects of radiation therapy for NMIBC, no trials on effects of using a risk-adapted approach, and no studies on how using different surveillance intervals impacts outcomes. Few studies evaluated effects of patient characteristics, such as age, sex, race/ethnicity, performance status, or comorbidities, on diagnostic test performance or effectiveness of intravesical therapy.

Research Gaps

We identified a number of important research gaps. Given the increased sensitivity of urinary biomarkers with cytology, studies on how this combination impacts use of cystoscopy and subsequent clinical outcomes might be helpful for determining its role in diagnosis or surveillance. Randomized trials that adequately safeguard against performance bias associated with use of photosensitizers for fluorescent cystoscopy are needed to determine effects on recurrence, progression, and mortality. Additional head-to-head trials of intravesical therapies that use more standardized instillation regimens and doses, report outcomes in subgroups stratified by patient and tumor characteristics, and include long-term outcomes related to progression and mortality would help clarify optimal treatment strategies. Research is also needed to determine the effectiveness of risk-adapted approaches to guide selection of therapy, including use of nontraditional prognostic markers, effects of different surveillance intervals and protocols, and newer techniques such as electromotive administration of intravesical therapy.

Conclusions

Urinary biomarkers are falsely negative in a substantial proportion of patients with bladder cancer, and additional research is needed to clarify advantages of fluorescent cystoscopy over white light cystoscopy. Intravesical therapy reduces risk of bladder cancer recurrence versus no intravesical therapy. BCG is the only intravesical therapy shown to be associated with decreased risk of bladder cancer progression, but it is associated with a high rate of adverse events.

References

1.
Siegel R, Ma J, Zou Z, et al. Cancer statistics, 2014. CACancer J for Clin. 2014;64(1):9–29. [PubMed: 24399786]
2.
U.S. Cancer Statistics Working Group. United States Cancer Statistics: 1999–2009 Incidence and Mortality Web-based Report. Atlanta: U.S. Department of Health and Human Services, Centers for Disease Control and Prevention and National Cancer Institute; 2013. [November 4, 2014]. www​.cdc.gov/uscs.
3.
National Cancer Institute; Surveillance, Epidemiology and End Results Program: Turning Cancer Data Into Discovery. Browse the SEER Cancer Statistics Review (CSR) 1975–2009. [October 16, 2014]. http://seer​.cancer.gov​/archive/csr/1975_2009_pops09​/browse_csr​.php?sectionSEL=27&pageSEL​=sect_27_table.01.html.
4.
James AC, Gore JL. The costs of non-muscle invasive bladder cancer. Urol Clin North Am. 2013 May;40(2):261–9. [PubMed: 23540783]
5.
American Cancer Society. Bladder Cancer; Early Detection, Diagnosis, and Staging Topics. [October 17, 2014]. www​.cancer.org/cancer​/bladdercancer/detailedguide​/bladder-cancer-survival-rates.
6.
Kaufman DS, Shipley WU, Feldman AS. Bladder cancer. Lancet. 2009 Jul 18;374(9685):239–49. [PubMed: 19520422]
7.
American Cancer Society. Bladder Cancer; Survival rates for bladder cancer by stage. 2013. [October 17th 2014]. http://www​.cancer.org​/cancer/bladdercancer​/detailedguide/bladder-cancer-survival-rates.
8.
American Cancer Society. Bladder Cancer; How is bladder cancer diagnosed? [October 3rd, 2014]. http://www​.cancer.org​/cancer/bladdercancer​/detailedguide/bladder-cancer-diagnosis.
9.
Herman MP, Svatek RS, Lotan Y, et al. Urine-based biomarkers for the early detection and surveillance of non-muscle invasive bladder cancer. Minerva Urologica e Nefrologica. 2008 Dec;60(4):217–35. [PubMed: 18923359]
10.
Hall MC, Chang SS, Dalbagni G, et al. Guideline for the management of nonmuscle invasive bladder cancer (stages Ta, T1, and Tis): 2007 update. J Urol. 2007 Dec;178(6):2314–30. [PubMed: 17993339]
11.
Babjuk M, Burger M, Zigeuner R, et al. EAU guidelines on non-muscle-invasive urothelial carcinoma of the bladder: update 2013. Eur Urol. 2013 Oct;64(4):639–53. [PubMed: 23827737]
12.
Methods Guide for Effectiveness and Comparative Effectiveness Reviews. Rockville, MD: Agency for Healthcare Research and Quality; Jan, 2014. AHRQ Publication No. 10(14)-EHC063-EF. Chapters available at www​.effectivehealthcare.ahrq.gov.
13.
Methods Guide for Medical Test Reviews. Rockville, MD: Agency for Healthcare Research and Quality; Jun, 2012. AHRQ Publication No. 12-EHC017. www​.effectivehealthcare​.ahrq.gov/reports/final.cfm.
14.
Altman DG, Bland JM. Diagnostic tests 3: receiver operating characteristic plots. BMJ. 1994 Jul 16;309(6948):188. [PMC free article: PMC2540706] [PubMed: 8044101]
15.
Zweig MH, Campbell G. Receiver-operating characteristic (ROC) plots: a fundamental evaluation tool in clinical medicine. Clin Chem. 1993 Apr;39(4):561–77. [PubMed: 8472349]
16.
U.S. Preventive Services Task Force. US Preventive Services Task Force Procedure Manual. Jul, 2008. AHRQ Publication No. 08-05118-EF. www​.uspreventiveservicestaskforce.org.
17.
Whiting PF, Rutjes AWS, Westwood ME, et al. QUADAS-2: a revised tool for the quality assessment of diagnostic accuracy studies. Ann Internal Med. 2011;155(8):529–36. [PubMed: 22007046]
18.
Fu R, Gartlehner G, Grant M, et al. Methods Guide for Effectiveness and Comparative Effectiveness Reviews. Rockville, MD: Agency for Healthcare Research and Qualit;; Jan, 2014. Conducting quantitative synthesis when comparing medical interventions. AHRQ Publication No. 10(14)-EHC063-EF. Chapters available at www​.effectivehealthcare.ahrq.gov.
19.
Chu H, Cole SR. Bivariate meta-analysis of sensitivity and specificity with sparse data: a generalized linear mixed model approach. J Clin Epidemiol. 2006 Dec;59(12):1331–2. author reply 2-3. [PubMed: 17098577]
20.
Zwinderman AH, Bossuyt PM. We should not pool diagnostic likelihood ratios in systematic reviews. Stat Med. 2008 Feb 28;27(5):687–97. [PubMed: 17611957]
21.
Trikalinos TA, Balion CM, Coleman CI, et al. Chapter 8: meta-analysis of test performance when there is a “gold standard” J Gen Intern Med. 2012 Jun;27 Suppl 1:S56–66. [PMC free article: PMC3364353] [PubMed: 22648676]
22.
Fu R, Gartlehner G, Grant M, et al. Conducting quantitative synthesis when comparing medical interventions: AHRQ and the Effective Health Care Program. J Clin Epidemiol. 2011 Nov;64(11):1187–97. [PubMed: 21477993]
23.
Higgins JP, Thompson SG, Deeks JJ, et al. Measuring inconsistency in meta-analyses. BMJ. 2003 Sep 6;327(7414):557–60. [PMC free article: PMC192859] [PubMed: 12958120]
24.
Cornell JE, Mulrow CD, Localio R, et al. Random-effects meta-analysis of inconsistent effects: a time for change. Ann Intern Med. 2014 Feb 18;160(4):267–70. [PubMed: 24727843]
25.
Stenzl A, Penkoff H, Dajc-Sommerer E, et al. Detection and clinical outcome of urinary bladder cancer with 5-aminolevulinic acid-induced fluorescence cystoscopy : a multicenter randomized, double-blind, placebo-controlled trial. Cancer. 2011 Mar 1;117(5):938–47. [PubMed: 21351082]
26.
Ali-el-Dein B, el-Baz M, Aly AN, et al. Intravesical epirubicin versus doxorubicin for superficial bladder tumors (stages pTa and pT1): a randomized prospective study. J Urol. 1997 Jul;158(1):68–73. discussion -4. [PubMed: 9186325]
27.
Eto H, Oka Y, Ueno K, et al. Comparison of the prophylactic usefulness of epirubicin and doxorubicin in the treatment of superficial bladder cancer by intravesical instillation: a multicenter randomized trial. Kobe University Urological Oncology Group. Cancer Chemother Pharmacol. 1994;35 Suppl:S46–51. [PubMed: 7994786]
28.
Shuin T, Kubota Y, Noguchi S, et al. A phase II study of prophylactic intravesical chemotherapy with 4'-epirubicin in recurrent superficial bladder cancer: comparison of 4'-epirubicin and adriamycin. Cancer Chemother Pharmacol. 1994;35 Suppl:S52–6. [PubMed: 7994787]
29.
Harland SJ, Kynaston H, Grigor K, et al. A randomized trial of radical radiotherapy for the management of pT1G3 NXM0 transitional cell carcinoma of the bladder. J Urol. 2007 Sep;178(3 Pt 1):807–13. discussion 13. [PubMed: 17631326]
30.
Lotan Y, Roehrborn CG. Sensitivity and specificity of commonly available bladder tumor markers versus cytology: results of a comprehensive literature review and meta-analyses. Urology. 2003 Jan;61(1):109–18. discussion 18. [PubMed: 12559279]
31.
Van Rhijn BW, van der Poel HG, van der Kwast TH. Urine markers for bladder cancer surveillance: a systematic review. Eur Urol. 2005 Jun;47(6):736–48. [PubMed: 15925067]
32.
Hajdinjak T. UroVysion FISH test for detecting urothelial cancers: meta-analysis of diagnostic accuracy and comparison with urinary cytology testing. Urol Oncol. 2008 Nov-Dec;26(6):646–51. [PubMed: 18367109]
33.
Burger M, Grossman HB, Droller M, et al. Photodynamic diagnosis of non-muscle-invasive bladder cancer with hexaminolevulinate cystoscopy: a meta-analysis of detection and recurrence based on raw data. Eur Urol. 2013 Nov;64(5):846–54. [PubMed: 23602406]
34.
Kausch I, Sommerauer M, Montorsi F, et al. Photodynamic diagnosis in non-muscle-invasive bladder cancer: a systematic review and cumulative analysis of prospective studies. Eur Urol. 2010 Apr;57(4):595–606. [PubMed: 20004052]
35.
Shelley MD, Kynaston H, Court J, et al. A systematic review of intravesical bacillus Calmette-Guerin plus transurethral resection vs transurethral resection alone in Ta and T1 bladder cancer. BJU Int. 2001 Aug;88(3):209–16. [PubMed: 11488731]
36.
Han RF, Pan JG. Can intravesical bacillus Calmette-Guerin reduce recurrence in patients with superficial bladder cancer? A meta-analysis of randomized trials. Urology. 2006 Jun;67(6):1216–23. [PubMed: 16765182]
37.
Sylvester RJ, Oosterlinck W, van der Meijden APM. A single immediate postoperative instillation of chemotherapy decreases the risk of recurrence in patients with stage Ta T1 bladder cancer: a meta-analysis of published results of randomized clinical trials. J Urol. 2004;171(6 Pt 1):2186–90. [PubMed: 15126782]
38.
Abern MR, Owusu RA, Anderson MR, et al. Perioperative intravesical chemotherapy in non-muscle-invasive bladder cancer: a systematic review and meta-analysis. J Natl Compr Canc Netw. 2013 Apr 1;11(4):477–84. [PubMed: 23584348]
39.
Perlis N, Zlotta AR, Beyene J, et al. Immediate post-transurethral resection of bladder tumor intravesical chemotherapy prevents non-muscle-invasive bladder cancer recurrences: an updated meta-analysis on 2548 patients and quality-of-evidence review. Eur Urol. 2013 Sep;64(3):421–30. [PubMed: 23830475]
40.
Malmstrom P-U, Sylvester RJ, Crawford DE, et al. An individual patient data meta-analysis of the long-term outcome of randomised studies comparing intravesical mitomycin C versus bacillus Calmette-Guerin for non-muscle-invasive bladder cancer. Eur Urol. 2009 Aug;56(2):247–56. [PubMed: 19409692]
41.
Shang PF, Kwong J, Wang ZP, et al. Intravesical Bacillus Calmette-Guerin versus epirubicin for Ta and T1 bladder cancer. Cochrane Database Syst Rev. 2011;(5) CD006885. [PubMed: 21563157]
42.
Shelley MD, Jones G, Cleves A, et al. Intravesical gemcitabine therapy for non-muscle invasive bladder cancer (NMIBC): a systematic review. BJU Int. 2012 Feb;109(4):496–505. [PubMed: 22313502]
43.
Sylvester RJ, Oosterlinck W, Witjes JA. The schedule and duration of intravesical chemotherapy in patients with non-muscle-invasive bladder cancer: a systematic review of the published results of randomized clinical trials. Eur Urol. 2008 Apr;53(4):709–19. [PMC free article: PMC2587437] [PubMed: 18207317]
44.
Babjuk M, Bohle A, Burger E, et al. Guidelines on Non-Muscle-Invasive Bladder Cancer (Ta, T1 and CIS). Arnhem, The Netherlands: 2014.
45.
Lijmer JG, Bossuyt PM, Heisterkamp SH. Exploring sources of heterogeneity in systematic reviews of diagnostic tests. Stat Med. 2002 Jun 15;21(11):1525–37. [PubMed: 12111918]
46.
Gatsonis C, Paliwal P. Meta-analysis of diagnostic and screening test accuracy evaluations: methodologic primer. AJR Am J Roentgenol. 2006 Aug;187(2):271–81. [PubMed: 16861527]
47.
Macaskill P, Gatsonis C, Deeks JJ, et al. Chapter 10. Analysing and presenting results, version 10. In: Deeks JJ, Bossuyt PM, Gatsonis C, editors. Cochrane Handbook for Systematic Reviews of Diagnostic Test Accuracy. The Cochrane Collaboration; 2010. http://srdta​.cochrane.org/
48.
Chappell FM, Raab GM, Wardlaw JM. When are summary ROC curves appropriate for diagnostic meta-analyses? Stat Med. 2009 Sep 20;28(21):2653–68. [PubMed: 19591118]
49.
Deeks JJ, Macaskill P, Irwig L. The performance of tests of publication bias and other sample size effects in systematic reviews of diagnostic test accuracy was assessed. J Clin Epidemiol. 2005 Sep;58(9):882–93. [PubMed: 16085191]
50.
Song F, Khan KS, Dinnes J, et al. Asymmetric funnel plots and publication bias in meta-analyses of diagnostic accuracy. Int J Epidemiol. 2002 Feb;31(1):88–95. [PubMed: 11914301]
51.
Alken P, Siegsmund M, Gromoll-Berbmann K, et al. A randomised controlled multicenter trial to compare the effects of transurethral detection and resection of bladder carcinomas under 5-ALA induced fluorescence light to conventional white light [abstract 593] Eur Urol Suppl. 2007;6:171.

Views

  • PubReader
  • Print View
  • Cite this Page
  • PDF version of this title (6.0M)

Related information

  • PMC
    PubMed Central citations
  • PubMed
    Links to PubMed

Recent Activity

Your browsing activity is empty.

Activity recording is turned off.

Turn recording back on

See more...