FormalPara Key Points

Pembrolizumab was estimated to reduce costs and extend quality-adjusted life-years (QALYs) compared with active comparators for the adjuvant treatment of completely resected stage III melanoma, dominating ipilimumab in the overall population and dabrafenib + trametinib in the BRAF mutation-positive subgroup.

Pembrolizumab increased costs relative to the strategy of routine observation in the overall population, with sufficiently higher QALYs to be considered cost-effective from a US health system perspective.

1 Introduction

Melanoma is a type of skin cancer that develops from specialized pigmented cells known as melanocytes. In the USA, an estimated 91,270 new cases of melanoma were diagnosed in 2018, with 9320 associated deaths [1]. In the USA, 84% of patients with melanoma are initially diagnosed at stage I–II (localized), 9% at stage III (loco-regional), and 4% at stage IV (distant metastases), with 5-year survival ranging from 98% for stage I to 23% for stage IV melanoma [2]. Although most patients are diagnosed with localized disease and are cured, 20–30% of patients with early-stage melanoma later develop recurrence with higher rates observed in patients with regionally advanced disease [3].

The standard of care for resectable stage III disease is surgical excision with safety margins based on the presence and depth of invasion plus lymphadenectomy if regional lymph nodes are involved, followed by observation alone or with adjuvant therapy [4, 5]. Systemic adjuvant therapy is recommended for patients with a high post-operative recurrence risk based on factors including tumor site, tumor thickness, ulceration, tumor mitotic count, or lymph node involvement [5,6,7]. Adjuvant therapy reduces the risk of recurrence and mortality by targeting residual micrometastatic disease [8].

The first adjuvant treatments to receive US Food and Drug Administration (FDA) approval were high-dose interferon-α2b and peginterferon-α2b, which showed modest efficacy and considerable toxicity [9,10,11,12]. Interferon-based regimens have since been displaced by the introduction of immune checkpoint inhibitors (anti-lymphocyte antigen-4 [CTLA4] and anti-programmed death-1 [PD-1] monoclonal antibodies) and targeted drugs (BRAF and MEK inhibitors, which are active exclusively against BRAF-mutated melanoma) [13]. Adjuvant high-dose ipilimumab, a CTLA4 inhibitor, was FDA-approved in 2015 based on the European Organization for Research and Treatment of Cancer (EORTC) 18071 trial. At 5 years, recurrence-free survival (RFS) was 40.8% with ipilimumab versus 30.3% with placebo, with overall survival (OS) of 65.4% versus 54.4% [14, 15]. The PD-1 checkpoint inhibitor nivolumab was approved in 2017 for resected stage III/IV melanoma based on improvement in RFS versus ipilimumab in the CheckMate-238 trial (hazard ratio [HR] = 0.65, p < 0.0001) [16, 17]. For resected stage III BRAF mutation-positive (BRAF+) melanoma, dabrafenib + trametinib combination therapy was approved in 2018 based on improvements in RFS versus placebo in the COMBI-AD trial (HR = 0.47, p < 0.0001) [18, 19].

Pembrolizumab is the latest adjuvant therapy to be FDA-approved (15 February 2019) for the adjuvant treatment of completely resected melanoma with lymph node involvement [20]. Pembrolizumab, an immune checkpoint inhibitor, is a highly selective monoclonal antibody that binds and blocks the PD-1 receptor of lymphocytes thereby re-establishing anti-tumor immunity by reactivating the tumor-specific cytotoxic T-lymphocytes that destroy tumor cells. FDA approval was based on results of EORTC-1325/KEYNOTE-054, a randomized, double-blind, placebo-controlled trial of 1,019 patients with completely resected, stage IIIA (> 1 mm lymph node metastasis), IIIB, or IIIC melanoma. Over 15 months of median follow-up, patients randomized to pembrolizumab experienced fewer recurrences/deaths (26.3% vs. 42.8% with placebo; HR = 0.57, p < 0.001) [21]. Secondary endpoints of distant metastases-free survival (DMFS) and OS will be evaluated in the second interim and final analyses of KEYNOTE-054.

Despite recent progress in the adjuvant treatment of high-risk surgically resected melanoma, there are currently no published economic evaluations comparing different novel therapies in this indication. This study aimed to evaluate cost-effectiveness of adjuvant pembrolizumab following complete resection of stage III melanoma versus other adjuvant strategies. Comparators included observation (based on direct comparative evidence from KEYNOTE-054 [21]) and other adjuvant therapies (based on indirect comparative evidence from a systematic literature review and network meta-analysis [NMA] [22]).

2 Methods

2.1 Model Overview

A decision-analytic model was implemented in Excel 2016 (Microsoft Corp., Redmond, WA, USA) to examine the cost-effectiveness of adjuvant treatments for resected stage III melanoma. The analysis adopted a lifetime horizon and 1-week cycle length with half-cycle correction. Outcomes included life-years (LYs), quality-adjusted LYs (QALYs), costs, and incremental cost-effectiveness ratios (ICERs). Annual discounting by 3.0% was used for both costs and effects, and direct healthcare costs were included per a US health system perspective, as stipulated by the reference case of the Second Panel on Cost-Effectiveness in Health and Medicine [23]. Where applicable, cost inputs were inflation-adjusted to 2019 US dollars (USD) using the medical care component of the Consumer Price Index [24].

2.2 Target Populations

The overall target population included adults with complete resection of high-risk stage IIIA (> 1 mm), IIIB, or IIIC melanoma, consistent with enrollment criteria in KEYNOTE-054 and EORTC-18071. The BRAF+ subgroup was modelled separately to include comparisons with dabrafenib + trametinib.

2.3 Intervention and Comparators

Pembrolizumab was modelled based on the FDA-approved, trial-based dosing schedule of 200 mg administered intravenously (IV) every 3 weeks for up to 1 year or 18 doses.

Other FDA-approved adjuvant treatments recommended by National Comprehensive Cancer Network (NCCN) guidelines for resected stage III melanoma include nivolumab, dabrafenib + trametinib (BRAF+ subgroup), ipilimumab, and observation [5]. As of 2019, the guideline-recommended usage of ipilimumab is limited to cases of nodal recurrence after exposure to anti-PD-1 therapy; ipilimumab was nevertheless considered a relevant comparator based on its licensing and continued usage in clinical practice. Comparators in the overall target population thus included ipilimumab (10 mg/kg IV every 3 weeks for four doses followed by every 12 weeks, up to 3 years) and the observation strategy. Dabrafenib + trametinib (dabrafenib 150 mg orally twice daily and trametinib 2 mg orally once daily, up to 1 year) was compared to pembrolizumab and observation in the BRAF+ subgroup.

Nivolumab was not evaluated as a comparator after the feasibility assessment concluded that a comparison via NMA was not feasible due to confounding differences between CheckMate 238 and other trials in the network, including differences in eligibility criteria and control arm protocol [22]. Specifically, ipilimumab was administered for up to 1 year in CheckMate 238 [16], compared with up to 3 years in EORTC 18071 [14, 15]. Additionally, CheckMate 238 enrolled patients with resected stage IIIB, IIIC, or IV melanoma [16], while EORTC 18071 [14, 15] and KEYNOTE-054 [21] enrolled patients with resected stage IIIA–IIIC melanoma and excluded those with stage IV disease. Because cross-trial differences with respect to ipilimumab treatment duration and stage of melanoma could not be adjusted and would result in biased comparisons of RFS between nivolumab versus other adjuvant therapies, nivolumab was excluded from the prior NMA study [22] and cost-effectiveness model.

2.4 Model Structure

A Markov cohort structure was developed in accordance with methods guidance for state transition models [25]. The structure was previously applied in a submission to the National Institute for Health and Care Excellence (NICE) and in a published US-based economic evaluation focusing on the comparison of pembrolizumab versus observation [26, 27]. In this new application, the model framework was expanded to evaluate a broader set of adjuvant comparators.

Health states included recurrence-free (RF), locoregional recurrence (LR), distant metastases (DM), and death (Fig. 1). Patients started in the RF state following surgical resection, with age and percentage female consistent with the KEYNOTE-054 population at baseline (overall: 54 years, 38.4%; BRAF+: 52 years, 40.2%). Adjuvant therapy affected patients’ risks of transitioning directly from RF to LR, DM, or death, but was assumed to provide no continuing therapeutic effect after LR or DM. Risks of transitioning from LR to DM or death were thus assumed equivalent across adjuvant strategies. Following DM, patients were assumed to receive first- and second-line therapies for advanced melanoma. Risks of transitioning from DM to death depended on the efficacy of the first-line therapies received. The distribution of individuals across health states was used in conjunction with state-specific utilities and costs to estimate LYs, QALYs, and costs.

Fig. 1
figure 1

Markov model schematic

2.5 Model Parameters

2.5.1 Efficacy

A parametric multi-state modeling approach was used to estimate transition probabilities (TPs) between states [28,29,30,31]. Base-case parameter estimates are summarized in Table 1 (overall population) and the online supplement (BRAF+ subgroup). Background mortality rates served as a lower bound for all TPs to death [32].

Table 1 Parametric models of health state transitions in the model

Transitions starting from RF TPs from RF to other states were estimated using KEYNOTE-054 interim patient-level data [21] and an NMA of adjuvant drug trials [22]. Parametric distributions were fitted in to the cause-specific hazards of each transition (RF → LR, RF → DM, and RF → death) in the KEYNOTE-054 pembrolizumab and placebo arms, using R software (R Development Core Team, Vienna, Austria [33]). To account for competing risks, competing types of RFS failure were treated as censoring events [28,29,30,31]. For example, to model the RF → DM transition, patients with LR or death before DM were censored at the earlier competing event.

Six candidate distributions were considered to model transitions from RF → LR and RF → DM, including exponential, Weibull, Gompertz, log-logistic, log-normal, and generalized gamma. Exponential distributions were used for RF → death due to the small number of events [21]. All TPs from RF depended upon all three cause-specific hazard functions. Base-case distributions were therefore selected from the 36 possible combinations of distributions for RF → LR and RF → DM. In line with the approach recommended by the NICE Decision Support Unit (DSU) for partitioned survival models [34], the same base-case distribution types were selected for both treatment arms, with consideration of: goodness-of-fit between predicted versus observed RFS, and plausibility of long-term extrapolations based on external data [15]. As noted by the NICE DSU in separate guidance [30], assessing goodness-of-fit is more challenging in the context of multi-state models than partitioned survival models, as survival endpoints are determined by a combination of survival functions rather than by a single survival function [30]. Because Akaike Information Criterion and Bayesian Information Criterion do not provide suitable measures of fit in a multi-state model context, goodness-of-fit was instead assessed using mean squared error and the visual inspection approach described by Williams et al. [28]. Additional details are provided in the Online Supplemental Material.

Base-case distributions for RF → LR and RF → DM in the overall population were Gompertz and generalized gamma, respectively (Table 1). This combination was ranked first for placebo and fifth for pembrolizumab according to goodness-of-fit. Base-case distributions in the BRAF+ subgroup were Gompertz (RF → LR) and log-normal (RF → DM), ranked first for placebo and sixth for pembrolizumab. The Online Supplemental Material presents the overlay of predicted versus observed RFS for both treatment arms. Alternative distributions were tested in scenario analyses, summarized in Table 1 and detailed in the Online Supplemental Material.

For ipilimumab and dabrafenib + trametinib, cause-specific hazards of transitioning from the RF state were derived using time-varying HRs of RFS for comparators versus pembrolizumab, estimated via second-order fractional polynomial NMA [22]. Within each weekly cycle, the HR was applied to the overall hazard of RFS failure under pembrolizumab. The proportion of the overall hazard attributable to each RFS failure type was assumed to be the same as in the pembrolizumab arm. To avoid clinically implausible long-term extrapolations, time-varying HRs for ipilimumab versus pembrolizumab were held constant after 70 weeks, corresponding to mean follow-up in KEYNOTE-054 at the first interim cutoff date. Due to the steeper upward trajectory of time-varying HRs for dabrafenib + trametinib versus pembrolizumab in the BRAF+ subgroup NMA, HRs for this comparator were conservatively held constant after 52 weeks. Time-constant HRs based on fixed-effects NMA were tested in scenario analyses.


Transitions starting from LR In the absence of mature post-recurrence data from KEYNOTE-054, an exponential model of LR → DM was fitted using the Flatiron Health electronic medical record database (1/1/2011–2/28/2018) [35]. A real-world US cohort of 147 patients was followed from the date of LR after resection of stage III melanoma until DM or censoring. Because no direct transitions from LR → death were observed in the sample, the cause-specific hazard of LR → death was assumed equal to that of RF → death estimated for pembrolizumab in KEYNOTE-054.


Transitions starting from DM For each adjuvant treatment, TPs from DM → death depended upon market shares of first-line treatments for advanced melanoma. The model also considered second-line treatment costs, but assumed that survival following DM would be driven by the choice of first-line therapy.

To represent real-world practice, market shares in the observation arm were obtained from unpublished US-specific market research (Table 2). The same market shares were assigned to patients in the ipilimumab arm, who were expected to be eligible for subsequent PD-1 inhibitors. In the pembrolizumab arm, patients’ eligibility for subsequent PD-1 inhibitors was expected to depend on whether DM occurred early or late. Specifically, if DM occurred after ≥ 24 months, 50.0% were assumed to receive rechallenge with pembrolizumab or nivolumab, while 50.0% receive other therapies; otherwise, patients were assumed to be ineligible for pembrolizumab retreatment, and were distributed to other subsequent treatments (Table 2).

Table 2 Utility and cost inputs

Exponential OS and progression-free survival (PFS) distributions were estimated for each first-line subsequent treatment based on trials in the advanced melanoma setting. For pembrolizumab, these distributions were fitted using patient-level data from the pembrolizumab 10 mg/kg every 3 weeks arm (first-line subgroup) of the KEYNOTE-006 trial among ipilimumab-naïve patients with unresectable/advanced melanoma [36]. For other first-line treatments, HRs for OS and PFS versus pembrolizumab were estimated through an NMA of first-line drug trials [36,37,38,39,40,41,42,43]. Expected OS following DM was calculated in each adjuvant treatment arm as a market share-weighted average of expected OS under different first-line treatments. Expected OS was then converted into a weekly hazard of DM → death. Expected PFS was estimated similarly for each adjuvant treatment arm.

2.5.2 Safety

Treatment-specific adverse event (AE) risks were considered for grade 3–5 drug-related AEs that affected ≥ 5.0% at any grade for pembrolizumab or placebo in KEYNOTE-054 [21], ipilimumab in EORTC-18071 [15], or dabrafenib + trametinib in COMBI-AD [18]. Diarrhea (grade 2+) was also considered based on the high expected cost of this AE. Mean durations for the AEs were collected from KEYNOTE-054.

2.5.3 Quality of life

EuroQol-five dimension-three level (EQ-5D-3L) responses were collected during the KEYNOTE-054 trial and scored using the US value set [44]. Utilities for RF and LR were estimated through linear mixed-effects regression with patient-level random effects to link health states (defined according to RECIST V1.1 criteria) with EQ-5D-3L.

Utility in the DM state represented a weighted average of utilities associated with pre- and post-progression DM. The relative weight of these sub-state utilities depended on the estimated ratio of mean PFS:OS within the DM state, given the market shares of different first-line treatments. Utility for pre-progression DM was estimated based on EQ-5D-3L data from KEYNOTE-054. Utility beyond progression could not be estimated reliably using KEYNOTE-054 data due to limited follow-up, and was therefore obtained from a cross-sectional study that elicited standard gamble utilities for advanced melanoma states [45]. Scenario analyses were conducted in which all utilities were informed by KEYNOTE-054, or by a different literature source [46].

AE-related disutility was estimated as the difference in utility associated with RF (without toxicity) versus RF (during any grade 3 + AE) in KEYNOTE-054. For each adjuvant treatment, a one-time QALY decrement was applied in the first model cycle based on the risks, mean durations, and disutility of AEs.

2.5.4 Costs

Adjuvant drug costs were calculated based on their Wholesale Acquisition Cost (WAC) and label-recommended dosing [15, 18, 21]. In a scenario analysis, drug costs were instead calculated using Veterans Affairs national contract prices [47]. The relative dose intensity of pembrolizumab in KEYNOTE-054 (99.7%) was also applied to drug acquisition costs in all adjuvant treatment arms to account for potential dose interruptions/reductions. IV administration costs were calculated based on minutes required per infusion (pembrolizumab: 30; ipilimumab: 90) and costs of IV administration services from the Centers for Medicare & Medicaid Services (CMS) Hospital Outpatient Prospective Payment System [48,49,50] (Table 2). Orally administered drugs were assumed to require no administration costs.

For pembrolizumab, treatment duration was directly based on observed Kaplan–Meier data for time to discontinuation in KEYNOTE-054 (mean doses: 14), without need for extrapolation given the 1-year/18-dose maximum. For other adjuvant treatments, treatment duration was modeled using trial-based exponential distributions [14, 18], subject to FDA-recommended maximums (ipilimumab: 156 weeks; dabrafenib + trametinib: 52 weeks).

Costs of first- and second-line subsequent therapies were modeled as a lump-sum cost following DM (Table 2). The mean total cost of each regimen was calculated based on WAC, label-recommended dosing [48, 49, 51,52,53,54,55], drug administration costs [50], and estimated mean treatment duration. First-line treatment durations were modelled using the exponential PFS distributions, up to the label-recommended maximum where applicable. Second-line treatment durations were assumed to be 21 weeks (or the maximum duration if < 21 weeks), in line with prior health technology appraisals in advanced melanoma [56, 57].

Unit costs of AEs were obtained from the CMS Acute Inpatient Prospective Payment System [58] and published sources [59,60,61]. These costs were multiplied by treatment-specific AE risks and applied as a one-time cost in the first cycle.

Weekly medical cost in the RF state included outpatient services (e.g., physician office visits, radiologic assessments) reported in a retrospective study [62] (Table 2); this cost was assumed to decrease over time, based on guideline recommendations [5]. Following LR, a one-time cost of salvage surgery was applied based on the cost of surgery obtained from literature [62] and observed probabilities of lymphadenectomy, skin lesion resection, in-transit metastases resection, or other surgery after LR in KEYNOTE-054 [63]. Following DM, a one-time cost was applied based on medical resources associated with first-line treatment initiation [64]. Subsequent weekly medical costs were based on a retrospective chart review of US patients with unresectable stage III/IV melanoma [65].

One-time costs of palliative/terminal care were applied for patients transitioning to death after DM [66].

2.6 Sensitivity Analyses

One-way deterministic sensitivity analyses (DSAs) and scenario analyses were conducted to examine the influence of specific inputs and assumptions. The Online Supplemental Material lists all DSAs performed in each target population.

Probabilistic sensitivity analysis (PSA) with 10,000 iterations was also conducted to estimate the probability of each treatment being cost-effective under different willingness-to-pay thresholds. In each iteration, inputs were randomly drawn from their specified distributions (Tables 1, 2). Standard errors or variance–covariance matrices of the selected distributions were based on original data sources, where available.

2.7 Model Validation

Model predictions for RFS, DMFS and OS are presented in Fig. 2. The model was internally and externally validated [67]. Clinical opinion was sought to validate efficacy inputs and other key model assumptions. Internal validity was assessed by comparing modeled efficacy against original sources that informed the efficacy inputs. RFS curves predicted for pembrolizumab and observation were plotted alongside observed Kaplan–Meier curves to assess their alignment. Model predictions for ipilimumab and dabrafenib + trametinib were similarly compared to reported statistics in the EORTC-18071 and COMBI-AD trials, respectively. To assess external validity, external data from the placebo arm of EORTC-18071 (median follow-up: 5.3 years) was used to validate 5-year OS projections for observation.

Fig. 2
figure 2

Modelled long-term a RFS, b DMFS, and c OS in the overall population. DMFS distant metastases-free survival, OS overall survival, RFS recurrence-free survival

3 Results

3.1 Base-Case Results

In the overall population, total costs were $511,290 for pembrolizumab, $461,422 for observation, and $992,721 for ipilimumab (Table 3). Cost differences were largely attributable to adjuvant treatment costs, which were zero for observation and over fourfold higher for ipilimumab than pembrolizumab. Total QALYs were 9.24 for pembrolizumab, 5.95 for observation, and 7.09 for ipilimumab, with total LYs of 10.54, 7.15, and 8.35, respectively. The proportion of LYs spent recurrence-free was 80.3% with pembrolizumab versus 57.3% with observation and 66.6% with ipilimumab. The resulting ICERs of pembrolizumab versus observation were $15,155/QALY and $14,691/LY. Pembrolizumab dominated (i.e., was less costly and more effective than) ipilimumab.

Table 3 Base-case cost-effectiveness results by target population

In the BRAF+ subgroup, pembrolizumab dominated dabrafenib + trametinib, with incremental total costs of − $62,776 and incremental QALYs and LYs of 0.93 and 0.91, respectively. In this subgroup, pembrolizumab also dominated observation, with incremental total costs of − $11,250 and incremental QALYs and LYs similar to those estimated in the overall population.

3.2 Deterministic Sensitivity Analysis (DSA) and Scenario Analysis Results

Tornado diagrams in Fig. 3 present the ten most influential sensitivity analyses in comparisons between pembrolizumab versus each comparator. Results are displayed in terms of ICERs or (for comparisons in which pembrolizumab was dominant in the base case) incremental net monetary benefits (INMB) using a $100,000/QALY willingness-to-pay threshold.

Fig. 3
figure 3

Tornado diagrams based on deterministic sensitivity analysis (DSA) of pembrolizumab versus: a ipilimumab and b observation in the overall population; and c dabrafenib + trametinib and d observation in the BRAF+ subgroup. DM distant metastases, HR hazard ratio, ICER incremental cost-effectiveness ratio, LR locoregional recurrence, NMB net monetary benefit, QALY quality-adjusted life-year, RF recurrence-free, RFS recurrence-free survival, USD United States dollar, VA Veterans Affairs

Relative to observation, pembrolizumab ranged from dominant to having an ICER of $60,185/QALY in the overall population or $15,442/QALY in the BRAF+ subgroup. Pembrolizumab dominated ipilimumab in all sensitivity analyses and dominated dabrafenib + trametinib in all but two scenarios. The results were most sensitive to parameters determining RFS. Other influential parameters included time horizon, discount rate for effectiveness, RF-related utility, subsequent treatment assumptions, and use of Veterans Affairs national contract prices for drugs.

3.3 Probabilistic Sensitivity Analysis (PSA) Results

PSA results are illustrated through cost-effectiveness acceptability curves in Fig. 4. At a willingness-to-pay of $100,000/QALY, the probability of cost-effectiveness in the overall population was 88.9% for pembrolizumab, 0.0% for ipilimumab, and 11.1% for observation. In the BRAF+ subgroup, the probability of cost-effectiveness was 78.6% for pembrolizumab, 17.7% for dabrafenib + trametinib, and 3.6% for observation.

Fig. 4
figure 4

Combined cost-effectiveness acceptability curves: a overall population; and b BRAF+ subgroup

4 Discussion

This study evaluated the cost-effectiveness of pembrolizumab, the latest FDA-approved adjuvant treatment option for resected stage III melanoma, compared with ipilimumab, dabrafenib + trametinib, and observation alone.

Results suggest that pembrolizumab is cost-effective relative to each comparator based on the common willingness-to-pay threshold of $100,000/QALY [68, 69]. In the overall population, the base-case ICER of pembrolizumab versus observation was $15,155/QALY. Pembrolizumab was dominant over ipilimumab in the base-case analysis, all PSA iterations, and across all input values and scenarios tested in DSAs. This finding was driven by ipilimumab leading to shorter RFS and higher adjuvant drug costs given the longer label-recommended maximum duration. Costs of subsequent treatments, disease management, and terminal care were also lower for pembrolizumab, reflecting the lower incidence of recurrence. In the context of a rapidly evolving treatment landscape in this indication, NCCN guidelines recommend adjuvant treatment with ipilimumab only in limited instances of nodal recurrence as of 2019 [5].

In the BRAF+ subgroup, pembrolizumab dominated dabrafenib + trametinib, and was estimated to be slightly cost-saving and therefore dominant over observation. Subsequent treatment costs were higher in the observation arm, the combined result of greater recurrence risk and larger expected market share of dabrafenib + trametinib (the most costly subsequent treatment).

Sensitivity analysis results were largely consistent with the base case, showing the most variation when using alternative distributions to model transitions starting from RF, which drive long-term survival extrapolations. In DSAs, the highest ICERs versus observation (overall: $60,185/QALY; BRAF+: $15,442/QALY) resulted from using parametric models with time-constant treatment effects; this alternative approach produced comparatively worse fit with observed RFS than the base-case parametric models individually fitted to each KEYNOTE-054 arm. Relative to dabrafenib + trametinib, INMB was negative only in two scenarios that used alternative distributions to model RF → LR and RF → DM transitions, which overestimated 3-year RFS in the dabrafenib + trametinib arm by 12–17 percentage-points relative to that observed in COMBI-AD (58%) [18].

Patient-level data from KEYNOTE-054 were used to parametrically model transitions determining RFS for pembrolizumab and observation. Post-recurrence transitions were modeled based on real-world data and clinical trials of advanced melanoma treatments given the absence of long-term survival data from KEYNOTE-054. Extrapolation of OS using RFS is an approach supported by recent evidence on the natural history of high-risk melanoma after resection [70]. Additionally, in the EORTC-18071 trial, relative reductions in recurrence/death with ipilimumab versus placebo (HR = 0.76) closely matched relative reductions in death (HR = 0.72). Five-year OS in the placebo arm of EORTC-18071 (54.4%) was comparable to modeled OS in the observation arm (59.0%) [15].

With the recent FDA approval of several adjuvant treatments for resected stage III melanoma, comprehensive economic evaluations of these treatments are needed to inform reimbursement decision-making by payers. The cost-effectiveness of pembrolizumab was previously evaluated relative to routine observation in a US-based study [26]. Prior submissions to NICE have assessed the cost-effectiveness of pembrolizumab [27], nivolumab [71], and dabrafenib + trametinib [72] from a UK national payer perspective, each focusing on the comparison with observation. The present study is the first published cost-effectiveness analysis to comparatively evaluate different active adjuvant treatments for melanoma.

Notable strengths of this study include modelling the RFS efficacy of pembrolizumab and observation based on head-to-head trial data. Consistent with best practice, selection of parametric functions was based on goodness-of-fit with observed data and clinical plausibility of long-term projections [30]. Duration of pembrolizumab treatment was precisely estimated using observed data, without extrapolation. Utility values for most health states were directly measurable in KEYNOTE-054 based on EQ-5D-3L, a generic preference-based measure recommended by the Second Panel on Cost-Effectiveness in Health and Medicine [23].

This study is nevertheless subject to limitations. First, there are no direct head-to-head trials comparing pembrolizumab to comparators other than observation. A systematic literature review and NMA were conducted to indirectly compare RFS across adjuvant therapies [22]. Time-varying HRs from a fractional polynomial NMA were used to account for non-proportionality of hazards; scenario analyses using time-constant HRs yielded similar results.

Second, due to between-trial heterogeneity in treatment protocols and inclusion criteria, nivolumab could not be evaluated in the NMA [22] or cost-effectiveness model. Patients randomized to ipilimumab received treatment for ≤ 1 year in CheckMate-238, compared with ≤ 3 years in EORTC-18071. CheckMate-238 also enrolled patients with resected stage IIIB-C/IV disease, while EORTC-18071 and KEYNOTE-054 enrolled those with resected stage IIIA-C disease. Consequently, ipilimumab outcomes are not comparable across trials, preventing balanced indirect comparisons of nivolumab versus other adjuvant comparators; a similar limitation was noted by the Evidence Review Group that evaluated the NICE submission for nivolumab [73].

Third, DMFS and OS data were not part of the pre-specified interim analyses of KEYNOTE-054, and could not be used to inform post-recurrence TPs. Therefore, the model assumed no sustained therapeutic benefit of adjuvant treatments after recurrence, potentially resulting in conservative estimates of incremental QALYs versus observation. This assumption should be re-examined as mature DMFS and OS data become available from KEYNOTE-054.

Finally, this cost-effectiveness analysis focused on direct healthcare costs from a payer perspective; indirect costs related to work productivity and caregiving were not considered. Including indirect costs would be expected to reduce ICERs by offsetting the cost of pembrolizumab treatment.

5 Conclusion

Results of this study showed that pembrolizumab reduced costs and extended survival relative to active comparators for the adjuvant treatment of completely resected stage III melanoma, dominating ipilimumab in the overall population and dabrafenib + trametinib in the BRAF+ subgroup. Pembrolizumab was estimated to increase costs relative to routine observation, with sufficient incremental benefit to be considered cost-effective from a US health system perspective. These conclusions were robust across a range of input values and assumptions. Future research is nevertheless warranted to validate these results against long-term survival in KEYNOTE-054 and comparator trials.