Introduction

Hepatocellular carcinoma (HCC) is the fifth most common cancer and the third most lethal malignancy worldwide [1, 2]. Although surveillance for HCC with alpha-fetoprotein (AFP) and ultrasound in patients at risk for HCC is recommended [3,4,5], more than 35% of patients are diagnosed with advanced-stage HCC [6]. The Barcelona Clinic Liver Cancer (BCLC) stage C includes patients who have performance status (PS) 1 or 2, extrahepatic spread (EHS), macrovascular invasion (MVI), or Child–Turcott–Pugh class (CP) A or B according to the American and European HCC practice guidelines [3,4,5]. As such, BCLC stage C patients are often quite heterogeneous, but different prognostic factors are generally considered to be of similar degree of impact in the current staging system, and the only recommended treatment for this population is sorafenib [3,4,5]. However, while sorafenib has been shown to prolong overall survival (OS) in patients with MVI and/or EHS or poor PS [7, 8], other factors such as age and other treatment modalities have also emerged as significant factors associated with survival in this population [9,10,11,12]. As a result, new scoring systems or substaging for BCLC-C HCC have been proposed [13,14,15,16,17], but many involve complex classification limiting their application in routine practice. Therefore, this study aims to develop a practical and easy-to-use substaging system for BCLC-C HCC in a training cohort in Taiwan and validate the proposed substaging system in two external validation cohorts, one from Taiwan and one from a diverse international cohort comprised of patients from the U.S. and other East Asian countries outside of Taiwan.

Materials and methods

Study design and study patients

This is a retrospective cohort study inclusive of 1856 BCLC-C HCC patients from six study centers and four countries (Taiwan, 2; USA, 2; Japan, 1, and South Korea, 1). Study patients were divided into three cohorts: The Taiwan training cohort from E-Da Hospital (n = 622, 2011–2017, Figure S1A), the Taiwan validation cohort from Kaohsiung Medical University Hospital (n = 774, 2011–2017, Figure S1B), and the international validation cohort from Stanford University Medical Center (USA), Mayo Clinic (USA), Hanyang University Seoul Hospital (South Korea), and Ogaki Municipal Hospital (Japan) (n = 460, Figure S1C). This study was approved by the Institutional Review Board at each participating institution.

Study definitions and statistical analysis

See the supplementary information.

Results

Patient baseline characteristics

Clinical features at HCC diagnosis for all study cohorts were shown in Table 1. In the total cohort (n = 1856), the median age was 65 years, the majority (81.5%) were male, about one-third had hepatitis B virus (HBV, 35.1%), one-third had hepatitis C virus (HCV, 36.6%), 7.1% had both HBV and HCV infections, the majority (64.5%) had CP class A, about half (51.0%) had MVI, and about one-third (36.6%) had EHS. The mortality rate of the total cohort was 90.8 per 100 person-years.

Table 1 Baseline demographic features of patients in the total cohort, training cohort, Taiwan validation cohort, and international validation cohort

The Taiwan training cohort, the Taiwan validation cohort, and the international validation cohort differed significantly in all included clinical variables except for sex and median follow-up time, which ranged 6.5–8.3 months.

Training cohort and development of the new Taiwan BCLC-C substaging system (TBCSS)

Factors associated with overall survival

To select components for the subclassification of BCLC stage C patients, we examined OS in the training cohort and found significant OS differences by several patient and tumor characteristics (Table 2). The median OS was significantly longer in patients with a largest tumor size < 10 cm than in those with a tumor size ≥ 10 cm (11.5 vs 4.2 months, p < 0.0001, Figure S2A), in patients without EHS than in those with EHS (8.8 vs 4.5 months, p < 0.0001, Figure S2B), in patients without MVI than in those with MVI (8.5 vs 5.7 months, p < 0.0001, Figure S2C), and in patients with CP class A than in those with CP class B (10.3 vs 3.9 months, p < 0.0001, Figure S2D). In addition, the median OS was significantly longer in patients aged < 75 years old compared to those aged ≥ 75 years old (7.6 vs 4.5 months, p = 0.048), in patients with PS1/2 versus those with PS0 (10.1 vs 5.3 months, p < 0.0001), in patients with single tumor versus those with multiple tumors (13.2 vs 5.6 months, p < 0.0001), and in patients with AFP < 200 versus those with AFP ≥ 200 (8.3 vs 5.3 months, p < 0.0001) (Table 2).

Table 2 Prognostic factors associated with overall survival in training cohort

Next, using multivariable Cox’s regression analysis (Table 2), we identified factors independently associated with OS. We found that aged ≥ 75 years old (HR 0.66; 95% CI 0.53–0.83; p < 0.0001), tumor size ≥ 10 cm (HR 0.71; 95% CI 0.59–0.85; p < 0.0001), the presence of EHS (HR 0.65; 95% CI 0.54–0.79; p < 0.0001), the presence of MVI (HR 0.74; 95% CI 0.61–0.88; p < 0.0001), CP class B (HR 0.48; 95% CI 0.40–0.58; p < 0.0001) and multiple liver tumors (HR 0.83; 95% CI 0.68–0.95; p = 0.039) were all significantly associated with worse OS, but not AFP levels.

Development of a new prognostic substaging system

Based on the significant risk factors for OS, we constructed the novel Taiwan BCLC-C substaging system (TBCSS) shown in Table 3. We classified patients with no tumor/liver risk factors and with aged < 75 years as substage C0, those with no tumor/liver risk factors but ≥ 75 years old as substage C1, and regardless of age those with any one of four tumor/liver risk factors as substage C2, those with any two of four tumor/liver risk factors as substage C3, and those with three or four tumor/liver risk factors as substage C4.

Table 3 Proposed substaging for Barcelona Clinic Liver Cancer Stage C

As shown in Fig. 1a, the survival curves of the substages demonstrated a significant distinction among five substages in this training cohort (p = 0.011 to < 0.0001 for comparisons between each substage from C1 to C4 against C0). The median OS ranged from 43.8 months (95% CI 32.2–53.7) for substage C0, to 20.6 months (95% CI 14.1–25.9) for C1, 11.5 months (95% CI 8.02–14.1) for C2, 5.7 months (95% CI 4.02–5.98) for C3, and 3.2 months (95% CI 2.41–3.59) for C4.

Fig. 1
figure 1

Overall survival of patients according to substage by the new Taiwan BCLC-C substaging system in: a Taiwan validation cohort, b international validation cohort, c combined Taiwan and international validation cohort, and d total cohort (training cohort and combined validation cohort)

Validation of the new TBCSS

Taiwan validation cohort

The TBCSS also categorized patients in the Taiwan validation cohort with significant distinction in OS (p = 0.013 to < 0.0001 for comparison between the substages; Fig. 1b). The median OSs for the Taiwan validation cohort patients in substages C0, C1, C2, C3, and C4 were 36.8 months (95% CI 16.8–55.2), 21.1 months (95% CI 13.5–28.5), 8.6 months (95% CI 6.41–9.59), 2.9 months (95% CI 1.55–2.45), and 2.0 months (95% CI 1.40–2.60), respectively.

International validation cohort

We found similar performance of the TBCSS in our international cohort with significant distinction in OS among patients of different substages (p = 0.039 to < 0.0001; Fig. 1c). The median OSs for the international validation cohort patients in substages C0, C1, C2, C3, and C4 were 42.1 months (95% CI 30.3–51.1), 22.9 months (95% CI 15.8–30.0), 11.7 months (95% CI 8.44–13.6), 8.6 months (95% CI 5.77–10.2), and 3.4 months (95% CI 2.94–4.00), respectively.

Total cohort

Analysis of OS by substage

Combining patients from our training cohort and both our validation cohorts, we analyzed OS for each substage of the new TBCSS (p < 0.0001 for all substage comparisons; Fig. 1d). The median OS times for patients in substages C0, C1, C2, C3, and C4 were respectively 38.9 months (95% CI 28.1–47.9), 21.5 months (95% CI 16.3–25.7), 9.8 months (95% CI 7.93–10.1), 5.1 months (95% CI 4.57–5.44), and 3.3 months (95% CI 2.78–3.22) for our entire study cohort.

Sensitivity analysis with CP class A patients

As a sensitivity analysis to further isolate the effect of tumor and liver characteristics in patients with the best liver function, we analyzed only CP class A patients (n = 1197) and found that the TBCSS continued to allow great distinction in regards to OS among patients of the different substages very well (p < 0.0001 for all substage comparisons, Figure S3a–d).

Subanalysis by substage within each treatment modality

Figure S4a–d showed the median OS for substages C0, C1, C2, C3, and C4 within each treatment group. Within each treatment modality, OS was significantly different in patients of different substage (p = 0.48 to < 0.0001) except for the distinction between C1 and C0 in the sorafenib group. For example, among patients undergoing surgical resection, the median OS for substages C0, C1, C2, C3, and C4 was 65.9 months (95% CI 41.3–88.7), 44.6 months (95% CI 26.7–61.3), 23.7 months (95% CI 16.6–29.4), 9.9 months (95% CI 1.37–16.6), and 5.6 months (95% CI 1.02–8.98), respectively.

Subanalysis by treatment modality within each substage

The OS for patients undergoing different types of treatment within each substage is presented in Table 4. In substage C0, 43.6% and 31.4% of the patients received curative therapies and TACE, respectively. The median OS in patients receiving curative therapies was 49.8 months (95% CI 28.8–67.1), which was substantially longer than those receiving TACE (36.7 months, 95% CI 21.0–50.9, p = 0.092), but significantly longer than those receiving other palliative therapies (28.5 months, p < 0.0001), sorafenib (19.8 months, p = 0.04) or BSC (17.6 months, p = 0.045). The OS did not significantly differ among patients receiving TACE versus other palliative therapies, sorafenib or BSC.

Table 4 Survival outcomes by treatment modality in each substage of total patients

In substage C1, 37.4% and 32.4% of the patients received curative therapies and TACE, respectively. The median OS was longest in patients receiving curative therapies (35.8 months, 95% CI 17.6–52.3), which did not differ significantly from those receiving TACE (25.6 months, 95% CI 13.2–36.7, p = 0.173), but significantly longer than those receiving other palliative therapies (15.1 months, 95% CI 11.2–18.7, p = 0.022), sorafenib (14.9 months, 95% CI 1.19–26.8, p = 0.046) and BSC (8.7 months 95% CI 4.31–11.7, p < 0.0001). The OS did not differ between patients receiving sorafenib and BSC.

In substage C2, most patients underwent TACE (32.2%), received sorafenib (20.6%), or had BSC (22.2%). Only 14.2% of patients underwent curative treatment but their median OS was longest among patients in substage C2 (23.8 months, 95% CI, 15.4–30.3), which was significantly longer than those undergoing TACE (12.5 months, 95% CI 9.53–14.4, p < 0.0001), other palliative therapies (7.6 months, 95% CI, 5.33–8.66, p < 0.0001), sorafenib (8.8 months, 95% CI 5.83–10.1, p < 0.0001) or BSC (4.4 months 95% CI 3.65–5.14, p < 0.0001). The OS did not differ between patients receiving other palliative therapies and sorafenib.

In substage C3, the most common intervention was BSC (34.7%), followed by sorafenib (28.1%), TACE (21.7%), and only few with curative treatment (6.9%) or other palliative therapy (6.9%). The median OS was longest in patients receiving curative therapies (9.8 months, 95% CI 1.36–16.6), which was significantly longer than in those receiving TACE (8.3 months, 95% CI 5.63–10.3, p = 0.007), other palliative therapies (3.5 months, 95% CI 2.09–3.91, p < 0.0001), sorafenib (5.4 months, 95% CI 3.54–6.45, p < 0.0001) and BSC (2.2 months 95% CI 1.58–2.41, p < 0.0001). The OS did not differ between patients receiving other palliative therapies and BSC.

In substage C4, half of the patients received BSC (47.9%) and one-quarter received sorafenib therapy (25.0%). The median OS was shortest in patients receiving BSC (2.0 months, 95% CI 1.74–2.25), which was significantly less than those receiving curative therapies (5.9 months, 95% CI 2.93–7.06, p < 0.0001), TACE (5.5 months, 95% CI 2.94–7.05, P < 0.0001), other palliative therapies (3.6 months, 95% CI 1.51–4.48, p = 0.014) and sorafenib (4.5 months, 95% CI 3.03–4.96, p < 0.0001). The OS did not differ among patients receiving curative therapies, TACE, other palliative therapies and sorafenib.

Discussion

In our large, multicenter, multinational study, we provided evidence to substage BCLC-C patients according to age, CP class and tumor characteristics (largest tumor size ≥ 10 cm, EHS, and MVI) to 5 substages C0–4 of the new TBCSS. This TBCSS derived from a Taiwan training cohort has shown great performance with clear distinction in OS rates of patients in the Taiwan validation cohort as well as the international validation cohort, despite significant differences in both patient, tumor and treatment characteristics among these three cohorts. We also showed that the performance of the TBCSS remained robust in sensitivity analysis of only CP A patients and in subgroups of patients by both substage and treatment modality. Importantly, our subgroup analysis showed that substage C0–C3 receiving curative treatment had better survival than those receiving sorafenib. Based on our findings, to maximize the survival benefit for patients with BCLC-C HCC, we proposed a novel treatment algorithm for BCLC-C patients according to the TBCSS as detailed in Fig. 2.

Fig. 2
figure 2

Proposed treatment strategy for Barcelona Clinic Liver Cancer Stage C (BCLC-C) hepatocellular carcinoma patients according to the Taiwan BCLC-C substaging system. MVI macrovascular invasion, EHS extrahepatic spread, CP class Child–Pugh class, Y/O years old, RFA radiofrequency ablation, TACE transcatheter arterial chemoembolization; systemic therapies means sorafenib

The factors used for the current TBCSS are all readily available and already included in the BCLC staging system (age, CP class, tumor size, MVI and EHS). Aging is related to a progressively decreased functional reserve of multiple organs and the reduced survival in HCC patients aged ≥ 75 years in our study have also been observed in prior studies [9,10,11]. In our total cohort, patients aged < 75 years and without any other risk factors (C0) had the best survival (median OS, 38.9 months), followed by patients aged ≥ 75 years and without one risk (C1) and those with at least one risk factor (C2–C4). Reserved liver function, as categorized by CP classification, is another important factor associated with survival among HCC patients [3,4,5]. BCLC stage C HCC patients are almost contraindicated to any HCC treatment modalities if the patients are CP class B or C [18]. In addition, we observed that tumor size is important prognostic factor associated with survival [19]. We showed that HCC patients with largest tumors ≥ 10 cm have a poorer prognosis than those with smaller tumors. It is likely due to the relationship between large tumors and the presence of factors that portent poorer prognosis, such as satellite tumors, microscopic lymphovascular involvement, and distant recurrence [20, 21].

Due to the high heterogenicity in nature and prognosis of BCLC-C population, several studies have attempted to develop substaging systems and for patients initially grouped as BCLC-C stage [13,14,15,16,17]. Prior efforts have constructed BCLC-C subclassification based on tumor size, distant metastasis, HCC type, and bile duct invasion but not other important prognostic factors, such as MVI and CP class, [16] or based on tumor burden, major portal vein invasion and distant metastasis, and CP class but without age [17]. The newly proposed TBCSS by our current study, consisting of five already collected variables for the main BCLC staging, provides a comprehensive assessment of the patient, the liver and the tumor with age, CP class, and detailed tumor factors (size, MVI, EHS). The OS rates among the substages of the TBCSS differed significantly as opposed to those seen with prior substaging systems [13, 17].

Sorafenib has generally been the only recommended therapy and the most frequently used treatment for BCLC-C patients before the introduction of the other systemic molecular targeted therapies and immunotherapy [3, 4, 22, 23], but only about 20% of our diverse cohort of BCLC patients received this treatment. While sorafenib has been shown to significantly prolong survival in unresectable HCC compared to supportive care in general, [24,25,26] we only observed significantly higher survival in sorafenib patients compared to BSC patients within BCLC-C substages C2–C4, but not among those with substage C0–1 though part of this can be due to the limited number of patients receiving sorafenib in substage C0–1 in our study (n = 11). Thus, our results indicated that sorafenib was an effective therapy for advanced HCC patients in countries in the Asia–Pacific region [26]. However, as noted above, the majority of our BCLC-C patients actually received other palliative and even curative treatments and that both curative treatment and other palliative treatment such as TACE were superior than sorafenib in regards to OS among TBCSS C0–C3 for curative treatment and C2–C3 for TACE.

Thus, our study provided support for a new treatment paradigm for the early substages of the BCLC-C group (Fig. 2). In the current study, 43.6% of BCLC-C substage C0 patients received curative therapy. The median OS could reach 49.8 (95% CI 28.8–67.1) months, which is only slightly lower than the expected survival of about 60 months among patients with very early-stage HCC (BCLC stage 0) receiving curative therapy in Taiwan as well as in West [4, 27]. It is also similar to the expected survival of about 48 months among patients with early stage HCC (BCLC stage A) receiving curative therapy. For the 31.4% of substage C0 patients in the current study who received TACE therapy, the median OS was 36.7 (95% CI 21.0–50.9) months, which was not inferior to 20–25 months among patients with intermediate stage HCC (BCLA stage B) who received TACE therapy [3,4,5, 27]. Similarly, curative therapies and TACE could provide better survival than sorafenib, other local palliative therapies and BSC among patients of BCLC substages C1–C3 in our cohort. Our results indicated that curative therapies and TACE could be recommended in a proportion of patients with previously defined advanced HCC BCLC stage C, if they meet criteria for the TBCSS substages C0-C3 and eligible for curative HCC therapies or TACE [28, 29].

However, for substage C4, neither curative therapies nor TACE provided better survival than sorafenib, indicating that sorafenib remains the best therapeutic option for the population pending future data with newly introduced systemic molecular targeted therapy, such as lenvatinib or regorafenib [30] and immunotherapy such as nivolumab or pembrolizumab [30].

The strength of our study is the large and diverse study cohort of both Eastern and Western patients of diverse liver disease etiologies and treatment modalities. Yet despite these differences in patient demographic, liver and tumor characteristics and other potential differences in regional practice patterns, our newly proposed substaging system for BCLC-C patients consistently showed divergent OS outcomes distinguishing each substage even when further stratified by treatment modality. Our study also has limitations inherent in its retrospective design such as missing data and heterogeneity in local referral and practice patterns. However, the primary outcome of our study is OS, which is an objective and universal event, and the factors included in our new TBCSS are also objective factors already included in the BCLC-C staging system. Second, whether antiviral therapy for HBV and/or HCV could improve the OS among BCLC-C HCC patients and each different substages needs further studies. Lastly, due to study time period, we were not able to study the effects of more novel systemic therapies that became available recently.

In conclusion, our study proposed a novel substaging system for BCLC-C HCC patients that can identify substage patients who may benefit from curative treatment and TACE versus sorafenib. We advocate for curative treatment or TACE for TBCSS substage C0–C1 and also in patients substage C2–C3 prior to systemic therapy, but we confirmed the current recommendation for systemic therapy or supportive care for substage C4 patients. We also recommend further validation of the TBCSS in larger scale prospective studies that also investigate the effectiveness of new systemic therapies for BCLC-C patients.