Introduction

Idiopathic pulmonary fibrosis (IPF) is a fibrotic progressive interstitial lung disease (ILD) characterised by declined pulmonary function and overall poor prognosis1,2. The Gender-Age-Physiology (GAP) index, a simple point-scoring calculator of multidimensional prognostic staging system, was originally proposed and validated in 2012 for the prediction of 1, 2 and 3-year mortalities in IPF3. The GAP index calculates the baseline IPF characteristics and has shown excellent prognostic-group separation ability.

After development of the GAP index, antifibrotic therapy (AFT), i.e. pirfenidone and nintedanib, was established and recommended for IPF treatment in the international guideline of 20152. The pirfenidone and nintedanib slow disease progression by reducing the annual decline of forced vital capacity (FVC) in patients with IPF4,5,6,7. Further, AFT was reported to reduce the decline of FVC in patients with other ILD types, including systemic sclerosis-associated ILD8 and progressive fibrosing ILDs9,10. Based on its documented ability to delay lung-function deterioration, AFT is considered to reduce the risk of mortality11,12. However, the utility of the GAP index in the context of AFT has not been fully evaluated, and the development of a simple and easily applicable prognostic staging system for AFT is expected.

A lower body mass index (BMI) was previously reported to be associated with poor outcome in patients with IPF. Recently, the clinical implication of sarcopenia, which is characterised by progressive and generalised skeletal disorder involving accelerated loss of muscle mass and function, has been highlighted in various diseases14,15. These metabolic dysfunctions, partly represented as muscle wasting and body-weight loss, are frequently found in patients with various respiratory-disease types, including ILD16,17,18,19. Importantly, muscle wasting and body-weight loss have been associated with poor outcome in patients with IPF13,18,19,20,21. These previous studies suggested that preventing skeletal-muscle wasting as well as preserving body weight and lung function are important for the management of patients with IPF.

This multi-centre, retrospective, two-cohort study aimed to evaluate the GAP index at AFT initiation in patients with IPF. Additionally, this study also assessed the value of the BMI and of combined BMI and GAP index assessment for prognosis discrimination in patients with IPF treated with AFT.

Results

Clinical characteristics

The clinical characteristics of the patients with IPF at the time of AFT initiation are shown in Table 1 and Fig. 1. Most patients were approximately 70 years old and 80% of the patients were male in both cohorts. Pirfenidone was commonly used in the Hamamatsu cohort, and surgical lung biopsy was frequently performed in the Hamamatsu cohort. The median follow-up period since AFT was initiated was approximately 2 years, with 70 patients having more than a 3-year observation period and 43 patients having more than a 5-year observation period. The number of patients with previous history of AE at AFT initiation tended to be higher in the Seirei cohort: 16 (13.4%) vs. 7 (6.4%), respectively. The pulmonary function test showed severe-to-moderate impairment of spirometry and decreased %DLCO in both cohorts. The levels of serum albumin were slightly lower in the patients in the Seirei cohort. Long-term oxygen therapy (LTOT) and immunosuppressants were more frequently prescribed in the Seirei cohort. Immunosuppressants were mainly initiated due to acute exacerbation (AE), and some were prescribed before the PANTHOR trial22.

Table 1 Clinical characteristics of 229 patients with IPF at initiations of antifibrotic therapy.
Figure 1
figure 1

Flow diagram of patient selection. AFT antifibrotic therapy, IPF idiopathic pulmonary fibrosis, DLCO diffuse capacity of the lung for carbon monoxide.

Assessment of the GAP stage in patients with IPF at the time of AFT initiation

The distributions of patients with IPF at the time of AFT initiation according to the GAP index are shown in Table 2. The frequencies of GAP stages I and stage II were approximately 40%. Among them, GAP stage II was most frequent in both cohorts. Meanwhile, the proportion of GAP stage III was lower than 20% in both cohorts, and the proportion of GAP stage III was lower in the Seirei cohort than in the Hamamatsu cohort; especially, that in the Seirei cohort was 8.4%.

Table 2 GAP stage in patients with IPF at initiations of antifibrotic therapy.

Prognostic classification of the GAP index in patients with IPF treated with AFT

During the follow-up period, 122 deaths were noted. There were no significant differences in causes of death between the two cohorts (Table 3). In both cohorts, approximately 60% of patients died because of chronic respiratory failure, whereas the incidence of lung cancer was lower than 10%. The survival analyses according to the GAP index are shown in Fig. 2 and Supplementary-Table S1. In both cohorts, the GAP index successfully divided the prognosis of patients with IPF treated with AFT into three groups with distinct prognosis. The discrimination performance of the GAP index in patients of the combined cohorts was 0.675 (C-statistics). Among the components of the GAP index, FVC (%) and DLCO (%) also yielded prognostic separation in the combined-cohort patients (Fig. 3), but not the ‘Age’ and ‘Gender’ factors (not shown). The C-index of the ‘GAP index: FVC (%)’ and ‘GAP index: DLCO (%)’ were 0.668 and 0.655, respectively.

Table 3 Cause of mortality in patients with IPF treated with antifibrotic therapy.
Figure 2
figure 2

Prognostic classifications according to the GAP index at initiation of antifibrotic therapy in patients with IPF. Kaplan–Meier curves based on the data of the patients with IPF of the Hamamatsu cohort (A), Seirei cohort (B), and combined cohort (C) according to the GAP stage. P values were determined using the log-rank test. GAP Gender-Age-Physiology, IPF idiopathic pulmonary fibrosis.

Figure 3
figure 3

Prognostic classifications according to GAP index: FVC (%) and GAP index: DLCO (%). Kaplan–Meier curves based on the data of the patients with IPF of the Hamamatsu cohort (A,D), Seirei cohort (B,E), and combined cohort (C,F) according to GAP stage: FVC (%) and GAP index: DLCO (%). P values were determined using the log-rank test. GAP Gender-Age-Physiology, FVC forced vital capacity, DLCO diffuse capacity of the lung for carbon monoxide, IPF idiopathic pulmonary fibrosis.

Prognostic factors in patients with IPF treated with AFT

The univariate and multivariate analyses are presented in Table 4. As shown, the GAP index and its components, %FVC and %DLCO, were found to be significant prognostic factors in the multivariate analyses. Additionally, a lower BMI was a significant prognostic factor independent of the GAP index in the multivariate analyses.

Table 4 Prognostic factors in 229 patients with IPF treated with antifibrotic therapy by univariate and multivariate Cox-proportion analyses.

Prognostic value of the BMI at AFT initiation in patients with IPF

As the BMI showed prognostic value independent of the GAP index, we performed combined assessment of the GAP index with the BMI, i.e., GAP plus BMI, for IPF prognosis. Interestingly, a reverse J-shaped association was seen between the BMI and mortality rate in the Cox proportional hazards regression model (Supplementary-Figure S1). According to the ROC analyses, we identified a BMI 24 kg/m2 as the optimal cut-off. If the value of the BMI was lower than the cut-off value, the severity of the GAP stage was advanced by one stage. For example, the patients with GAP stage I who had BMI < 24 kg/m2 were categorized as GAP plus BMI stage II. Subsequently GAP plus BMI classified patients into four groups. The survival analyses according to GAP plus BMI are shown in Fig. 4. For clinical relevance, survival curves applying BMI < 20 kg/m2 (C-statistics 0.688) were also shown in Supplementary-Figure S2. GAP plus BMI was successful in distinguishing the prognoses in the combined-cohort patients with IPF (C-statistics 0.698). To calculate model improvement by GAP plus BMI at the 1-year, 3-year, and 5-year survivals, we applied IDI and NRI analyses. GAP plus BMI improved the discriminative performance of 3-year survival at 21% compared to that provided by the GAP index alone, although statistical significance was not reached (Table 5).

Figure 4
figure 4

Prognostic classifications according to GAP plus BMI. Kaplan–Meier curves based on the data of the patients with IPF of the Hamamatsu cohort (A), Seirei cohort (B), and combined cohort (C) according to GAP plus BMI. P values were determined using the log-rank test. GAP Gender-Age-Physiology, BMI body mass index, IPF idiopathic pulmonary fibrosis.

Table 5 Additional predictive values of GAP plus BMI at initiations of antifibrotic therapy in patients with IPF.

Discussion

The present study retrospectively examined the utility of the GAP index and of the BMI for prognosis prediction in patients with IPF treated with AFT. First, the GAP index at AFT initiation yielded clear prognostic distinction in the two cohorts of patients with IPF. Next, as BMI showed prognostic value independent of the GAP index, we evaluated the prognosis using the GAP index in conjunction with BMI: GAP plus BMI. Assessment of GAP plus BMI at AFT initiation successfully divided the patients into four groups with distinct prognoses. Our data suggested the clinical usefulness of the GAP index and importance of BMI assessment at AFT initiation in patients with IPF.

The GAP index was developed as a clinical baseline prediction model to distinguish prognostic groups in IPF based on data obtained at the time of diagnosis3. The GAP index is an easy and simple method that enables the assessment of cross-sectional data, without need for longitudinal data, which is advantageous. Accordingly, the GAP index is widely used and its utility has been shown in other types of ILD including unclassifiable ILD, chronic hypersensitive pneumonitis, and connective tissue disease-associated ILD23,24. However, at the time of GAP index development, no pharmacological therapy including pirfenidone was recommended in the international guideline25. Thus, the clinical implications of the GAP index in AFT have not been fully evaluated. In this regard, this study examined the prognostic classifications of the GAP index with two cohorts of patients with IPF treated with AFT and showed that assessment of the GAP index at AFT initiation clearly distinguished the prognoses.

The most common cause of mortality in these cohorts was chronic respiratory failure (57.4%) followed by AE (23.8%), whereas the lung-cancer incidence was very low at 5.7%. Interestingly, an epidemiological study conducted in Japan between 2003 and 2007 (i.e. before the introduction of AFT) reported the incidence of AE, chronic respiratory failure, and lung cancer at 40%, 24%, and 11%, respectively26; the lung-cancer incidence was twice as high as that in our cohort and resulted in an increase in the combined incidence of chronic respiratory failure and AE after AFT development. Although AE can occur in patients with preserved lung function, low FVC was reported as the most consistent risk factor for AE in IPF, and low DLCO was also identified as a well-known risk factor for AE in an international statement27. Therefore, these changes in cause of death may have positively contributed to mortality prediction using the GAP index at AFT initiation.

Given that the GAP index is a baseline risk-prediction model largely based on physiological factors, these issues also imply their own limitations. Disease behaviour or exercise capacity are not required to calculate the GAP stage. Indeed, both baseline and changes in spirometry and 6-min walking test (6MWT) were reported to be independent predictors of mortality28,29. Additionally, an overestimated risk for mortality with the GAP index was reported by the same authors who developed the GAP index30. They reported improvement in risk prediction performance using a modified GAP index that also included longitudinal change in FVC and respiratory hospitalization30. However, assessing longitudinal variables and 6MWT may not be suitable for risk assessment in severe patients or patients for whom there are no historical data available.

In this setting, as the BMI was shown to have prognostic value independent of the GAP index and it is easy to measure, the present study examined the value of using GAP plus BMI in patients with IPF treated with AFT for prognostic separations. Recently, clinical implications of sarcopenia, a metabolic dysfunction involving loss of skeletal muscle, in ILD was reported beyond those in chronic obstructive pulmonary disease and cancer17,18,19. Indeed, skeletal-muscle loss was well correlated with lower BMI and was associated with worse outcome in patients with IPF18,19. Thus, lower BMI partly represents skeletal-muscle wasting in patients with IPF. Further, consistent with our findings, a second analysis of the INPULSIS study reported that lower BMI (< 25 kg/m2) and weight loss (> 5% during 52 weeks) were associated with faster decline in FVC, suggesting shorter survivals in such patients20. Although GAP plus BMI did not achieve significant improvement of model discrimination compared with that of the original GAP index, GAP plus BMI distinguished patients into four distinct prognoses. Especially, patients with GAP stages I and II (approximately 85% of our cohort) were re-classified from two groups into three groups. This multidimension approach might be helpful for physicians in practice.

The present study had several limitations. Although we confirmed the utility of the GAP index at AFT initiation using two cohorts of patients with IPF, this study was retrospective. Further, there had been reported concerns regarding the use of p-value in net reclassification improvement and its validity31. Second, the sample size was not sufficiently large. Given the small number of patients in each cohort, significance of GAP plus BMI was ascertained when we examined the two cohorts together. Third, the cut-off of the BMI appears to depend on ethnicity and country20. In our cohort, although few patients were obese, reverse-J associations were found between BMI and mortality, suggesting that both upper and lower cut-offs in BMI might be needed. Indeed, sarcopenic obesity is known to lead to poor outcomes in patients with cancer32. Collectively, these limitations may have introduced potential biases to this study. Thus, large-scale prospective studies are required to overcome these limitations.

In conclusion, the present retrospective study showed that assessment of the GAP index at AFT initiation could successfully separate patients with IPF in prognostic groups. Our multivariate prognostic evaluation also revealed that lower BMI was associated with poor outcome independent of the GAP index. Interestingly, GAP plus BMI also separated patients into four distinct prognoses in the combined-cohort of patients. Collectively, these results indicated the value of the GAP index and BMI measurement for assessing prognostic prediction in patients with IPF treated with AFT.

Methods

Subjects

This retrospective study initially included 311 consecutive patients with ILD who started treatment with pirfenidone or nintedanib at Hamamatsu University of School of Medicine (Hamamatsu cohort, n = 154), Seirei Hamamatsu Hospital, and Seirei Mikatahara Hospital (Seirei cohort, n = 157). All patients were treated between February 2009 and March 2020. Eighty-two patients with ILD were excluded from the study: 49 patients were diagnosed with non-IPF ILD, ten patients with IPF did not undergo spirometry, and 23 were not evaluated with the diffusion capacity of the lung for carbon monoxide test (DLCO) at the time of AFT initiation. Thus, this study finally included 229 patients with IPF treated with AFT who had available data to assess the GAP score: the Hamamatsu cohort (n = 110) and Seirei cohort (n = 119) (Fig. 1). All participants fulfilled the IPF consensus criteria1,25. The study protocol was approved by the Ethical Committee of Hamamatsu University School of Medicine (17-196) and was carried out in accordance with the approved guidelines. The need for patient approval and/or informed consent was waived by the Ethical Committee of Hamamatsu University School of Medicine, because of the retrospective nature of the study.

Data collection

Clinical data were obtained from the patients’ medical records. Laboratory findings and pulmonary and functional test results obtained at the time of AFT initiation were recorded. AE was diagnosed based on the ATS guidelines27,33.

Assessing the GAP index and GAP plus BMI index

The GAP score was calculated based on data at the time of AFT initiation according to a previous study3: sex (female, 0 points; male, 1 point), age (≤ 60 years , 0 points; 61–65 years, 1 point; > 65 years, 2 points), %FVC (> 75%, 0 points; 50–75%, 1 point; < 50%, 2 points), and %DLCO (> 55%, 0 points; 36–55%, 1 point; ≤ 35%, 2 points; cannot perform, 3 points). The GAP stage was defined based on the total GAP score: stage I (0–3 points), stage II (4–5 points), and stage III (6–8 points). The GAP plus BMI was calculated based on the GAP stage and BMI at the time of AFT initiation. If the BMI value was lower than the cut-off value, the severity of the GAP stage was advanced by one stage.

Statistical analysis

Discrete variables are expressed as totals (percentages), and continuous variables are expressed as the median [interquartile range]. The Mann–Whitney U test was used to compare continuous variables. Fisher’s exact test for independence was used to compare categorical variables. The area under the receiver operating characteristic (ROC) curve was used to identify the optimal cut-off for the BMI. The overall survival time was measured from AFT initiation. Univariate and multivariate analyses were also performed using a Cox proportional hazards regression model. Cumulative survival probabilities were calculated using the Kaplan–Meier method and the log-rank test. The model performance was evaluated by discrimination using concordance statistics (C-statistics), i.e., the ability of a model to discriminate those with an outcome from those without an outcome. Model improvement was calculated by comparison with the GAP index; change in C-statistics, integrated discrimination improvement (IDI), and net reclassification improvement (NRI) were employed. Statistical analyses were performed using JMP (Ver13, SAS Institute, Inc., Cary, NC) and R (Ver4.0.2, R Foundation for Statistical Computing, Vienna, Austria). All analyses were two-tailed, and P-values < 0.05 were considered statistically significant.