Introduction

Degenerative cervical myelopathy (DCM) is the leading cause of spinal cord injury and a major source of disability globally1. DCM refers to a clinical syndrome of neurological impairment related to chronic compression of the cervical spinal cord secondary to age-related osteoarthritic changes affecting the spinal column2,3,4. Over 70% of individuals over 65 years old have evidence of cervical degenerative changes, and approximately one-quarter of these people become clinically symptomatic from mechanical spinal cord compression5. As the global population continues to age6, optimizing the diagnosis and treatment of DCM are key public health priorities.

Operative intervention for DCM was traditionally performed with the goal of halting progression of neurological deficits7. However, more recent evidence1, 8 indicates that surgery actually improves function and quality-of-life (QOL); hence, surgical decompression has become standard of care for patients with moderate or severe symptomatic DCM. Nonetheless, there is substantial variability in the treatment outcomes of individual patients. There is hence a critical need to parse out this heterogeneity and identify factors associated with clinical improvement or worsening post-operatively. This would inform patient counselling and calibration of expectations, as well as facilitate selection of good surgical candidates.

An important limitation of previous studies is multicollinearity. This phenomenon refers to the existence of intrinsic correlations among predictor variables, which limits the ability to disentangle their independent contributions to outcome in a multi-variate regression9. Partial least squares (PLS) circumvents this limitation of traditional statistical models10. This mathematical approach is well suited to facilitate understanding of the interrelations among multi-dimensional data. PLS affords the opportunity to distill combinations of individual patient characteristics into common “phenotypes” that may be associated with outcomes. Further, as an advanced data-driven approach, PLS does not rely on any preconceived assumptions or hypotheses to test associations between predictors and outcomes10, 11. These features make PLS a potentially powerful tool in the analysis of complex relationships in epidemiological datasets; however, little work has been done in the application of PLS analysis to DCM specifically. Here, we apply PLS to dissociate phenotypes, defined by weighted combinations of demographic, disease, and treatment characteristics, associated with functional status, disability, and QOL outcomes in patients undergoing surgical decompression for DCM.

Methods

Data source & patient population

This was a post-hoc analysis of a dataset that combined the AOSpine CSM North America1 (CSM-NA; ClinicialTrials.gov NCT00285337) and AOSpine CSM International12 (CSM-I; ClinicalTrials.gov NCT00565734) studies. These were single-armed, prospective, multi-center cohort studies conducted at 26 global sites that aimed to evaluate the efficacy of surgical decompression in patients with DCM with regard to functional status, disability, and quality-of-life (QOL) outcomes. Approval for the study was obtained from the University of Toronto Research Ethics Board and the study was conducted in accordance with ethics guidelines. Patients were enrolled if they provided written informed consent and met the following eligibility criteria: (1) age 18 years or older; (2) symptomatic DCM with at least one clinical sign of myelopathy; (3) imaging evidence of cervical cord compression; and (4) no prior cervical spine surgery. All patients underwent surgical decompression of the cervical spine, with or without an instrumented fusion procedure.

Clinical variables

Data pertaining to patient demographics (e.g., age, sex, BMI, education, comorbidities), clinical presentation (e.g., symptoms, signs, duration of myelopathy, causative pathology), and surgical treatment (e.g., approach, number of levels, operative duration) were obtained (Table 1).

Table 1 Predictor (X) and outcome (Y) variables included in analysis.

Outcomes

Functional status was assessed by the mJOA scale13, 14. Quality of life was evaluated by the Short Form-36 (SF-36)15 and disability by the Neck Disability Index (NDI)16, 17. These are both patient self-reported measures, the SF-36 being a generic health-related quality of life instrument, and the NDI being specific to neck conditions. The outcomes of interest were change in mJOA, SF-36 (all eight domains and two summary scales), and NDI from baseline to 2-year postoperative follow-up (Table 1).

Partial least squares analysis

We applied a PLS analysis to better understand patient phenotypes associated with outcomes following surgical decompression for DCM. A PLS analysis has the distinct advantages of decomposing the correlation between a set of variables and extracting patterns of variable contributions to the overall relationship with outcome. Multi-dimensional associations between clinical covariates (predictors) and outcome measures were assessed using PLS for all subjects with complete baseline and 2-year follow-up data. Detailed methodology has been published previously11, 18.

Raw matrix construction

The data were separated into X (predictor) and Y (outcome) matrices. The X matrix included 478 subjects by 50 clinical covariates and the Y matrix included 478 subjects by 12 outcome variables. Data centering was performed, such that covariates with large absolute values do not dominate the analysis. To center continuous data, z-scores were generated from the sample distribution. Zero-centering was performed for binary and ordinal variables.

Correlation matrix construction

A heterogeneous correlation matrix, R, was calculated between X and Y data matrices using the hetcor function of the polycor package in R statistical software. Pearson product-moment correlations (between continuous variables), polyserial correlations (between continuous and ordinal variables), and polychoric correlations (between ordinal variables) were generated19.

PLS analysis

Single value decomposition (SVD) was performed on the heterogeneous correlation matrix to produce latent variables (LVs), or components, explaining the greatest amount of correlation between X and Y (the predictors and outcomes, respectively)20, 21 as follows:

$$\text{R = USV'}$$

The latent variable represents the relation between the outcome scores and predictor variables. The values of U, S, and V were derived through SVD. Each column (i) of the matrices U and V characterizes a single component. The vectors V(i) and U(i) represent the weights of the independent and dependent variables that make the greatest contribution to the latent variable, respectively. The diagonal of the matrix S contains the singular values. The effect size and amount of variance explained by each component may be derived by calculating the ratio of a single squared singular value to the sum of all squared singular values20.

Statistical analysis

The significance of the latent variables was tested by bootstrapping with 5000 iterations, a process of sampling with replacement. For each bootstrap, as previously reported, the data were rotated by a Procrustes rotation to align the first three vectors of U and V to the SVD of the original non-bootstrapped data18. The first three vectors were chosen because they explained the majority of the variance in the original SVD. The resampling distribution was used to derive standard errors and 95% confidence intervals for the contributions of each variable to the component. A bootstrap ratio, the ratio of each element in U and V to its bootstrap-estimated standard error (similar to a z score), was calculated to estimate the statistical reliability of each variable weighted by its contribution to the overall latent variable11, 18, 20. Significance was determined using |z|> 3.29, corresponding to p < 0.001.

Plots were generated to depict contributions of the X and Y variables to the first 3 latent variables explaining the greatest amount of variance from the PLS analysis. Each plot graphs the bootstrap-estimated 95% confidence intervals of significant X and Y variables. SVD, bootstrapping, and plot generation were conducted using MATLAB software (Mathworks Inc., Natick, MA, USA).

Results

A total of 478 patients met eligibility criteria. Baseline clinical and surgical characteristics of the study cohort are presented in Table 2 and outcomes are presented in Table 3. Thirty-seven percent of the patient cohort was female, and the mean age was 56.4 ± 11.7 years (mean ± standard deviation). The signs and symptoms of the patients were varied, but most patients experienced hand numbness (88.5%) and weakness (86.6%). An anterior surgical approach was the choice of treatment in 62.3% of cases, while three possible posterior surgical approaches were employed: laminectomy alone, laminectomy with fusion, and laminoplasty. Generally, the functional status of patients improved after surgery, with the mean mJOA rising from 12.7 at baseline to 15.2 post-operatively.

Table 2 Baseline clinical and surgical characteristics of the patient cohort.
Table 3 Summary of 2-year outcomes of the patient cohort, represented using mean ± standard deviation. Δ denotes the change at 2-years relative to baseline.

Three significant latent variables were identified by PLS. Bootstrap ratios for predictors and outcomes for each latent variable are presented in Table 4. The first latent variable (Fig. 1) demonstrated that the phenotype of patient with intrinsic hand muscle atrophy (z = − 3.33, p < 0.001) treated by a surgical approach other than laminectomy alone (z = 4.51, p < 0.001) had greater improvement at 2 years in physical health-related QOL, including the Physical Component Summary (z = − 9.98, p < 0.001), Physical Functioning (z = − 5.35, p < 0.001), and Role Physical (z = − 5.51, p < 0.001) domains of the SF-36. Figure 2 plots the second latent variable. This revealed that patients with respiratory (z = 3.43, p < 0.001), rheumatologic (z = 4.59, p < 0.001), and psychological (z = 4.09, p < 0.001) comorbidities had poorer improvement in functional status post-operatively, as measured by the mJOA scale (z = 7.26, p < 0.001). Finally, in the third latent variable (Fig. 3), a phenotype of a patient with more severe myelopathy at presentation (lower mJOA) (z = − 8.80, p < 0.001) presenting with a history of gait impairment (z = 4.59, p < 0.001) and exam findings of a broad-based, unstable gait (z = 3.66, p < 0.001) was associated with poorer mental health-related QOL, as evaluated by change in the Mental Component Summary (z = − 7.44, p < 0.001), Mental Health (z = − 7.25, p < 0.001), Vitality (z = − 3.66, p < 0.001), and Social Functioning (z = − 3.81, p < 0.001) domains of the SF-36 at 2 years.

Table 4 Bootstrap ratios for predictor and outcome variables for each latent variable.
Figure 1
figure 1

Significant contributions of the predictor (X) and outcome (Y) variables to the first latent variable. Mean values of outcome variables (bootstrap ratios) are shown with red diamonds while those for predictor variables are shown with blue diamonds. Error bars denote 95% confidence intervals.

Figure 2
figure 2

Significant contributions of the predictor (X) and outcome (Y) variables to the second latent variable. Mean values of outcome variables (bootstrap ratios) are shown with red diamonds while those for predictor variables are shown with blue diamonds. Error bars denote 95% confidence intervals.

Figure 3
figure 3

Significant contributions of the predictor (X) and outcome (Y) variables to the third latent variable. Mean values of outcome variables (bootstrap ratios) are shown with red diamonds while those for predictor variables are shown with blue diamonds. Error bars denote 95% confidence intervals.

Discussion

A critical strength of PLS is its ability to overcome problems relating to multicollinearity; therefore, unlike traditional regression techniques, PLS does not require any preformed hypotheses or assumptions regarding the importance of particular variables10, 11. As a purely data-driven method, PLS hence has the potential to unveil novel associations that may not be intuitive, hypothesized to exist, or readily apparent. Further, PLS is able to examine multiple outcome variables simultaneously and dissociate the unique associations of a set of predictor variables with each. Here, we have applied PLS to disentangle multi-dimensional associations between predictors and outcomes in patients undergoing surgical decompression for DCM. This analysis has revealed several novel findings warranting further exploration and investigation. Specifically, a PLS approach dissociated the interrelations between baseline variables and three distinct aspects of a patient’s outcome: (1) functional status; (2) physical QOL; and (3) mental QOL.

Older age has been associated with poorer functional outcomes in patients with DCM22, 23. However, older patients generally have greater medical comorbidities and poorer physiological reserves; and it is unclear from the current literature whether age in and of itself is predictive of outcomes, or whether this influence is a biproduct of the association of age with comorbid status24, 25. The findings of the current study, which controlled for a host of baseline variables and distilled multi-dimensional associations, would support the latter. Indeed, it is interesting that in our analysis, age was not a significant predictor, whereas the second latent variable found a phenotype of respiratory, psychological, and rheumatologic comorbidities was strongly associated with lesser gains in functional status, namely change in mJOA score, following surgical decompression. Recently, the concept of ‘frailty’ has gained traction, which refers to a multi-faceted loss of reserves (e.g., energy, physical ability, cognition, health) giving rise to vulnerability26. Frailty is perhaps best conceptualized as an assessment of one’s ‘physiological age’. Studies of patients undergoing spine surgery for various pathologies have found that frailty may be more important than chronological age in predicting clinical outcomes27, 28. Similarly, the findings of the second latent variable are perhaps most appropriately interpreted as increasing frailty may lead to poorer functional gains following surgery for DCM. An explanation may be that patients who are more frail are less able to translate neurological recovery into functional gains; in other words, in these patients, improvements in motor or sensory function may not necessarily translate into a meaningful change in functional abilities.

Previous studies have found that more severe DCM-related impairment and longer duration of symptoms are associated with poorer surgical outcomes; most of these studies have focused on mJOA score as the outcome22, 29, 30. Using PLS, the present paper was able to delve somewhat deeper and provide novel insights. From the current analysis, it appears that the severity of symptoms, but also the pattern of impairment, is important in prediction outcomes; and further, that there is a differential impact on physical versus mental QOL outcomes.

First, based on the third latent variable, patients with more severe myelopathy at baseline, as assessed by the mJOA scale, and in particular, with subjective and objective impairment of gait, may have poorer mental QOL outcomes. Multiple prior studies have identified the presence of gait impairment to be a negative prognostic factor13, 30; and further, there is data to suggest that gait is less likely to recover in patients with more severe myelopathy31. The presence of non-recovering gait deficits post-operatively could understandably impact one’s self-perceived quality of life, and the impact may be disproportionately borne by mental and emotional facets of QOL. The presence of significant unsteadiness and/or need for assistive ambulation devices, for example, could limit one’s ability to engage in social activities, both within and outside the home, adversely impacting one’s overall emotional well-being. Based on these findings, DCM patients presenting with significant gait impairment should be counselled on the poorer mental QOL outcomes in order to calibrate expectations; and moreover, it may be necessary to more closely monitor these patients for the development of depressive symptoms.

Second, from the first latent variable, presentation with hand muscle atrophy was associated with greater improvement in physical QOL following surgical decompression. Hand muscle atrophy in DCM is thought to be due to multi-segmental compression affecting the ventral nerve roots or anterior horn of the spinal cord31,32,34. In general, this is felt to be self-limited and respond favorably to intervention32, 35. Based on the current analysis, it appears that patients presenting with hand muscle atrophy may have significant impairments in physical QOL that improve substantially following surgical decompression. Interestingly, surgical approach was a significant variable within this latent variable, with approaches other than laminectomy alone being associated with greater improvements in physical QOL. This association may be explained by the fact that patients with hand muscle atrophy often present with multi-segmental compression of the ventral nerve roots warranting either an anterior or a posterior multi-level approach (e.g., laminoplasty, laminectomy with fusion), rather than a simple laminectomy alone31,32,33,35. Nonetheless, by the nature of PLS, it is not possible to determine the independent effects of hand muscle atrophy versus laminectomy alone within this latent variable. Rather, one may conclude that the presence of hand muscle atrophy went hand-in-hand with surgical approaches excluding laminectomy alone, and these together were associated with more favorable physical QOL outcomes. The corollary is that the use of laminectomy alone as a surgical approach was associated with lack of hand muscle atrophy as a presenting symptom, and these together were associated with poorer physical QOL outcomes.

A discussed previously, the key strength of this paper is the use of PLS to explain multi-dimensional variance. This approach has not been applied to examine associations between predictors and outcomes in DCM previously, and accordingly, has revealed several novel insights. Nonetheless, this paper does have important limitations. First, all patients included in this analysis were treated surgically, and accordingly, the associations unveiled here do not apply to the natural history of DCM; that is, patients who are treated non-operatively. Further, because PLS by nature is a data-driven approach, these findings are exploratory in nature and warrant further investigation and validation, ideally in a prospective fashion.

Conclusions

Using a data-driven approach, we were able to disentangle multi-dimensional associations between predictors and outcomes in patients undergoing surgical decompression for DCM. This revealed several novel insights: (1) comorbid status and frailty adversely impact gains in functional status; (2) presentation with hand muscle atrophy as a symptom is associated with greater improvements in physical quality of life; and (3) more severe myelopathy with gait impairment is associated with poorer mental quality of life outcomes. These findings warrant further investigation.