Introduction

Endovascular thrombectomy (EVT) has been established as a standard practice in large-vessel occlusion (LVO) stroke patients [1]; however, often more than one retrieval attempt is needed in order to achieve complete reperfusion [2]. It has previously been described that reperfusion within the first retrieval attempt will lead to a better clinical outcome [2,3,4,5], especially if achieved by the first-pass retrieval [5,6,7]. One differentiating factor was that previous studies included all reperfusion grades; hence, presenting several confounders that are difficult to adjust for time of ischemia, which differs if reperfusion is achieved after first retrieval vs. multiple retrievals [8] and reperfusion grade [9] as well as reperfusion status after each retrieval (e.g. TICI 2b after the first two retrievals with final TICI 3 score after third attempt vs. TICI 0 after the first two retrievals and sudden TICI 3 reperfusion after third attempt) [10].

In the present study, we screened a multicenter stroke registry and selected patients who showed no reperfusion (TICI 0) in order to minimize the effect of the abovementioned confounders. We hypothesized that in this subset of patients higher numbers of retrieval attempts are associated with inferior clinical outcome.

Material and Methods

Patient Selection

A total of 6635 patients from the German Stroke Registry—Endovascular Treatment (GSR-ET 06/2015-12/2019; ClinicalTrials.gov Identifier: NCT03356392) were screened for inclusion. The GSR-ET is an ongoing, open-label, prospective, multicenter registry of consecutively recruited EVT patients, with 25 participating stroke centers in Germany [11].

The inclusion criteria for this study were: (1) acute large-vessel occlusion (LVO) stroke in patients > 18 years, (2) decision to perform EVT, (3) available data for admission National Institutes of Health Stroke Scale (NIHSS) score; administration of i.v. thrombolysis; number of retrieval attempts; final TICI score documented by DSA, and for the outcome to be assessed according to the modified Rankin scale (mRS) at 90 days (mRS90), (4) documented final TICI score of 0.

Study protocols and procedures were conducted in compliance with the Declaration of Helsinki and in accordance with ethical guidelines (the leading ethics committee of the Ludwig-Maximilians University Munich approved the GSR-ET, as well as the approval from the local ethics committees of the participating hospitals).

Endovascular Thrombectomy

Endovascular thrombectomy was performed according to the clinical routine of the referring stroke center and in accordance with current guidelines [11, 12]. The technical approach was chosen by the attending neurointerventionalist who also made the decision when to terminate the procedure or to perform multiple retrieval attempts.

Data Acquisition and Management

Data acquisition was performed according to the protocol of the GSR-ET, as has been previously described [11,12,13]. In summary, data were collected by local neurointerventionalists and neurologists and subsequently underwent standardized quality checks to control for consistency, plausibility, and integrity.

The final TICI score was assessed on the last DSA series by the attending interventionalist. The number of retrievals was documented by the neurointerventionalist immediately after the intervention and included aspiration attempts as well as retrievals with stent retriever devices. The mRS score was assessed at 90 days.

Statistical Analyses

All analyses were performed with the R statistics program (v4.0.5, R Core Team 2021, Vienna, Austria; Rstudio IDE v. 1.4.1106, Boston, MA, USA). Normally distributed variables are displayed as mean and standard deviation (SD), and were compared with the Welch’s t‑test for unequal variances. Non-normally distributed data are displayed as median and interquartile range (IQR) and were compared with the Kruskal Wallis or Mann-Whitney-Wilcoxon tests. Categorical variables are reported as proportions and were compared by means of the χ2-test or Fisher’s exact test, when appropriate.

The primary outcome of interest was functional independence, defined by mRS at day 90 (mRS90) of 0–2 (see Table 1). The secondary outcome was the mRS90 as factor variable on an ordinal scale. The main explanatory variable of interest was the number of retrieval passes performed, which was dichotomized at the median number of retrieval attempts for all patients. In order to robustly estimate the effect of retrieval passes on mRS90 while controlling for key pretreatment patient characteristics, we performed propensity score matching (PSM) [14] using the MatchIt package [15] with a 1:1 ratio without replacement, using the nearest neighbor matching algorithm with a caliper width of 0.25, which is appropriate for estimating the average treatment effect in the treated population [16]. Propensity scores were calculated using the following covariates based on previous publications and clinical relevance: age, sex, prestroke mRS of > 1, NIHSS at admission, location of occlusion and administration of i.v. thrombolysis [5, 17]. The same covariates were used for fitting a generalized linear model with binomial link function (primary analysis) with functional independence as the dependent variable on the original cohort (Table 2). Likewise, as a secondary analysis, this same model formula was used in an ordinal logistic regression model with the proportional odds assumption [18] (Supplemental Table 1). As a sensitivity analysis, the binary logistic regression model was additionally tested on a subset of the initial (non-matched) collective that incorporated all available baseline variables with less than 10% missing values (Supplemental Table 2). P-values < 0.05 were considered statistically significant and were not adjusted for multiple testing due to the explanatory nature of this investigation.

Table 1 Comparison of clinical and treatment characteristics before and after propensity score matchinga
Table 2 Summary table of predictor estimates of the binary logistic regression model with functional independence (mRS at day 90 of 0–2) as the dependent variable

Results

Of 6635 patients screened for inclusion, 377 met the inclusion criteria (7.8%). Mean age was 75 (±12.8) years, median NIHSS score on admission was 15 (IQR 9–19), and median number of device passes was 2 (IQR 0–4). A good clinical outcome (mRS ≤ 2) was observed in 41 patients (10.9%) and the mortality rate was 54.6% (median mRS90 of 6).

Propensity score matching (PSM) was performed at the median of device passes over all included cases. Thus, cases with > 2 device passes (n = 129) were considered “treated” and paired with available controls receiving ≤ 2 device passes (n = 129) that had the closest propensity score to them. Unmatched control cases were excluded from univariate analyses (n = 119). Baseline data as well as clinical and procedural outcome measures for unmatched and matched patient groups are displayed in Table 1.

Before PSM age, time from groin puncture to final TICI and NIHSS at 24 h were significantly different between groups, while NIHSS at discharge remained significant (p = 0.007) throughout pair matching. Of 129 matched patients with ≤ 2 retrieval attempts, 18 (14%) had a good clinical outcome, compared to only 5 patients (3.9%) with > 2 retrieval attempts (Table 1, Fig. 1). This difference was highly significant in univariate analysis (OR 0.29, 95% CI 0.07–0.73, p = 0.009). There was no significant difference in the occurrence of symptomatic intracranial hemorrhage (sICH) between the two groups with 4 (3.1%) for ≤ 2 retrieval attempts compared to 9 (7.1%) for > 2 retrievals (p = 0.25).

Fig. 1
figure 1

Ordinal modified Rankin scale after 90-day follow-up (mRS 90) in patients without reperfusion (TICI 0) after ≤ 2 vs. > 2 retrieval attempts (n = 258, 1:1 propensity-score matched patients). Functional independence (mRS 0–2) was observed in 18/129 patients (14%) with ≤ 2 retrieval attempts, and 5/129 patients (3.9%) with > 2 retrieval attempts

In the multivariable binary logistic regression fitted on complete cases (n = 357, Table 2) of the original cohort using the same covariates as for the PSM, the primary outcome (mRS ≤ 2, functional independence) was less likely to be achieved in patients with > 2 retrievals (OR 0.2, 95% CI 0.07–0.52, p = 0.002) after adjusting for confounders. Furthermore, older age (OR 0.95, 95% CI 0.92–0.97, p < 0.001) and higher NIHSS scores on admission (OR 0.89, 95% CI 0.83–0.95, p < 0.001) were associated with lower odds of functional independence (Table 2).

Similarly, in the secondary analysis using multivariable ordinal logistic regression (Supplemental Table 1), a higher mRS90 was associated with > 2 retrievals (OR 1.68, 95% CI 1.08–2.64, p = 0.022). Furthermore, older age (OR 1.05, 95% CI 1.03–1.07, p < 0.001), a pre-stroke mRS of > 1 (OR 3.02, 95% CI 1.73–5.42, p < 0.001), and a higher NIHSS on admission (OR 1.10, 95% CI 1.07–1.14, p < 0.001) were associated with higher mRS scores on day 90.

As a sensitivity analysis, a multivariable binary logistic regression model was fitted including the complete cases (n = 319) of all baseline variables from Table 1 with less than 10% missing values (Supplemental Table 2). Again, functional independence was less likely in patients with > 2 retrievals (OR 0.15, 95% CI 0.04–0.48, p = 0.003). Older age per year (OR 0.95, 95% CI 0.91–0.98, p = 0.007) and higher NIHSS scores on admission (OR 0.87, 95% CI 0.80–0.94, p < 0.001) were associated with lower odds of functional independence. Atrial fibrillation narrowly missed the significance threshold for good a clinical outcome (OR 4.57, 95% CI 0.99–24.64, p = 0.062).

Discussion

This study investigated the influence of the number of retrieval attempts performed during mechanical thrombectomy for acute ischemic stroke on clinical outcome in patients with no reperfusion (TICI score of 0). We found a significant association between more than two retrieval attempts and higher mRS scores at 90 days, even after propensity score matching and correcting for age, sex, pre-stroke mRS, NIHSS score on admission, location of occlusion and administration of i.v. thrombolysis.

Several studies have shown a decline in good clinical outcome if multiple retrieval attempts are performed and have reported best outcome results if reperfusion was achieved by just one retrieval, also known as the first-pass effect [2,3,4,5,6,7]; however, these studies included all reperfusion grades, making the analysis susceptible to bias. The number of retrievals is associated with increased procedure time [8] and patients with early successful reperfusion will have shorter ischemia times than patients with delayed reperfusion. Hence, it could be the procedure duration and thereby the time of brain ischemia as a latent variable that is responsible for an impaired clinical outcome, not the number of retrievals. Furthermore, these studies included TICI 2b and TICI 3 reperfusion grades were declared as “successful reperfusion”; however, substantial differences in clinical outcomes have been reported between TICI 2b and TICI 3 cases [9]. Furthermore, the situation is complicated by the fact that not only first-pass reperfusion but sudden reperfusion at later points in time (from TICI 0 to TICI 2b/3) is also associated with improved clinical outcome [10].

These confounders have been addressed by Nikoubashman et al. [7], who only included patients with TICI 3 reperfusion and verified their results in a matched-pair analysis. They suggested that indeed the number of device passes rather than the procedure time itself is associated with the clinical outcome; however, the authors stated that differences in patient characteristics cannot be ruled out completely.

To address these potential biases we exclusively included patients with a TICI score of 0 and performed 1:1 propensity score matching as well as detailed multivariable analyses of patients with more than two retrieval attempts, adjusted for typical confounders. We could thereby confirm the results of the previous retrospective studies that the number of retrieval attempts is independently associated with a worse clinical outcome. As Nikoubashman et al. stated [7] there are two possible explanations for this phenomenon:

First, the first-pass effect could be real and the number of retrievals increases the risk of vessel injury and subsequent symptomatic intracranial hemorrhage (sICH) [19,20,21]; however, we did not observe a statistically significant increased number of sICH in our study, although this comparison might lack statistical power due to the small number of patients in our cohort. Furthermore, each retrieval attempt is associated with risk of dispersing small distal emboli, which could lead to impaired clinical outcome [22, 23]. Other possible reasons include the prolonged procedure time, the need for induction of general anesthesia and increased amounts of contrast agents [24, 25].

Second, the first-pass effect could be an epiphenomenon and the number of retrieval attempts merely the byproduct of other causes of impaired clinical outcome, e.g. for tandem occlusions of the cervical ICA and the M1 segment more retrieval attempts are performed compared to single M1 occlusions. Therefore, we adjusted for the location of occlusion in our multivariable analysis. Furthermore, the clot histology has been associated with reperfusion results, higher number of retrievals and thrombus fragmentation which might impair the clinical condition [26]. We could not correct for clot histology or composition in our analyses as it is not part of the registry.

Another important point is the phenomenon of spontaneous reperfusion which is associated with improved outcome [27]. It has been reported that in the first 24 h from symptom onset, 24% of untreated patients and 46% of patients with iv-rtPA achieve successful reperfusion [28]. Accordingly, some of the patients with TICI 0 at the end of EVT might be reperfused later on. A hypothesis to explain our findings is that multiple unsuccessful retrieval attempts diminish the chance of subsequent spontaneous reperfusion due to vessel wall damage or mechanical alteration of the occluding thrombus. This could be tested by performing vessel imaging in a cohort of TICI 0 patients on day 1 after EVT, to assess the reperfusion rate depending on the number of retrievals performed; however, these data are not included in the GSR-ET.

The most important bias in retrospective trials assessing the number of retrieval attempts remains true for the present study: the reasons for performing multiple attempts vs. terminating a procedure are heterogeneous, for example one might perform multiple attempts in a young patient with large penumbral tissue as opposed to an older patient with small penumbral tissue. Furthermore, the reasons for failure of reperfusion and therewith inclusion in the present study are heterogeneous and include difficulties in accessing the intracranial vessels, passing the thrombus/occlusion, and mobilizing/removing the thrombus as well as the event of reocclusions [13, 29, 30]. Due to the study design, there was no information on the technical approach of EVT (e.g. aspiration or stent-retriever as first-line approach) available.

The results of the present study support the hypothesis that extended endovascular intervention with increased number of retrieval attempts leads to a poorer clinical outcome. This might impact treatment decision making, if after several retrieval attempts, late successful reperfusion also does not result in an improved outcome [5]. To define the optimal number of retrieval attempts is beyond the scope of this study and would require a randomized controlled trial to correct for the abovementioned biases.

Conclusion

In patients with failure of reperfusion, more than two retrieval attempts were independently associated with a worse clinical outcome after adjusting for age, sex, admission NIHSS score, and the site of occlusion.