Introduction

With the emergence of endovascular thrombectomy (EVT) as part of standard treatment for acute ischemic stroke (AIS) caused by a large vessel occlusion (LVO), occluding thrombo-emboli have become available for histopathologic analysis. Insight in the relationship between thrombus composition and stroke etiology could be of value for secondary stroke prevention, as the appropriate preventive treatment depends on stroke etiology. Several causes of AIS have been identified, and the most prevalent being large artery atherosclerosis and cardiac embolism; both of which may lead to thrombo-embolic occlusions in the intracranial circulation. Stroke etiology currently remains unknown in approximately 30–50% of patients despite extensive clinical workup [1]. If a clear association exists between thrombus composition and stroke etiology, examination of extracted thrombi could be useful to guide therapeutic choices for secondary prevention of recurrent stroke [2].

Various previous studies have studied the relationship between thrombus composition and stroke etiology [3,4,5,6,7,8,9,10,11,12,13,14,15,16,17,18,19,20,21]. Most studies so far, however, did not find an association, possibly due to small sample sizes [3, 5, 6, 8, 11,12,13,14,15, 17]. While several studies did find an association between thrombus composition and etiology, they yielded contradictory results: some reported a higher amount of red blood cells (RBC) in thrombi associated with large artery atherosclerosis than in thrombi of cardiac or unknown origin [9, 10, 16, 18, 20, 21], while others reported the opposite with a higher amount of RBC in thrombi related to a cardiac source of embolization [7, 19, 22].

The aim of this study was to assess the relationship between thrombus composition and stroke etiology in a large population of AIS patients, treated with EVT for a large vessel occlusion of the anterior circulation, in daily clinical practice. In contrast to previous studies, we were able to compare clinical baseline characteristics of patients with and without thrombi available for analysis.

Patients and methods

Study population and sample selection

The MR CLEAN (Multicenter Randomized Clinical Trial of Endovascular Treatment for Acute Ischemic Stroke in the Netherlands) Registry was a prospective observational study of all patients who underwent EVT for AIS in the Netherlands (Appendix) [23]. Enrolment started March 2014, directly after the final inclusion in the MR CLEAN trial [24]. All patients undergoing EVT for AIS in the anterior or posterior circulation (defined as at least entry into the angiography suite and receiving arterial puncture), in one of the sixteen centers performing EVT in the Netherlands, were registered. The central medical ethics committee evaluated the study protocol and granted permission to carry out the study as a registry. For this histopathologic substudy, patients who met the following criteria were included: age 18 years and older, a proximal intracranial vessel occlusion in the anterior circulation as shown on CT angiography (CTA), availability of clinical and imaging data for the assessment of stroke etiology, and available thrombus for histological assessment. Data of patients treated until 15 June 2016 were collected and analyzed for this study.

Thrombus analysis

After EVT, the thrombi were immediately stored in 4% buffered formaldehyde before embedding in paraffin. The thrombi were mostly retrieved by mechanical thrombectomy using a stent retriever, with only 14.8% using aspiration thrombectomy. For each paraffin block, 5-µm sections were cut at two depths, generally at a depth of 170 and 230 µm (Microm HM335 S, Microm International GmbH, Waldorf, Germany), as it has been shown that partial sectioning of the thrombus provides a good estimate of thrombus composition [25]. The two sections at these depths were collected on a single slide, stained by hematoxylin–eosin (HE), digitized at 20 × magnification (228 nm/pixel, Hamamatsu Nano-Zoomer, Hamamatsu Photonics K.K., Hamamatsu City, Japan), and images were stored as raw Hamamatsu.ndpi datafiles.

An analysis of stained sections was performed using Orbit Image Analysis software (Orbit Image Analysis, Idorsia Ltd.) [26]. Orbit enables the analysis of native Hamamatsu.ndpi files and utilizes machine learning algorithms for image segmentation, classification, and quantification. First, a unified foreground/background segmentation model was trained to exclude background from further analysis. Using one unified classification model for all thrombi yielded less classification accuracy due to slight differences in staining between samples; therefore, separate classification models were created for each slide containing two sections. All models were trained to quantify percentages of red blood cells (RBC), fibrin and platelets (F/P), and white blood cells (leukocytes), as the main components of extracted EVT thrombi [3, 5,6,7,8,9,10,11,12,13,14,15,16,17,18,19, 22]. The weighted average of the two sections was considered representative for the whole thrombus [25]. Orbit was then used in conjunction with a custom script to enable batch analysis of all sections with individual classification models applied for each slide, generating both a Jason data file containing the quantification results per sample and a classification overlay image per sample (Fig. 1) to allow for visual inspection of the classification accuracy. Thrombus sections with large central defects in RBC rich sample regions, generally difficult to cut, were corrected by manually outlining and measuring these defects and adding the missing RBC surface area after comparison with the other section, and the original paraffin blocks (biomedical scientist (HH), validated by a pathobiologist (HB)).

Fig. 1
figure 1

Examples of generated classification overlay images into RBC (red), F/P (green), and leukocytes (blue) to allow for visual inspection of the classification accuracy using Orbit. The top three images are original HE stained images; the bottom three images are the semi-automatically segmented images

Assessment of stroke etiology

All patients underwent CTA or magnetic resonance angiography (MRA) of the cervical arteries before EVT. Furthermore, 12-lead ECG followed by ECG-monitoring for at least 24 h was performed. Additional etiologic workup was performed in accordance with local protocols. Stroke etiology was determined based on data provided in the discharge letters and imaging by two trained observers who were blinded for histological thrombus composition. The presumed cause of stroke was determined for each patient using the TOAST criteria as a guideline: large artery atherosclerosis (TOAST 1), cardioembolism (TOAST 2), stroke of other determined cause (TOAST 4), and stroke of undetermined cause (TOAST 5). In our cohort of patients who underwent EVT for AIS, there were no patients with small vessel disease as cause of stroke (TOAST 3). A patient was considered to have large artery atherosclerotic stroke, if there was > 50% atherosclerotic stenosis or atherosclerotic occlusion at the bifurcation of the carotid artery on the symptomatic side. Patients were considered to have undetermined stroke etiology, if more than one possible cause was identified; if no cause was identified despite complete workup, as described above; or if diagnostic workup was incomplete.

For statistical analysis, each patient was allocated to one of three predefined etiologic categories: (a) “non-cardioembolic” for both large artery atherosclerotic disease and other determined causes (TOAST 1 and 4), (b) “cardioembolic” (TOAST 2), or (c) “undetermined” (TOAST 5). For graphical representation of thrombus composition within groups, we further subdivided cardioembolic stroke into medium-risk or high-risk based on the evidence of the relative propensity for embolization, according to the original TOAST classification [27]. We also subdivided non-cardioembolic into large artery atherosclerosis, carotid artery dissection, and other determined cause. Stroke of undetermined etiology was subdivided in “ > 1 cause” (more than one possible cause was identified) and “cryptogenic” (no cause was identified despite complete workup, as described).

Statistical analysis

Clinical characteristics and histological thrombus composition were described using standard statistics. Since thrombus composition did not follow a normal distribution, a Kruskal–Wallis test was performed first to assess differences in thrombus composition between all three etiologic groups (cardioembolic, non-cardioembolic, and undetermined). The association of histological components (percentages of RBC, F/P, and leukocytes) with stroke etiology (non-cardioembolic stroke, cardioembolic stroke, and stroke with undetermined etiology) was estimated with univariable and multivariable multinomial logistic regression and presented as (adjusted) odds ratios (aOR) with 95% confidence intervals (CI), with cardioembolic stroke as the reference category. We adjusted for potentially relevant differences in baseline characteristics by performing Kruskal–Wallis, χ2 test and Fisher-Freeman-Halton tests and including characteristics with a p-value of < 0.20 in the multivariable models (supplemental Table S1). For the regression analyses, single imputation was performed for missing values (supplemental Table S2). To assess the representativeness of the patients with an available thrombus to all patients who underwent EVT in clinical practice, we compared patient, clinical, and imaging characteristics of the included patients from the MRCLEAN Registry to those who were not included using ANOVA and Mann–Whitney U tests. Statistical analyses were performed using Stata (StataCorp. 2017. Stata Statistical Software: Release 15.1 SE. College Station, TX: StataCorp LLC) and SPSS (IBM Corp. Released 2020. IBM SPSS Statistics for Windows, Version 27.0. Armonk, NY: IBM Corp).

Results

Patient population

Thrombus samples of 332 patients from the MR CLEAN Registry were included for histological analysis in this study (Fig. S1). Baseline characteristics of these patients did not differ with those not included from the registry. Importantly, there were also no statistically significant differences in stroke etiology. For included patients, stroke etiology was categorized as cardioembolic in 114 patients (34.3%), non-cardioembolic in 58 patients (17.5%), and undetermined in 160 patients (48.2%), compared to 364 (30.5%), 199 (16.7%), and 631 (52.8%), respectively, in patients from the registry who were not included for histological analysis (Table 1).

Table 1 Baseline characteristics for patients from the MR CLEAN cohort with histological analysis as compared to those without. ANOVA was performed; p-values > 0.05 are printed bold

Differences in thrombus histology between etiologic groups

Histopathologic evaluation showed the thrombi typically contained areas being predominantly RBC, F/P, or leukocyte rich (Fig. 2). A large heterogeneity in overall composition was seen in our cohort (supplemental Fig. S2). For all thrombi, median RBC content was 27.1% (IQR 15.9–42.4), median F/P content was 67.0% (IQR 53.1–78.1) and median leukocyte content was 4.8% (IQR 3.0–7.1). RBC content in thrombi from patients with non-cardioembolic etiology (median 41.2%, IQR 20.5–53.0), was higher than in thrombi from patients with a cardiac etiology (median 25.8%, IQR 13.8–38.2) and thrombi from patients with an undetermined origin (median 25.4%, IQR 14.3–39.0). Inversely, the F/P content in thrombi from patients with non-cardioembolic etiology (median 54.5%, IQR 55.1–80.3) was lower than in thrombi from patients with cardioembolic etiology (median 67.1%, IQR 55.1–80.3) and thrombi with an undetermined origin (median 70.0%, IQR 57.1–78.2). Using a Kruskal–Wallis test for group differences, a significant difference was seen between the three etiologic groups for RBC content (p < 0.001) and F/P content (p = 0.002), but not for leukocyte content (p = 0.24) (Table 2 and Fig. 3). These differences were more pronounced for high-risk cardioembolic etiologies (supplemental Fig. S3). After correction for baseline differences in the regression analysis, RBC and F/P significantly differed between non-cardioembolic and cardioembolic strokes. Increased RBC (OR, 1.02 [95% CI 1.01–1.05]) and decreased F/P (OR, 0.98 [95% CI 0.95–0.99]) were associated with non-cardioembolic stroke, as opposed to cardioembolic stroke (Table 3). In other words, for every 1% increase in RBC content, the OR for a non-cardioembolic stroke (as opposed to cardioembolic stroke) was 1.02 (Fig. 4). Leukocyte content did not differ between non-cardioembolic and cardioembolic stroke. Furthermore, we did not observe any differences in thrombus composition between strokes with undetermined etiology and cardioembolic strokes in the univariable regression analyses (OR 1.00 [95% CI 0.99–1.01] for RBC, OR 1.00 [95% CI 0.99–1.01] for F/P and OR 1.00 [95% CI 0.94–1.07] for leukocytes) and multivariable regression analyses (Table 3).

Fig. 2
figure 2

Examples of RBC rich (A), F/P rich (B), mixed (C), and leukocyte rich (D) thrombus areas. The inserted boxes represent magnifications of the underlying area of each example. HE-stain, bar = 200 µm (AC), 100 µm (D), and 25 µm (inserts). RBC are red, F/P purple, and nuclei of leukocytes blue

Table 2 Thrombus composition stratified by stroke etiology
Fig. 3
figure 3

Bar-whisker plot of thrombus composition for the three etiologic groups. P-values are given based on Kruskal Wallis test. x = outliers > 1.5 times box height. Significant differences were found for both erythrocyte content (p < 0.001) and fibrin/platelet content (p = 0.002), but not for leukocytes (p = 0.240)

Table 3 Univariable and multivariable multinomial logistic regression for the relationship of thrombus composition with stroke etiology
Fig. 4
figure 4

Likelihood of cardioembolic (CE), non-cardioembolic (NCE) and undetermined (Undet) etiology (Etio) based on erythrocyte content after univariable multinomial regression. For every 1% increase in RBC content, the OR for a non-cardioembolic stroke (as opposed to cardioembolic stroke) was 1.02

Discussion

We studied the association between histopathologic composition of mechanically extracted thrombi and stroke etiology in patients with AIS included in the Dutch MR CLEAN Registry. RBC and F/P contents of thrombi retrieved from patients with AIS differed significantly between strokes of cardioembolic and non-cardioembolic origin. The RBC content in thrombi from patients with non-cardioembolic etiology was higher than in thrombi from patients with a cardiac etiology and thrombi from patients with an undetermined origin. Inversely, the F/P content in thrombi from patients with non-cardioembolic etiology was lower than in thrombi from patients with cardioembolic etiology and thrombi with an undetermined origin. In addition, cardioembolic thrombi had a similar histopathologic composition to thrombi from strokes of undetermined origin.

Our findings of a significant correlation between stroke etiology and thrombus composition are in line with several previous studies [10, 16, 20, 21]. In our cohort, the wide confidence intervals and overlap in composition hamper reliable prediction of stroke etiology based on thrombus composition alone in individual cases. However, an increasingly high percentage of RBC in thrombi is associated with a higher likelihood of a non-cardioembolic etiology.

Like some previous studies [9, 10, 16, 21, 28], a similarity in thrombus composition was found for thrombi of cardioembolic origin and those of undetermined origin. Our results suggest that patients with an undetermined stroke origin and F/P rich thrombi have a higher likelihood of a cardiac source and may benefit from more extensive monitoring for arrhythmias and/or extended cardiac analysis.

Contradictory results in previous studies regarding the relationship between thrombus histology and etiology may in part be caused by random variation in relatively small samples, and a lack of consensus on histopathological processing and analysis [29]. Indeed, studies with larger sample sizes did find significantly higher fractions of RBC in ischemic stroke caused by large artery disease [10, 16, 20, 21], while smaller studies often failed to show such a correlation. Interestingly, all previous studies that found a higher RBC percentage in cardioembolic stroke were performed in an Asian population [9, 30, 31]. Lastly, many previous studies included thrombi from patients with posterior circulation stroke [3, 6, 8,9,10, 12, 13, 15, 18], while the determination of large artery atherosclerosis as a cause of AIS is based on stenosis grading of the anterior circulation.

Recently, studies have started to make use of machine learning software for image analysis [28, 30, 32], which implies training and validating a segmentation model first. Differences in accuracy of these segmentation models will affect quantification of thrombus components. In contrast to previous studies, we used a custom script in conjunction with Orbit image analysis software which enabled visual quality control in batch mode, enabling the possibility of verification of the segmented components RBC, F/P, and leukocytes by an experienced pathobiologist (HB). This diminished uncertainties normally inherent to the use of machine learning algorithms for segmentation purposes, ensuring high quality thrombus classification.

Our study has several limitations. Due to the ex vivo nature of thrombus analysis, all studies investigating the association between thrombus composition and stroke etiology theoretically suffer from a selection bias, since data from patients with thrombi resistant to thrombectomy as well as thrombi completely decomposed by IVT are not available for analysis. This study is the first to also have clinical data of all patients in the cohort without thrombus material available for analysis, enabling us to compare characteristics of included patients to those who were not included in the study. Indeed, this comparison was not possible in prior studies, including STRIP [20], through lack of sufficient clinical data in the LVO patient group as a whole. It is a feature that is available in, and a unique characteristic of, the MR CLEAN studies. We can therefore validate that no significant differences were found in stroke etiology between the two groups, and only minor differences in patient baseline characteristics were found. Therefore, we believe our results are valid for the general stroke population eligible for EVT. Using only HE staining makes a reliable differentiation between fibrin and platelets impossible; therefore, these components were combined into one category, F/P. Differentiation of platelets as a separate component and even adding more in-depth immunohistochemical analysis or even 3-D characterization of thrombi could certainly be of interest, as it has been recently shown that platelet content also differs between etiologic groups [32]. Also, since most patients received IVT prior to mechanical thrombectomy, it is possible that (partial) thrombolysis altered thrombus composition; we did, however, adjust for IVT in our regression analysis. Lastly, using the TOAST classification possibly underestimates the number of patients with large artery atherosclerosis as the cause for stroke, since it only values the degree of stenose at the carotid bifurcation. However, we did adhere to this classification, as it is the most used classification system in practice, enabling comparison of our results with previous studies.

Conclusion

Thrombus composition is significantly associated with stroke etiology, with an increase in RBC and a decrease in F/P, raising the odds for a non-cardioembolic cause. Secondly, thrombus composition of cardioembolic and undetermined etiology are similar. These results suggest that patients with an undetermined origin and F/P-rich thrombi are more likely to have a cardiac cause and may benefit from more extensive monitoring for arrhythmias and/or extended cardiac analysis.