Introduction

Social inclusion is a fundamental human right that has posed serious challenges for patients with severe mental illness [1], constituting 5% of the global population [2]. Social inclusion has been described as “a multidimensional state where prevailing conditions enable full and active participation in all aspects of daily life” [3,4,5], where “multidimensional” is vaguely unrestricted to individual characteristics [6], relationships with other individuals and group environments [3, 6], subjective and objective living environments [7,8,9,10] and social-political rights [6]. Research on social inclusion is essential, as limited healthcare expenditures are spent on psychiatric disorder management (e.g., 0.47% of the Dutch total healthcare expenditures in 2017) [11,12,13], leaving problems other than clinical remission unattended. Such situation also applies to schizophrenia spectrum disorder (SSD) which accounts for one in five individuals with mental disorders [14, 15]. Guided SSD management elevates social inclusion, thus benefiting clinical and social recoveries, patients’ quality of life, and reducing burdens on healthcare organizations, and patient families. Therefore, investigating social inclusion for an indicative purpose, and identifying its predictive factors in SSD are crucial for promoting recoveries and preserving social inclusion in SSD.

Quantifying social inclusion and related concepts such as social capital, social participation and so forth [16] have faced challenges due to its multidimensional nature, ambiguous scope, conceptual complexity, lack of validated instruments and consequently absent application [16, 17]. We previously adopted a unidimensional construct of social inclusion through social functioning [18]. However, low social inclusion has been recognized in diversifying forms of low socio-economic status (e.g., unemployment, low education), a lack of functional recoveries, low social engagement, and quality of life (QoL), which have been studied as standalone conventional outcomes [19,20,21,22]. Therefore, as a broad holistic concept, social inclusion should be approximated by aggregating the aforementioned outcomes. In the present study, we expanded the uniform concept to multidimensional social inclusion by aggregating subscales of social functioning and additional QoL, to provide a more comprehensive evaluation of a patient’s social inclusion level. Hence, it remains to be studied whether patients’ subgroups of individuals experiencing similar mSI and the predictors of mSI subgroups are applicable.

Previous observational studies have identified factors and predictors of outcomes that are highly relevant to mSI, including socio-economic status (e.g., age, ethnicity, education, marriage status) [18, 21, 23, 24], genetic predisposition for SSD [18], early-life factors (e.g., premorbid adjustment and childhood trauma) [18, 24], disease profiles (e.g., types of diagnoses, cognition, symptoms, global functioning) [18, 21, 23,24,25,26,27], baseline work status [23], unmet needs [28], living environment [21, 29], and medication and substance use [18, 21]. Traditional approaches, such as regressions, have identified these (predictive) factors but have not been used in social inclusion prediction. In addition, as individual-level risk can maximize the utilities of prediction models and the intricate nature of mSI may benefit from a relaxed assumption of linearity, one solution is data-driven approaches. Due to their methodological advantages and focus on prediction accuracy, they have been increasingly applied in predicting SSD onset among ultra-high-risk populations [30, 31], psychosis outcomes including symptoms, treatment, relapsing and hospitalization [21, 32,33,34,35,36], and more recently, social aspects such as social recoveries, vocation, education and QoL [21, 36], with common algorithms such as support vector machine, decision tree and random forests. However, evidence has shown that their performance compared to standard approaches varies, depending on evaluation metrics relevant to the research question and/or clinical requirements [37,38,39,40]. Therefore, we hypothesized that mSI clusters exist within the SSD cohort. We also hypothesized that the data-driven approach might not perform worse and identify distinct predictors compared to the standard approach. We aimed to evaluate the predictability of mSI clusters by employing and comparing the performance of standard and data-driven approaches. We addressed firstly how many mSI subgroups can be identified in SSD. Secondly, what are the predictors of mSI, and which of the standard and data-driven approaches performs most accurately in predicting patients’ mSI at 3-year follow-up concerning model accuracy and discriminability.

Methods

Study design and population

The overall study design is illustrated in Fig. 1. To identify subgroups of mSI, K-means clustering was applied. To predict mSI subgroups, prediction models were built using multinomial logistic regression (ModelMLR) and random forest (ModelRF), which were then internally validated. Lastly, to compare the two models, simulations and individual-level inspections were conducted.

Fig. 1
figure 1

Study design and analytical framework. SSD schizophrenia spectrum disorder, mSI multidimensional social inclusion, SMOTE synthetic minority oversampling technique, MLR multinomial logistic regression, RF random forest

We used the Dutch-nationwide database Genetic Risk and Outcome in Psychosis (GROUP) project, data release 8.0. Details on the GROUP project structure, participant recruitment, data collection and ethical approval have been published elsewhere [41]. In brief, 1119 patients were recruited at baseline across 36 partner mental health institutes including four university medical centres in Amsterdam, Groningen, Maastricht, and Utrecht. The inclusion criteria were: (1) age of 16–50 years; (2) fluent Dutch speaking; (3) diagnosis of a non-affective psychotic disorder according to DSM-IV; (4) the first contact with mental health care service no longer than 10 years ago; and (5) being able and willing to give written consent. The measurements taken at baseline and 3-year follow-up were used. Loss of follow-up was not applicable due to the naturalistic design of the GROUP project. As data was collected when a patient was visiting the clinics.

Outcome

We defined mSI by the clusters derived from 13 subscales from two surveys: Social Functioning Scales (SFS) [42] and World Health Organization Quality of Life (WHOQOL-BREF) [43] (Supplementary Table S1), based on literature and data availability. These two validated well-developed self-reported questionaries were used to cover direct and indirect aspects of social inclusion in SSD. The baseline mSI was not available as SFS was not collected.

Predictors

Potential predictors were included at baseline and/or 3-year follow-up based on literature, experts’ opinions and data availability. The inclusion of the 3-year follow-up in the analysis was mainly attributed to inspecting the predictability of a predictor at the baseline on the outcome mSI after the 3-year follow-up [44]. Covariables include domains of WHOQOL-BREF measured at baseline (physical, psychologic, social and environmental domains) [43]. The variables conceptually overlapping with the outcome mSI were considered predictors to avoid complicating and invalidating the mSI outcome conceptualized on the two validated questionnaires. For detailed predictors and explanations see Table 1.

Table 1 Characteristics of study population and predictors

Data analysis and statistical modeling

To construct outcome mSI, all subscales of SFS and WHOQOL-BREF were standardized and used to cluster patients with K-means clustering (Fig. 1, Objective 1). The assumption of K, symbolizing the number of subgroups, was premised on statistical indexes (i.e., silhouette, duda, pseudot2, Hartigan and gap indexes with the Euclidean distance) [45] and supplemented with clinical knowledge. Subgroup (between clusters) difference was assessed by Kruskal–Wallis tests. The two-group comparison was examined by Dunn’s Kruskal–Wallis Multiple Comparison tests and Bonferroni adjustment.

Prediction models were parallelly constructed via multinomial logistic regression (ModelMLR, standard approach) and random forest (ModelRF, data-driven approach) (Fig. 1, Objective 2). In the ModelMLR, the subgroup with the best mSI level was chosen as the reference group because of our particular interest in the groups with relatively worse mSI levels under the probable mimicry of the subgroup with the best mSI to healthy controls. We presented odds ratios (ORs), confidence interval (95% CI), model performance including accuracy (i.e., 1-misclassification rate) and its 95% CI derived from bootstrapping. As for the ModelRF, we reported the variable importance of the identified predictors (i.e., derived from a variable-specific out-of-bag decrease in accuracy averaged over all trees after permutation), where a higher value indicates higher usefulness of a variable in prediction. We also reported the model performance metric, including accuracy in both training and testing and the P value of the one-sided binomial test. In addition, considering the imbalanced outcome, P values of the one-sided binomial test in both models were reported to examine if the model accuracy is significantly better than no information rate (NIR), suggesting if the model can allocate a patient into the right outcome group significantly better than classification by chance. Finally, complete-case sensitivity analyses were conducted and reported in Supplementary Results.

Two models were compared by accuracy and mSI-cluster discriminability (Fig. 1, Objective 3). We conducted simulations with 1000 repeats using a random draw of 30%, 50%, 70%, 80% and 90% of the total sample. Furthermore, the individual-level prediction accuracy and mSI-cluster discriminability of both models were also examined by scatterplots and confusion matrix. Such evaluation shares similarities with the use of the Area Under the Curve (AUC), which is typically employed for binary outcomes.

The data analyses were conducted using R version 1.4.1103[46]. Technical details including outlier inspection, missingness and imputation, statistical power, justification of the chosen algorithm and model constructions were illustrated in Supplementary Methods.

Results

Clusters of multidimensional social inclusion

Figure 2 demonstrates the centroid features of five subgroups identified: (1) “very low (social functioning)/very low (QoL)” cluster (VLL, 8.58% of patients); (2) “low/low” cluster (LL, 12.87%); (3) “high/low” cluster (HL, 49.24%); (4) “medium/high” cluster (MH, 18.05%); and (5) “high/high” cluster (HH, 11.26%) (Statistical indexes see Supplementary Table S2). Differences in 13 subscales among 5 groups were observed (Supplementary Table S3). To gain statistical power, we merged VLL and LL due to their similarities in low social functioning, ending up with four multinomial subgroups: “low/low” (LL, 21.45%), “high/low” (HL, 49.24%), “medium/high” (MH, 18.05%), and “high/high” (HH, 11.26%). Patient characteristics of all included patients and patients in each mSI cluster are shown in Table 1.

Fig. 2
figure 2

Centroid feature of subgroups of multidimensional social inclusion. Green color represents the seven subscales from SFS (Social Functioning Scale), and purple color represents the six subscales from WHOQOL-BREF (the abbreviated version of World Health Organization Quality of Life). VLL “very low/very low” mSI subgroup characterized by the lowest levels of social functioning and quality of life while the quality of life is even worse, LL “low/low” mSI subgroup featured by low levels of social functioning and quality of life but moderately better quality of life, HL “high/low” mSI subgroup with a high social functioning but low quality of life, MH “medium/high” mSI subgroup with a medium level social functioning but a relatively high level of quality of life, HH “high/high” mSI subgroup featured by the highest level of both social functioning and quality of life

Multinomial logistic regression model

The ModelMLR included 22 predictors (Table 2), among which PAS (LL vs HH, 1.66 [1.22, 2.26]; HL vs HH, 1.43 [1.07, 1.92]), PRSSCZ (HL, 0.95 [0.91, 0.99]; MH, 0.93 [0.88, 0.98]), presence of positive (MH, 0.91 [0.86, 0.96]), core negative symptoms (LL, 1.08 [1.02, 1.15]; HL, 1.07 [1.01, 1.13]), and frequency of depressive symptoms (MH, 0.58 [0.37, 0.91]), symptomatic remission (MH, 3.19 [1.90, 5.36]; HL, 0.37 [0.26, 0.54]), number of met needs (MH, 0.86 [0.78, 0.95]), baseline WHOQOL-BREF environment and social domains (MH, 3.09 [1.79, 5.35] and 1.55 [1.06, 2.27] respectively) were identified as some of the more important predictors. Overall, the model accuracy was 59.16% (bootstrapping 95% CI [55.75%, 62.58%]; P = 0.994).

Table 2 Effects (odds ratios) of predictors estimated by the multinomial logistic regression model

Random forest model

The ModelRF identified 22 predictors, among which WHOQOL-BREF domain scores and CTQ-total contributed the most to predicting the mSI group (Fig. 3). Observing the feature importance indices, other important factors were age, PRSSCZ, PAS-overall, symptom severity (positive, core negative and depressive) and the number of met needs. The accuracies on training and testing sets, respectively, were 70.46% ± 2.03% and 61.61% (95% CI [54.90%, 68.01%]; P =0.013).

Fig. 3
figure 3

Variable importance provided by the random forest model. WHOQOL-BREF the abbreviated version of World Health Organization Quality of Life, CTQ Childhood Trauma Questionnaire, Dutch Version, GAF Global Assessment of Functioning, PAS Premorbid Adjustment Score, PRSSCZ polygenic risk score for schizophrenia

Model comparison

For the simulation of 30% (up to 90%) of the observed patients, the mean accuracy of ModelMLR was 59.12% ± 2.50% (59.13% ± 1.20%) and the mean of ModelRF was 92.29% ± 1.34% (92.26% ± 0.67%; Supplementary Table S4). In the comparison of observed and predicted mSI subgroups among 1119 patients, 662 patients (59.16%) and 1033 patients (92.31%) were correctly predicted by ModelMLR and ModelRF correspondingly. We saw a similar pattern and percentage of mSI subgroup partition between the observations and RF-predictions, and contrarily, differences occurred between the observations and MLR-predictions (Fig. 4a). For example, a distinctly higher HL percentage with a (62.73% vs 49.24% observed, 48.88% RF-predicted, Fig. 4b).

Fig. 4
figure 4

Individual level of prediction results. The figure used the complete data of 1119 patients to assess the model on an individual level. a plotted pattern overview from the observed, MLR-predicted, and RF-predicted mSI clusters. b demonstrated the overall model accuracy (i.e., secondary diagonal sum) and the percentage of each mSI cluster which was correctly and incorrectly predicted by the ModelMLR and MdoelRF compared to the observations. From the figure below, we observed comparable distributions between the observed and RF-predicted mSI clusters. Dissimilarly, the MLR-predicted mSI clusters displayed a higher proportion in HL in trade of apparent lower percentages in LL and HH. LL mSI subgroup with both low social functioning and quality of life, HL mSI subgroup with high social functioning and low quality of life, MH mSI subgroup with medium social functioning and high quality of life, HH mSI subgroup with both high social functioning and quality of life, MLR multinomial logistic regression, RF random forest, obs observations

Discussion

We identified five mSI subgroups, including VLL, LL, HL, MH and HH. The ModelMLR and ModelRF consistently regard PRSSCZ, premorbid adjustment, symptoms, baseline environment, social domains and numbers of met needs as paramount predictors for mSI at 3-year follow-up. Comparatively, the ModelRF is cautiously considered better for its discriminability of all mSI subgroups. The mSI clusters intend to preliminarily define and bring awareness to social inclusion, an understudied but crucial outcome in SSD. The mSI prediction models should be further improved and externally validated for possible profound clinical and policy implications.

Clusters of multidimensional social inclusion among patients with schizophrenia spectrum disorder

Among the five mSI subgroups, the “very low (social functioning)/very low (QoL)” (VLL) and “low/low” (LL) indicated a low mSI level. VLL demonstrated the lowest social functioning and QoL in the spectrum with an even worse QoL, while LL exhibited relatively low levels of social functioning and QoL but slightly higher QoL. The VLL and LL combined, totalling 21.45%, presented more prominent core negative symptoms and worse premorbid adjustment compared to other mSI subgroups. Given that approximately 80% of the participants were diagnosed with schizophrenia [47] and a recent meta-analysis has indicated a 32.19% global prevalence in deficit schizophrenia [48], characterized by primary and persistent negative symptoms that distinguish it from non-deficit schizophrenia [49], the amalgamated group might align with deficit schizophrenia. Furthermore, VLL and LL were merged for the analysis, yet disparities exist. Patients in VLL, compared to LL, were more likely to be non-Caucasian, genetically vulnerable for SSD, have affective psychosis, depressed, have lower IQ, have higher levels of childhood trauma and symptoms, and showed less remission. This may eventually lead to restricted access to the labor market, recreational activities, and social engagement with simultaneously affected QoL and mSI.

The “high/low” (HL) and “medium/high” (MH) implied a medium mSI level. Compared to LL, HL showed a better level of social functioning, particularly in the areas of independence performance, interperson, recreation and prosociality. This improvement could be attributed to their higher premorbid adjustment, slightly less severe symptoms (especially core negative symptoms), and higher rates of symptomatic remissions. However, LL and HL had similar levels of QoL (moderately lower than the average QoL of all patients with SSD), with HL displaying a distinctively lower satisfaction toward general health, environment, and physical conditions. The disparities between LL and HL might suggest that the patients in HL probably experience increased social exposure through recreational and social activities along with psychologic impacts such as internalized stigma, and a systematically perceived low QoL [50]. Differently, MH exhibited an average social functioning level and mildly higher QoL than the average QoL of all patients with SSD, which could be resulted from better IQ, less severe symptoms, stable remission over six months and fewer met needs. No significant difference in occupation was observed among LL, HL and MH which corroborates with previous studies [51, 52]. Contrarily, HL and MH with better function were likely to stay at a job. The final mSI subgroup, “high/high” (HH), showed a high mSI level with the highest levels of social functioning and QoL, suggesting that this subgroup mimics healthy controls, albeit on an overall lower-level contrary to healthy controls. Notably, patients with medium-to-high mSI (HL, MH and HH) demonstrated fewer core negative symptoms, better premorbid adjustment and a higher rate of symptomatic remission over six months. These characteristics could correspond to diagnostic categories such as acute and transient psychotic disorders (ATPD) and non-affective acute remitting psychosis (NARP), marked by abrupt onset of psychotic symptoms within two weeks and early, complete remission [53,54,55].

Therefore, mSI is essential, as a holistic approach, to provide a comprehensive overview of social inclusion of an individual. The subgroup characteristics could also guide intervention strategies. Patients in VLL and LL may require more psychosocial interventions to manage symptoms and improve social functioning. Priority should be given to their eligibility for protected living when independent living is not achievable. While patients in HL could be targeted with current programs aiming for long-term remission and stigma reduction, patients in MH and HH could benefit from training for advanced skills and opportunities for more challenging job positions, which enhance self-esteem and self-actualization, ultimately improving mSI.

Common factors predictive of multidimensional social inclusion

The ModelMLR identified predictors such as gender, ethnicity, current urbanicity, antipsychotic dose, among others. In contrast, the ModelRF selected predictors including baseline all QoL domains, childhood adversity, and change of total week with cannabis consumption in the past 12 months among others. Noteworthily, eight important common predictors were shared between both models. Congruent with previous studies [18, 56,57,58,59], in ModelMLR, more severe core negative symptoms increased the risk of having low-to-medium mSI (LL and HL relative to HH). Premorbid adjustment had the highest negative effect on low-to-medium mSI in ModelMLR (LL and HL relative to HH). Thus, premorbid adjustment showed moderate predictability of the 3-year mSI. This is aligned with earlier studies which have shown that worse premorbid adjustment may lead to poorer social outcomes later in the course of SSD [60,61,62,63,64]. Therefore, premorbid adjustment is undoubtfully vital and can be potentially used for screening of low mSI. Yet, it has not caught enough attention in the field given the limited literature.

Surprisingly, worse positive symptoms significantly predicted good mSI (i.e., MH relative to HH) with a mild protective effect. Other studies have found that positive symptoms do not contribute much to QoL or social cognition [65,66,67], although the cross-sectional symptomatic remission [68] can significantly improve social functioning [69]. A higher genetic vulnerability toward SSD displayed a significant protective effect on good mSI (HL and MH relative to HH). This is possibly due to the single comparison in a relatively low sample size of HH in a multivariate model, which may occupy the variability of mSI concerning PRSSCZ, and consequently yielded a dubious finding. Therefore, the relationship between genetic predisposition and mSI should be independently investigated in well-powered research. Counterintuitively, we found that the more often an SSD patient experienced depressive symptoms, the more likely the patient was to be in high mSI (HH) than in medium mSI (HL) in the ModelMLR, which is discordant with previous studies [70,71,72]. One possible explanation is that in the diagnostic categories of SSD, patients with a higher level of depressive symptoms are more likely to have affective symptoms and to be associated with affective dysregulation, which results in a better outcome (mSI in our case) than the one of the patients with non-affective symptoms such as withdrawal in HL [73]. Specifically, the early detection and interventions of depressive symptoms could be essential to help patients with SSD, which might further impact their lives and subsequent mSI. All the aforesaid factors were confirmed informative in the ModelRF as well.

Abundant studies have emphasized the importance of occupation (and thus financial income and opportunities for acquiring new skills) and social relationships [74, 75], secure stable housing, family support [76] and inclusive and accessible support systems across sectors such as transportation [76, 77] for social inclusion. On the other hand, a higher level of fulfilled needs significantly distinguished MH and HH only in ModelMLR. Particularly in the ModelRF, the number of met needs was considered necessary. Therefore, with a growing emphasis on extramural care [78], the local communities and mental health organizations need to incorporate the heterogeneous environmental and social needs, beyond the medical needs, of patients with SSD at different mSI levels.

In contrast to the univariable analysis, we identified the duration of psychosis as a predictor with limited contribution in the ModelMLR or even with zero importance in the ModelRF for predicting mSI. This might be influenced by the presence of the other important predictors exemplified by QoL-related factors and childhood trauma (Fig. 3). The exclusion of variables related to duration of psychosis such as age at baseline and age of onset from the ModelRF did not improve the low importance of duration of psychosis. While previous studies have suggested an association between a shorter duration of psychosis and favorable changes in symptomatic remission and social functioning [79, 80], limited and conflicting evidence hinders confirmation of the association between chronicity and symptom severity, functioning and QoL [80, 81]. Therefore, the duration of psychosis might be less relevant, resulting in a modest contribution to predicting mSI at 3 years in multivariable models.

Despite many common factors that were selected in the both models, PRSSCZ, positive, negative, depressive symptoms, premorbid adjustment, baseline environment and social-domain satisfactions and the number of met needs were found to be crucially predictive of mSI.

Model performance in prediction

The predictivities of ModelMLR and ModelRF were fair and comparable. We inferred the ModelRF outperforms as it allocated individuals to the correct mSI cluster significantly better than chance alone, suggesting caution in applying the ModelMLR as some predictors in the ModelMLR were indiscriminative to mSI subgroups except for the HL cluster due to the imbalanced mSI outcome. Furthermore, although ModelRF did not perform as well as expected, earlier studies using data-driven methods have reported similar accuracies of 60%-75%, highlighting the complexity of SSD. The holistic mSI measure (compared to a single clinical outcome) and a longer time interval of 3-year (compared to 1 year) may be influenced vastly by various factors and their interactions, making mSI prediction challenging. To be integrated into clinical practices through electronic health records (EHR) (Supplementary Clinical Illustration) [82], the ModelRF requires external validation through international data-sharing efforts [83]. Despite its replicability, the model implementation may encounter a prolonged journey. Aligned with the minority of readily implementable psychiatry prediction models, the ModelRF requires inputs easily obtainable in clinical settings [84]. However, addressing the challenge of accurately and efficiently computing PRSSCZ from available and affordable genotype data remains essential, especially when considering the opportunity costs of assisting individuals facing poor social inclusion [83].

Future perspectives

Future studies should continue working on the conceptualization of mSI and examining its applicability across diagnoses. Meanwhile, developing a validated composite score could enable longitudinal monitoring. Methodologically, future studies should take extra steps in modeling procedures such as outer cross-validation and different feature-selection algorithms and give opportunities to the latest interpretable machine learning algorithms for exertion to pick up next-level clinical utility. Clinically, when building prediction models, future studies should test the utilities of potential factors that measure similar clinical outcomes but with slight variations in submodels and understand the multidimensional mechanisms hidden under the effect sizes such as premorbid adjustment. Furthermore, investigations on the genetic effects on behaviors and mSI along the course of SSD could be necessary for early screening. Statistically, other observable and non-observable factors, such as personality traits, coping strategies, diversity of community residents, community social-economic status, relationship with caregivers, and so forth, might simultaneously be more essential for improving the mSI prediction. With growing awareness of social inclusion and the development of sophisticated prediction models for mSI, along with personalized interventions and supporting policies, patients with SSD would be able to acquire necessary skills and receive essential resources. This would consequently aid them in managing their conditions and achieving great inclusion in society.

Study strengths and limitations

We quantified the multidimensional nature of social inclusion by combining thirteen subscales of the self-reported SFS and WHOQOL-BREF without the intention of developing a validated questionnaire. Our conceptualization emphasizes the multidimensionality that provides a comprehensive overview of an individual’s status of social inclusion through a broad range of activities, the perception of an individual, and the exploitation of the existing large cohort and standardly collected data. The longitudinal measurements in the cohort were utilized. We also compared standard and data-driven models to examine the robustness and enhance the credibility of the factors and predictability of mSI. However, no baseline mSI was available to investigate mSI changes over time. The models require external validation and could be improved with more non-clinical data.

Despite previous efforts in developing conceptual frameworks for social inclusion, it is vital to acknowledge that the constitution of social inclusion is nebulous, implying a variable boundary of this multifaceted construct. The available data do not provide much flexibility or balance the eligibility and validity of the elements used for the construct. No interview was conducted to preliminarily determine the most relevant scope of social inclusion.

Conclusion

We introduced mSI which is backboned by social functioning and quality of life, resulting in five identified subgroups including “very low/very low”, “low/low”, “high/low”, “medium/high”, and “high/high”. We found that genetic predisposition for SSD, premorbid adjustment, positive, negative and depressive symptoms, number of met needs and baseline satisfaction with the environment and social life were robust factors predictive of mSI in SSD. We cautiously concluded that the ModelRF offered a better prediction, compared to the ModelMLR, of the 3-year mSI among patients with SSD due to its better discriminability. Yet, continuous model refinement and external validation are still required.

Our findings indicate that mSI is applicable and offers possibilities for personalized treatment strategies and policymaking tailored for patients with SSD at different mSI levels. Our study emphasizes the special proposition of mSI as an imperative goal for patients with SSD and a possible solution to expensive healthcare and societal harmony.