Background

Recombinant human thrombomodulin (rhTM) has been suggested as an adjunct therapy for patients with sepsis [1]. A recent randomised controlled trial (RCT) failed to demonstrate its beneficial effect on 28-day mortality [2], but there remains controversy in the results of this study due to the heterogeneity of its study population. Indeed, 22% of patients in the RCT did not meet protocol-specified coagulopathy. In addition, an updated meta-analysis including the RCT reported an association between rhTM use and a lower risk of mortality [3]. These findings collectively suggest the importance of appropriately targeting the study population prior to conducting studies to gain maximum benefit [4,5,6]. We identified a distinct phenotype that could be a potential target of rhTM therapy [7], a finding consistent with previously suggested targets, including coagulation disorder and high disease severity [8, 9]. However, for application in the clinical setting, a simple tool for determining this target is necessary [10]. Thus, we aimed to develop and validate a model for predicting the potential target phenotype for rhTM therapy and to implement the model as a web-based application to facilitate further research.

Methods

Study design and settings

The concept of this study is shown in Fig. 1. Details of this study are provided in Additional file 1. This was a secondary analysis of the following multicentre registries: the Japan Septic Disseminated Intravascular Coagulation (JSEPTIC-DIC) study (42 ICUs at 40 institutions, 2011–2013) [11], Tohoku Sepsis Registry (10 institutions, 2015) [12], and Focused Outcomes Research in Emergency Care for Acute Respiratory Distress Syndrome, Sepsis, and Trauma (FORECAST) sepsis study (59 ICUs, 2016–2017) [13]. These studies were approved by the institutional review boards at the participating hospitals, and the need for informed consent was waived.

Fig. 1
figure 1

Current study (development and implementation of a prediction model of rhTM target phenotype)

Study samples

We included all patients (aged ≥ 16 years), who were admitted to the ICU with severe sepsis or septic shock as defined in the three registries, according to the International Sepsis Definitions Conference Criteria [14, 15]. We excluded patients with missing information on 28-day mortality, which is required for determining the phenotype of each patient [7].

Predictors

We used the following coagulation markers for predicting the presence of the target phenotype, in accordance with our previous study [7]: platelet counts, PT-INR, fibrinogen, fibrinogen/fibrin degradation products (FDP), and D-dimer.

Outcomes

The primary outcome was the presence of the clinical phenotype identified in our previous study [7], characterised as severe physiological status and organ dysfunction (high Acute Physiology and Chronic Health Evaluation [APACHE II] and Sequential Organ Failure Assessment [SOFA] scores), coagulopathy (low platelet count, prolonged PT-INR, low fibrinogen, and extremely high FDP and D-dimer levels), high lactate level, and high mortality. We termed this phenotype as “rhTM target phenotype”.

Statistical analysis

We derived our prediction model using the JSEPTIC-DIC study and Tohoku Sepsis Registry (derivation cohort) and validated the model using the FORECAST sepsis study (validation cohort). We imputed missing predictors using the random forest method with the missForest package (Additional file 2: Table S1) [16]. We did not calculate the sample size in advance because we used all available data. The sample size for model development (n = 3694, of which 9% had the target phenotype) was enough to ensure precise predictions and minimise overfitting [17].

For model development, we divided the derivation cohort into the training set (70% of the full sample randomly chosen for model development and hyperparameter tuning) and test set (30% of the full sample randomly chosen for internal validation). Using the training set, using log-transformed predictors, we constructed a prediction model with XGBoost. We used the grid search strategy to identify the best combination of hyperparameters using the ranger and caret packages with tenfold cross validation.

We measured the prediction performance of the developed model by computing the (1) C statistic (i.e., the area under the receiver operating characteristic [ROC] curve) and (2) prospective prediction results.

In addition, among patients those who were predicted to have the potential target phenotype (termed as “predicted target patients”), we assessed the effect of rhTM on in-hospital and 28-day mortality using a generalised estimating equation to account for patient clustering within hospitals. The adjusted variables were selected according to the previous study [7] (see Additional file 1). We also reported the number of patients who met the inclusion criteria for the SCARLET trial (cardiovascular and/or respiratory dysfunction, and PT-INR > 1.4 and a platelet count in the range from 30 to 150 × 109/L) to illustrate the difference in the target study population between studies [2].

Lastly, we uploaded the model online (URL: http://research-kudo-prediction.s3-website-ap-northeast-1.amazonaws.com/), so researchers interested in utilising the model could access it for free. All analyses were performed with R statistical software version 3.6.1 (R Foundation for Statistical Computing).

Results

Patient characteristics were similar between predicted target patients and patients with rhTM target phenotype (Table 1). However, predicted target patients in the current study were likely to have milder coagulopathy. Approximately 8–9% of patients had the rhTM target phenotype. Overall, patients who met the inclusion criteria for the SCARLET trial accounted for 20–30% of rhTM target phenotype.

Table 1 Characteristics and clinical course of patients with sepsis in the derivation and validation cohorts

Using the test set of the derivation cohort, we found that the C statistic of the developed model was 0.993 (95% CI 0.989–0.997). Prospective prediction results were as follows: sensitivity 0.968, specificity 0.955, positive predictive value 0.669, and negative predictive value 0.997. Figure 2 shows the prediction ability of the developed model in the validation cohort. Using the validation cohort, we found that the model had high discrimination (C statistic, 0.996; 95% CI 0.993–0.998). Prospective prediction results were as follows: sensitivity 0.991, specificity 0.967, positive predictive value 0.754, and negative predictive value 0.999.

Fig. 2
figure 2

The receiver operating characteristic curve of the developed model for predicting the presence of the target phenotype in the external validation cohort

Among predicted target patients in the validation cohort, rhTM use was associated with a lower in-hospital mortality (adjusted risk difference, − 31.3% [− 53.5 to − 9.1%]; Table 2).

Table 2 Unadjusted and adjusted risk difference between recombinant thrombomodulin use and outcomes among predicted target patients

Discussion

We derived and validated a machine learning model that accurately predicts the rhTM target phenotype in patients with sepsis and released it online for clinical and research use. The C statistic was 0.994 in the validation cohort, with a sensitivity of 0.981 and a specificity of 0.944. The predicted target patients were likely to have milder coagulopathy compared to those with rhTM target phenotype.

The importance of considering the heterogeneity in the study population and the treatment effects has been emphasised in recent years [6]. As shown in the analysis of multiple sepsis registries and RCTs [5], clinical phenotypes were correlated with host-response patterns and clinical outcomes, and simulations suggested the presence of heterogeneity in treatment effects across phenotypes. Thus, such heterogeneity may at least partially explain the underlying mechanisms of RCTs that failed to reveal significant benefit of therapies in critical care [18, 19]. Indeed, patients who met the inclusion criteria for the SCARLET trial accounted for 20–30% of the patients with rhTM target phenotype, suggesting that further studies are needed to investigate the effects of rhTM for sepsis. Additionally, the process of identifying the target population to be treated is important and should be discussed in future cost–benefit analyses of treatment strategies, even if a small proportion of patients can be treated effectively (as was the case in our study sample).

Subgroup analyses have been widely used to address treatment effect heterogeneity despite its limitations [20]. In particular, conventional subgroup analyses assess one characteristic at a time, which may not reflect the biology or clinical practice where multiple factors often act synergistically [6]. To address this concern, several approaches have been proposed: clustering algorithms to identify distinct clinical phenotypes, Bayesian hierarchical models, and adaptive enrichment [6]. Building on these works, we used a clustering approach in a previous study to address the heterogeneity in our study population. While it is still challenging to find the true phenotypes that are responsible for the heterogeneity, we believe that our research process: (1) discovering the target phenotype, (2) implementing a model for predicting the phenotype, and (3) conducting studies for identifying the optimal target population or exploring underlying mechanisms—is an efficient way of conducting future studies and advancing personalised medicine. For example, our findings support the findings from a post hoc analysis of the SCARLET trial that reported an association between higher baseline thrombin generation biomarker levels and the effect of rhTM [9], by demonstrating that a subtype consisting of a high-dimensional coagulation profile could be a potential target of rhTM.

This study has several limitations. First, although we developed a model to predict the rhTM target phenotype, it remains unclear whether the rhTM target phenotype is the true target of rhTM therapy. Second, there may be diagnostic suspicion bias and unmeasured confounding. Additionally, the number of missing variables for prediction may have limited our findings. Thus, our findings should be validated in randomised controlled trials. Third, because machine learning models are generally difficult to interpret, our model itself does not provide information on the underlying mechanisms. Finally, our data were obtained from Japanese patients, and the generalisability of the results to other populations may be limited.

Conclusions

We developed a model that accurately predicted the rhTM target phenotype. Our model is available online, which could profoundly benefit clinicians and researchers investigating the heterogeneity in the treatment effects of rhTM and its mechanisms.