Background

Venous thromboembolism (VTE) is a common disease with high morbidity and mortality [1], including deep vein thrombosis (DVT) and pulmonary embolism (PE), VTE is the third most common cardiovascular disorder. The incidence of VTE is concealed and its prognosis is poor. At the same time, the increasing incidence rate showed a trend of younger patients [2, 3]. We should focus not only on the many influencing factors of VTE [4,5,6,7,8], but also on its early identification and early intervention [9, 10]. Studies have shown a lack of clinical VTE care standards for inpatients and the low VTE prevention rate which indicate that further improvement is needed [11]. Approximately 50% of VTEs are provoked by immobilization, trauma, surgery, or hospitalization in previous 3 months [12,13,14,15], and 20% associated with cancer while 30% unprovoked [16,17,18]. VTE has many risk factors which are constantly multiplied [19,20,21,22,23,24,25]. Currently, Caprini Risk Assessment Scale is widely used in clinical practice. However, genetic and environmental differences between Western and China limit the validity of Caprini Risk Assessment Scale in Chinese patients. Medical records which contain rich information about disease progression, are useful in mining new risk factors related to VTE patients. Each patient will undergo a series of laboratory tests upon admission, and the blood test indicators of VTE patients will be abnormal in varying degrees [26,27,28]. The timely detection of abnormal change will facilitate the VTE occurrence risk assessment and enable early warning and intervention. Therefore, based on characteristics of VTE patients in China, this study screened out VTE risk early warning indicators other than the traditional scale, and established a VTE Risk Warning Model. This study was to effectively achieve the primary prevention of VTE and provide a scientific theory for VTE prevention.

Methods

Study population

The study conducted from January 1, 2017 to June 30, 2018 in Affiliated Hospital of Nantong University. The inclusion criteria were as follows: patients diagnosed with VTE during hospitalization, age ≥ 18 years, hospitalization time ≥ 2 days (48 h), and clinical medical records completed. Patients who had VTE before admission, those with superficial vein thrombosis, and those who used anticoagulants were excluded. Finally, we included 257 VTE patients. Two hundred fifty-seven age- and disease-duration- matched non-VTE patients of the same period were also included in this study and built the modeling data set. In addition, 63 VTE patients and 85 non-VTE patients from July 1, 2018 to December 31, 2018, were selected for a validation data set, identical inclusion criteria with the former modeling data set. This clinical research protocol complies with relevant provisions of Helsinki Declaration on the protection of the rights and interests of subjects.

Study design

The modeling data set was analyzed by T / χ2 test and Logistic regression analysis. According to VTE independent risk factors screened by Logistic regression analysis, a VTE risk warning model was constructed, and the cut-off value of VTE screening obtained. The cut-off value of VTE risk warning model was used to verify the screening efficacy of VTE risk warning model for VTE patients in the validation data set.

Assessments

Since Logistic regression using maximum likelihood estimation (MLE) method for regression coefficient estimation is sensitive to multicollinearity, a high degree of multicollinearity will lead to a great change in coefficient estimation value or symbol. Therefore, multicollinearity analysis needed to be carried out before multivariate analysis. The variance expansion coefficient (VIF) diagnostic method is one of the common methods. Generally, VIF > 5 indicates multicollinearity existence. In a large sample, VIF > 10.

The Gold Standard is required in Diagnose Test to distinguish the experimental group and control group. The effectiveness of VTE risk early warning model was evaluated by four-division table of diagnostic data. The evaluation indicators mainly include Sensitivity (Sen), Specificity (Spe), Accuracy (Acc), and Youden Index (Youden’ s Index). Receiver operating characteristic curve (ROC) is a widely accepted criterion. The area under curve (AUC) below 0.6 means low discrimination, 0.6 to 0.75 medium discrimination, and above 0.75 high discrimination. The high AUC represents high model accuracy.

Statistical analysis

Statistical analysis adopted SPSS 20.0. In univariate analysis, all statistical variables of VTE group were compared with corresponding variables of control group in order to determine the P value of all statistical variables. Measurement data were described by (X ± S), and count data by frequency. The categorical variables were tested by χ2 test. The calibration was performed byχ2 test or Fisher exact probability method. And continuous variables were tested by t test or t2 test. Logistic regression was used in multivariate analysis. In this model, the variables were selected based on the results of univariate analysis. Variables in univariate analysis that were hypothesis-tested P < 0.3 (in order to prevent missing possible early warning indicators) and consistent with previous documents and clinical experience were included in the follow-up multivariate analysis. In order to simplify the model, stepwise regression method was adopted to screen model variables. The regression method was set as “Forward: LR”, and test level α = 0.05 was specified for introducing variables into the model and 0.10 for removing variables from the model. Logistic regression was used to obtain the regression coefficient, standard error, chi-square value of Wald, P value, corresponding OR value and 95% confidence interval of the possible predictors. The independent risk warning index of VTE screened by Logistic regression analysis was used to construct the VTE risk warning model, and its screening efficiency was evaluated and compared by ROC. External validation of VTE risk warning model was carried out by using the four-division of diagnostic data to evaluate its test effectiveness. P < 0.05 was considered statistically significant.

Results

Univariate analysis of VTE risk warning indicators

According to relevant literature and clinical practice, the VTE risk warning indicators which are not included in Caprini score scale mainly include four parts as following: (1) General indicators, including gender, patient origin, nationality, payment methods, length of stay (days); (2) Related indicators of current medical history, including 17 variables such as cough, expectoration, hemoptysis, dyspnea, pleural chest pain, cyanosis, pain in the precardiac area, palpitations, shortness of breath after exertion, chest tightness and shortness of breath, syncope with unknown cause, pleural effusion, unilateral lower limb Pain, deep venous tenderness in the lower limbs, pigmentation in the lower limbs, walking fatigue in the lower limbs, and increased local skin temperature in the lower limbs; (3) Relevant indicators of previous history, mainly including 7 variables such as hypertension, diabetes, smoking, systemic connective tissue disease, renal insufficiency, liver disease (hepatitis or liver damage), anemia; (4) The relevant indexes of the laboratory inspection items, mainly including 11 variables such as prothrombin time (PT), thrombin time (TT), activated partial thrombin time (APTT), Fibrinogen (FIB), Fibrinogen Degradation Product (FDP), International Normalized Ratio (INR), D-Dimerization, Albumin, platelet count, white blood cell count, number of red blood cells. In addition, we took the Caprini score as a risk warning indicator in the univariate analysis. The detailed results are shown in Table 1.

Table 1 Single factor analysis of VTE risk warning indicators in modeling dataset

Multivariate analysis of VTE risk early warning indicators

Univariate analysis was performed on 41 variables, of which there were 15 variables with statistical significance of P < 0.05. In order not to omit possible VTE risk early warning related variables, increase the sensitivity of risk early warning model and allow more possible variables to be included in the variable, the variables with P < 0.3 in the univariate analysis or consistent with literature reports and clinical experience were included in the subsequent multivariate analysis. Therefore, we adopted a total of 28 variables. After colinear analysis, all variables had VIF less than 3, it can be considered no co-linearity among VTE risk warning indicators, which can be included in multi-factor logistic regression analysis, as shown in Table 2.

Table 2 Colinearity analysis of 28 VTE risk warning indicators including multivariate analysis

The above 28 variables with P < 0.3 were included in Logistic regression analysis, and up to 7 independent risk warning indicators were screened out, namely pleural chest pain X10 (P < 0.001), shortness of breath after exercise X14 (P = 0.045), Chest tightness and shortness of breath X15 (P < 0.001), unilateral lower extremity pain X18 (P < 0.001), smoking X25 (P = 0.005), fibrinogen degradation product X34 (P < 0.001), Caprini score X41 (P = 0.004). The logistic regression was used to obtain regression coefficient, standard error, Wald chi-square value, P value, its corresponding OR value, and 95% confidence interval of the independent risk warning indicators, as shown in Table 3.

Table 3 Logistic regression parameter estimation of patients in VTE group and control group

Construction of VTE risk warning model

According to above results The model independent variable assignment method was shown in Table 4. The final VTE risk warning model was as follows:

$$ \mathrm{p}={\mathrm{e}}^{\mathrm{x}}/\left(\ 1+{\mathrm{e}}^{\mathrm{x}}\ \right), $$
$$ \mathrm{x}=-4.840+2.557\bullet {\mathrm{X}}_{10(1)}+1.432\bullet {\mathrm{X}}_{14(1)}+2.977\bullet {\mathrm{X}}_{15(1)}+3.445\bullet {\mathrm{X}}_{18(1)}+1.086\bullet {\mathrm{X}}_{25(1)}+0.249\bullet {\mathrm{X}}_{34}+0.282\bullet {\mathrm{X}}_{41} $$
Table 4 The way to evaluate the value of the clinical variable of the VTE risk warning model

Where e was the logarithm of natural numbers;

Pleural chest pain X10, shortness of breath after exercise X14, chest tightness and shortness of breath X15, unilateral lower extremity pain X18, smoking X25 and other variables were binary values (not specific medical history, 1 for yes, 0 for none). The unit of fibrinogen degradation product (X34) was (μg/ml). Caprini score (X41) was based on Caprini risk assessment scale, with no unit.

Evaluation and comparison of VTE risk warning model test efficacy

According to VTE risk warning model formula, the predicted probability of VTE occurrence was calculated by ROC curve analysis. The area under ROC curve (AUC) was 0.960 (95% CI: 0.940, 0.976), the standard error was 0.009, and Z = 52.279. The Hosmer-Lemeshow test (H-L test) was performed on the VTE risk warning model, and the χ2 was 55.441.

Caprini risk assessment scale and VTE risk warning model were used to predict the VTE truncation value (95% CI), which were 5 (4,5), 0.438 (0.263, 0.504), respectively. The VTE sensitivity was predicted to be 26.1 and 92.6% each, specificity 96.5 and 91.8%, accuracy 61.3 and 92.2%, and Youden index 0.23 and 0.84. AUC values were 0.596 (95%CI: 0.552, 0.638) and 0.960 (95%CI: 0.940, 0.976). The difference between above two groups was statistically significant (Z = 14.521, P < 0.0001), as shown in Fig. 1.

Fig. 1
figure 1

Comparison of ROC curves of VTE screening by Caprini risk assessment scale and VTE risk warning model

External validation of VTE risk warning model

The validation data set included 63 VTE patients and 85 non-VTE patients. There was no significant difference in the distribution of general clinical variables between validation data set and modeling data set (P > 0.05), which avoided the deviation of results due to uneven distribution of clinical variables.

The validation data set was substituted into the established VTE risk warning model formula to calculate the prediction probability of the occurrence of VTE in each patient, and the model truncation value was used to evaluate the efficiency prediction of validated data set. The sensitivity was 77.8%, specificity 84.7%, accuracy 81.8% and Youden index 0.625. It indicated that VTE risk warning model had a higher prediction efficiency both in the internal population and external population.

Discussion

VTE risk early warning model

In this study, multivariate logistic regression analysis was performed on general information, medical history data, blood parameters, and Caprini scores of 257 VTE patients to screen out VTE risk warning indicators other than Caprini assessment scale. A clinical diagnostic model was developed, including 4 medical history data, 2 laboratory data, and 1 scale score. The area under ROC curve (AUC) of this model was 0.960 (95%CI: 0.940, 0.976). A good disease risk prediction model was not just a simple mathematical combination of dependent and independent variables, but also had actual clinical importance behind it. Our original intention was to reach high prediction efficiency, differentiation ability and sensitivity of the prediction model. The most commonindicator evaluating the discriminability of prediction models was AUC, also known as the C statistic. The larger AUC, the better the discriminant ability of the prediction model. AUC < 0.6 indicates poor differentiation, 0.6–0.75 certain differentiation ability, and > 0.75 good differentiation ability. This research model showed high prediction efficiency for VTE (AUC = 0.960). In this study, a validation data set consisting of 63 cases of VTE patients and 85 cases of non-VTE patients was selected for model validation. The accuracy of VTE risk warning model for VTE prediction was 81.8%. The high AUC of this prediction model may be related to the modeling data of VTE group and control group matched 1: 1. Of course, the incidence of VTE was relatively low in the actual clinical process, which required continuous improvement and adjustment in a wider range of later use in order to reach clinical value maximization. The sensitivity of this model to VTE early warning was significantly higher than that of Caprini risk assessment scale. For VTEs in life-threatening situation, early identification will benefit the most. Therefore, the warning model’s high sensitivity was in line with expectations. Although the degree of sensitivity and specific it was often difficult to achieve perfect synchronization state of ideal, the early warning model of VTE specificity was 5% lower than Caprini risk assessment scale. It indicated that some of the non-VTE patients with risk factors were identified by some early warning indicators, leading to a certain amount of false positives.

Clinical status of VTE early warning mechanism

In order to take timely and effective measures to prevent the occurrence or further progress of VTE, clinicians and nurses should be kept informed of VTE early warning, including high risk of occurrence and early identification. There were many methods for clinical VTE evaluation, each with a certain scope of application and the results are barely satisfactory. A retrospective single-center study on patients who underwent thoracic surgery showed the areas under the receiver operating characteristic (ROC) curve of Caprini was 0.74 (P < 0.0001), Rogers 0.52 (P = 0.62), Padua 0.69 (P < 0.0001), and Khorana 0.64 (P = 0.0017), respectively [29]. In another study, ROC indicated that the Caprini score showed a significant but moderate relationship to VTE (AUC = 0.64; p = 0.004) [30]. Other studies had reached similar conclusion [31,32,33]. Though, many embedded VTE warning software has been developed and integrated with electronic medical record system but such software was mostly based on Caprini Risk Assessment Scale, or Padua Assessment Scale, etc. [33,34,35,36]. Vyas et al. [37] adopted the analysis way of Ishikawa Fishbone Diagram, and found that main reason for the improper prevention of DVT were the lacks of unified standard specifications, the computerized input system for doctors’ orders and effective risk assessment methods [38]. Also, others improved the Caprini Risk Assessment Scale [31, 32], but a lot of useful information in the electronic medical record system was not really used. These evaluation scales had not passed domestic large-scale clinical certification, and the accuracy and sensitivity of VTE screening were not very high. Therefore, the embedded automatic assessment and early warning system designed based on these scales usually have some inherent deficiencies.

Clinical significance of VTE early warning model

The prevention and treatment of VTE is a hot topic in the medical field, and it’s also a difficult point in clinical work. The VTE prediction model were established with purpose of making accurate assessment and diagnosis of VTE in the first time and avoiding adulterating human factors as much as possible. We know that there is a lot of VTE-related information in the electronic medical record system [27, 33,34,35, 39,40,41], and such information needs to be further explored and fully utilized in the clinical VTE warning. The VTE risk warning model made full use of Caprini risk assessment scale, which was widely used in clinical medicine and surgery, with the important clinical symptoms and signs of VTE patients and laboratory examination indicators, to carry out comprehensive and multi-dimensional warning and achieve higher prediction efficiency. The work intensity of Chinese medical staff is very high, and it’s a great challenge to monitor patients’ conditions consistently. We screened six independent warning indicators except the Caprini score, including pleural chest pain, shortness of breath after exercise, chest tightness and shortness of breath, unilateral lower extremity pain, smoking, fibrinogen degradation product. We set up standard terms and captured the records of standard terms in electronic medical record system in order to establish electronic active alarm system which can prompt doctors and nurses to take timely responses. It is of great clinical importance to develop embedded electronic VTE active alarm systems based on VTE risk warning model.

The deficiency and prospect of this research

We could not avoid the sample selectivity bias caused by the retrospective study. During this study, prothrombin time, D-dimer, and leukocytes in blood biochemical indicators were statistically important in univariate analysis, but they failed to enter the model during multivariate analysis. In addition, several articles had shown that platelets, inflammatory indicators, and the ratio of certain cell counts were also important in VTE early warning. Therefore, many blood biochemical indicators in clinical had potential value in the prediction and warning of VTE, which needs to be proved by more high-quality studies. This study only explored newly discovered independent warning indicators of VTE, and the mechanism of each warning indicator needs to be further studied. In addition, due to the limitation of various factors in the single-center study, the all-dimensional and multi-dimensional VTE risk warning model based on series of clinical comprehensive indicators needs to be constantly improved, verified and promoted in more centers and larger samples.

Conclusions

In this study, VTE risk warning model includes seven independent risk factors, namely pleural chest pain, shortness of breath after exercise, chest tightness and shortness of breath, unilateral lower extremity pain, smoking, fibrinogen degradation product, Caprini score. A high early warning effect has been verified on VTE in hospitalized patients and the VTE risk warning model has certain clinical application value.