Introduction

Significant advancements have been made in the management of varices, with preventive measures and early interventions being key pillars in reducing variceal bleeding-related morbidity and mortality1. Traditionally, upper gastrointestinal endoscopy has been the gold standard for diagnosing varices and assessing their severity. However, this invasive procedure has drawbacks, including patient discomfort, associated risks, and limited accessibility in certain healthcare settings2. In recent years, non-invasive techniques for predicting varices needing treatment (VNT) have emerged as promising alternatives. Among these novel approaches, spleen stiffness assessment has garnered increasing attention. The spleen plays a pivotal role in the pathophysiology of portal hypertension, acting as a blood reservoir, and alterations in its stiffness have been linked to the presence and severity of varices3.

Previous reports have demonstrated that measuring spleen stiffness provides valuable insights into the dynamics of portal hypertension4,5. By integrating spleen stiffness evaluation with LSM and other clinical parameters, clinicians can gain a more comprehensive understanding of the patient's liver disease and their risk of developing varices. These non-invasive techniques have contributed to the recent publication of the Baveno VII criteria, which suggest a cut-off of spleen stiffness at 40 kPa for screening VNT6. However, spleen stiffness measurement (SSM) using 2D-shear wave elastography (SWE) and point-SWE has shown a higher failure rate compared to liver stiffness measurement (LSM) and has the disadvantage of varying cut-offs depending on the machine. Additionally, due to the spleen's inherent stiffness relative to the liver, there is a limitation in potentially overestimating spleen stiffness when performing SSM with an SSM@50 Hz probe used for LSM measurement. Therefore, a novel spleen-dedicated probe using SSM@100 Hz has recently been developed, demonstrating higher accuracy than the existing SSM@50 Hz7.

The purpose of this study is to investigate whether SSM using the SSM@100 Hz probe can predict VNT and clinically significant portal hypertension (CSPH), and to explore its correlation with hepatic venous pressure gradient (HVPG) in patients with liver cirrhosis. Additionally, the study aims to identify factors associated with the failure of SSM.

Results

Patients’ characteristics

Finally, 257 patients were selected for analysis (Fig. 1). The baseline characteristics of the 257 patients analyzed in the study are presented in Table 1. The most common etiology of liver disease was HBV (40.08%), followed by alcohol (34.63%). All patients with a viral etiology (HBV or HCV) had achieved viral suppression through antiviral treatment. VNT was detected via endoscopy in 52 (20.23%) of the patients. The group with VNT (+) exhibited significantly greater spleen length, spleen volume, LSM, and SSM compared to the VNT (−) group (Table 1).

Figure 1
figure 1

Flow chart.

Table 1 Baseline characteristics of patients.

Factors related to the measurement failure of spleen stiffness

SSM failed in 15 out of 257 patients, resulting in a measurement success rate of 94.16%. Factors related to SSM failure were analyzed, and in multivariate analysis, the probability of SSM failure significantly increased when the spleen length was short (odds ratio [OR] 0.891, 95% CI 0.843–0.942, P < 0.001), the spleen volume was small (OR 0.961, 95% CI 0.943–0.980, P < 0.001), or the body mass index (BMI) was high (OR 1.143, 95% CI 1.003–1.330, P = 0.043) (Supplementary Table 1). Next, we derived cut-off values for each factor (Table 2). A spleen length less than 92.90 mm, a spleen volume less than 168.59 cm3, or a BMI higher than 24.17 kg/m2 were identified as cut-off points with a predicted probability of SSM failure of areas under the curve (AUC) 0.89, 0.94, and 0.65, respectively.

Table 2 Prediction of spleen stiffness measurement failure.

Accuracy of pre-existing prediction models for VNT

First, we compared the ability to predict VNT of previously published non-invasive tests (NITs) (Table 3). The NITs used for comparison included platelet count, liver stiffness, spleen parameter (length or volume), LSPS, PSR, and VRS. Additionally, we evaluated the accuracy of the criteria for screening VNT in the recently published Baveno VII guideline. The sensitivity of the Baveno VII criteria defined as LSM and platelets was 0.96 with AUC 0.70 (0.66–0.75), missed VNT rate 2.15%, and spared endoscopy rate 44.39%. When combining Baveno VII with SSM applied to LSM and platelet standards, the spared endoscopy rate significantly increased to 82.92%, but sensitivity and missed VNT rate did not improve compared to the existing Baveno VII model.

Table 3 Accuracy of pre-existing prediction models for varices needing treatment.

Identifying a new cut-off of SSM for VNT

We identified a new cut-off of SSM that could improve the diagnostic performance while maintaining an accepted risk of missed VNT (< 5%). In the final analysis, SSM 38.9 kPa was presented as the new standard (Table 4). The 38.9 kPa cut-off improved sensitivity from 0.88 to 0.92 compared to the existing 40 kPa cut-off, reduced the missed VNT rate from 3.59 to 2.25%, and increased the spared endoscopy rate from 78.53 to 84.87% (McNemar test p-value 0.019). Additionally, the AUC was 0.88, which was higher than the existing 40 kPa AUC of 0.83 and was statistically significant (Delong test p-value 0.017). When analyzed separately into viral etiology and non-viral etiology, the SSM 38.9 kPa cut-off showed superior performance compared to the SSM 40 kPa cut-off in both viral etiology (Delong test p-value 0.019) and non-viral etiology (Delong test p-value 0.032). Next, a sensitivity analysis was performed targeting cACLD patients. There were a total of 112 patients with cACLD, and baseline characteristics are listed in Supplementary Table 2. In an analysis of 112 cACLD patients with an LSM of 10 kPa or higher, who had not experienced prior decompensation, the newly proposed SSM model (with a cut-off of 38.9 kPa) demonstrated superior efficacy compared to the Baveno VII criteria, which uses LSM and platelet count (DeLong test p-value < 0.001) (see Supplementary Table 3). While the difference in AUC between the cut-offs of 38.9 kPa and 40 kPa was not statistically significant, the cut-off of 38.9 kPa resulted in a slightly lower missed VNT rate (2.60% vs. 2.82%) and a higher rate of spared endoscopies (84.26% vs. 77.52%) than the 40 kPa cut-off.

Table 4 Cut-off of spleen stiffness for predicting varices needing treatment.

The cut-off of 38.9 kPa for SSM was also effective in predicting future incidence of variceal bleeding. Upon follow-up over a median duration of 18 months, four patients with an SSM of 38.9 kPa or higher experienced variceal bleeding, whereas no patients with an SSM below 38.9 kPa did. Kaplan–Meier analysis further revealed a significant difference (Fig. 2, log rank p = 0.002).

Figure 2
figure 2

Incidence of new-onset variceal bleeding according to SSM cut-off.

Spleen stiffness comparison with HVPG

The correlation between HVPG and spleen stiffness or liver stiffness was analyzed for the 49 patients whose HVPG was measured. HVPG and liver stiffness (Fig. 3A, Pearson correlation coefficient 0.570, P < 0.001) or spleen stiffness (Fig. 3B, Pearson correlation coefficient 0.486, P < 0.001) showed a significant correlation. However, the scatterplot with a smooth line revealed that the correlation between HVPG and spleen stiffness was significant only at HVPG levels of 16 mm Hg or less (Fig. 3B). As a result, stratified analysis was conducted based on HVPG levels of 16 mm Hg. A significant correlation was observed between spleen stiffness and HVPG less than 16 mm Hg (Pearson correlation coefficient 0.608, P < 0.001), but this significance disappeared in HVPG levels of 16 mm Hg or higher (Pearson correlation coefficient 0.150, P = 0.624).

Figure 3
figure 3

Pearson correlation analysis (A) between liver stiffness and HVPG, (B) between spleen stiffness and HVPG.

Accuracy of pre-existing prediction models and new cut-off of SSM for CSPH

We analyzed how well the previously published NIT model and the new SSM cut-off of 38.9 kPA predicted CSPH (over 10 mm Hg of HVPG) (Table 5). The 38.9 kPa cut-off, which specifically derived and validated for diagnosing VNT, was also applied to predict CSPH. As expected, this cut-off value exhibited lower sensitivity and negative predictive value (NPV) for prediction of CSPH than when applied to VNT.” For SSM, cut-offs of 40 kPa, 50 kPa, and 38.9 kPa are used to rule in CSPH, showing sensitivities of 79%, 53%, and 84%, respectively, and specificities of 91%, 100%, and 90%. The accuracy for these cut-offs ranges from 63 to 86%, with the 38.9 kPa cut-off demonstrating the highest accuracy and area under the curve (AUC). The Baveno VII model outlines criteria for ruling out and ruling in CSPH with varying degrees of sensitivity, specificity, and AUC values.

Table 5 Accuracy of pre-existing prediction models for CSPH.

Discussion

SSM began to gain attention relatively recently compared to LSM and is now an officially recommended tool in the 2021 Baveno VII guideline. However, the current evidence level remains relatively low at C2, with limited research findings, particularly regarding the optimal cut-off value of SSM for non-viral etiologies. The present study analyzed SSM@100 Hz and endoscopy data from 257 cirrhosis patients, providing insights into: (1) the success rates of SSM and factors associated with measurement failures, (2) the predictive ability of newly proposed SSM cut-off 38.9 kPa for VNT and CSPH compared to other NITs, and (3) the correlation between SSM and HVPG.

The first finding of our study is the success rate and factors related to SSM failure. The success rate using SSM@100 Hz in our study was 94.16%, significantly higher than the success rates reported for pSWE or 2D-SWE. SSM is known to be more challenging to measure than LSM, with some studies reporting success rates as low as 52.9%8,9. The success rate of SSM@100 Hz observed in the present study was comparable to the 95% success rate for LSM and 70% success rate for SSM reported in previous studies10. This success rate aligns well with recently published prospective studies11,12. The factors associated with SSM failure were consistent with those reported in prior research13, including small spleen length or volume and high BMI. Therefore, SSM might be less suitable for patients with normal spleen volume who have not progressed to advanced fibrosis. Obesity also affects SSM, as it does with LSM.

The second significant finding of our study was the efficacy of SSM@100 Hz as a predictive tool for VNT. Notably, the accuracy of SSM was higher than that of other NITs. There are two primary reasons for the higher accuracy of SSM. First, SSM is less affected by liver necroinflammation compared to LSM14,15. Second, SSM more directly reflects HVPG, making it a more accurate hemodynamic marker for acute changes compared to LSM14,16,17. In fact, SSM has shown to accurately predict hemodynamic response after NSBB administration, a prediction not achieved by LSM18,19. The clinical utility of SSM remained high even when different measurement methods or probes were used5,20. A meta-analysis of studies evaluating SSM with 2D-SWE and point-SWE found that SSM's predictive rate for high-risk esophageal varices (HREV) was AUC 0.87, and for any esophageal varices (E. varix) was AUC 0.904, which is similar to the findings of our study. Both our study and the meta-analysis reported high negative predictive values (NPVs) for SSM, indicating its potential as a valuable screening tool to avoid unnecessary endoscopy4,21,22. In particular, our study verified the newly proposed cut-off SSM@100 Hz 38.9 kPa. Our study demonstrated a marginally enhanced performance of the SSM cut-off at 38.9 kPa compared to the conventional 40 kPa threshold. However, 1.1 kPa difference between two is small. Therefore, the existing Baveno criteria is still clinically relevant and useful.

Comparatively, our results resonate well with the burgeoning body of literature exploring the use of the 100-Hz SSM probe for liver cirrhosis patients. Notably, previous studies have similarly reported on the efficacy of SSM in predicting variceal outcomes, providing a consensus on the reliability of SSM measurements20,23. However, our study extends these findings by suggesting a refined cut-off point that potentially enhances diagnostic accuracy and patient care, a comparison that underscores the importance of continuous evaluation and adaptation of diagnostic thresholds. Moreover, our findings highlight the paramount importance of diagnosing CSPH over merely identifying VNT. This approach shifts the diagnostic focus towards a more holistic understanding of the patient's portal hypertension status, facilitating earlier and potentially more effective interventions.

While we emphasized the superior accuracy of SSM compared to NITs, we acknowledge the need to discuss other algorithms' performances as detailed in our methods and Table 3. Notably, the LSPS showed slightly higher accuracy than the combined Baveno VII-SSM model. However, despite similar accuracies of various models, the Baveno VII-SSM is supported by extensive evidence and validations, reinforcing its use in clinical guidelines.

The role of etiology in the diagnostic accuracy of SSM presents a nuanced view. Our study observed a low rate of patients with Hepatitis C Virus (HCV) infection, which contrasts with the demographic predominantly seen in older studies. This shift in patient demographics could reflect changes in disease prevalence or treatment outcomes over time. On the other hand, the lower rate of metabolic associated steatohepatitis liver disease (MASLD) patients in our cohort represents a limitation and points towards the need for further research encompassing a broader spectrum of liver diseases. Remarkably, the etiology did not influence the diagnostic accuracy of SSM in our study, a finding that emphasizes the robustness of SSM as a predictive tool across different liver disease etiologies.

Finally, our study demonstrated a correlation between SSM and HVPG. However, this correlation was more pronounced in the HVPG < 16 mm Hg group, and its significance decreased when HVPG was higher than 16 mm Hg. This observation is likely due to the development of various porto-systemic collaterals in cases of HVPG 16 mm Hg or higher, indicating severe portal hypertension or impending decompenstation. Stefanescu et al.13 also reported a high correlation between HVPG and SSM, but upon visual examination of the scatter plot, it becomes evident that the correlation diminishes at HVPG levels of 16 mm Hg or higher. Similar findings have been reported for the correlation between LSM and HVPG, showing a stronger association at lower HVPG levels24,25. However, given that HVPG was measured in only 49 patients in our study, it is essential to supplement these results with data from a larger number of patients in the future.

Our study has several strengths, including the use of a newly developed SSM@100 Hz probe and providing information on the cut-off and accuracy of SSM for non-viral etiology, which was previously lacking. Additionally, by presenting factors and criteria related to SSM measurement failure, which were not well-reported in other studies, we aimed to enhance the clinical utility of SSM. However, there are several limitations to consider. First, retrospective designs may introduce selection bias. Second, since only SSM@100 Hz was used in all patients, the criteria presented in this study may not be equally applicable to point-SWE or 2D-SWE.

In conclusion, our study not only reaffirms the efficacy of SSM@100 Hz in predicting the presence of VNT and variceal bleeding but also proposes a new diagnostic cut-off that could refine current practices. By drawing parallels with existing literature and addressing the significance of CSPH diagnosis and the impact of etiology, we contribute to the evolving landscape of non-invasive liver disease diagnostics.

Materials and methods

Patients

This retrospective cohort study was conducted from January 2020 to December 2022. The inclusion criteria for this study were as follows: (1) patients diagnosed with compensated liver cirrhosis by imaging or pathological examination, (2) age ranging from 19 to 70 years, and (3) esophagogastroduodenoscopy performed within 6 months from the time of SSM. Patients who had experienced previous episodes of decompensation but currently presented with compensated liver cirrhosis were included in the analysis. The exclusion criteria were as follows: (1) patients with decompensated cirrhosis at the moment of inclusion, (2) aspartate transaminase (AST) or alanine transaminase (ALT) levels exceeding 200 IU/L, (3) any grade of gastric varices, (4) previous history of transjugular intrahepatic portosystemic shunt, balloon-occluded retrograde transvenous obliteration, or plug-assisted retrograde transvenous obliteration, (5) patients with blood diseases that may affect SSM, (6) patients with difficulty in SSM due to severe obesity or ascites, and (7) patients with untreated or uncontrolled hepatocellular carcinoma and cholangiocarcinoma.26,27 The study protocol was approved by the Institutional Review Boards of Soonchunhyang University Bucheon Hospital (IRB number: SCHBC 2023-01-015, Date of registration: 03-Feb-2023). The study adhered to the ethical guidelines of the World Medical Association Declaration of Helsinki. Written consents were waived by the IRB of Soonchunhyang University Bucheon Hospital due to the retrospective nature of the study.

Spleen stiffness measurement

SSM was performed using a newly developed 100 Hz probe (SSM@100 Hz) with the FibroScan® Expert 630 model (Echosens, France). After confirming the spleen hilum area with the ultrasound probe that came with the Fibroscan® machine, we placed the probe on the relevant area and measured the spleen stiffness 10 times consecutively. The SSM measurements were conducted by two physicians with extensive experience in over 100 SSM cases. The criteria used for spleen stiffness measurement were similar to those employed for liver stiffness evaluation, requiring a minimum of 10 measurements, a success rate of at least 60%, and an interquartile range (IQR) less than 30% of the median value13,28,29. Ultrasound examinations were performed to measure the longitudinal spleen length, thickness, and width by three experienced physicians with more than 10 years of pertinent experience. The spleen volume was calculated using the formula: π/6 × spleen length × thickness × width30.

Endoscopy and HVPG

All patients underwent endoscopy. A standard esophagogastroduodenoscopy was performed by three experienced physicians with more than 10 years of experience. The endoscopic findings related to esophageal varices were documented, including the variceal grade and the presence of red signs.

If HVPG was measured within 6 months of SSM, HVPG values were also collected. The method for measuring HVPG was consistent with the protocol published previously31,32. Two experienced interventional radiologists, each with over a decade of expertise, conducted the HVPG measurements. The preferred access route was through the right jugular vein, where a 6-French balloon catheter was inserted into the right hepatic vein to measure the free hepatic venous pressure (FHVP). The wedge hepatic venous pressure (WHVP) was determined by inflating the balloon catheter within the right hepatic area. The HVPG was then derived by calculating the difference between the WHVP and the FHVP.

Prediction models for VNT

VNT was defined as the presence of any of the following three conditions: (1) grade 1 esophageal varices with red color sign, or (2) medium to large esophageal varices (grade 2 or 3)33,34. Regarding VNT prediction, our study compared the following previously published scoring systems: LSM evaluated by the FibroScan® Expert 630 model, LSPS (platelet count to longitudinal spleen diameter ratio)35, PSR (platelet count/spleen diameter ratio)36, and VRS (varices risk score)37. Additionally, the recently published Baveno VII criteria using LSM and PLT criteria (LSM ≤ 20 kPa and platelet ≥ 150 k) or SSM single criteria (SSM ≤ 40 kPa) using SSM@100 Hz were also used for comparison6. The Combined Baveno VII criteria were defined as initially applying the criteria of LSM < 20 kPa and PLT > 150 k; if these criteria were not met, then applying SSM < 40 kPa was the next step. For VNT prediction, the analysis initially encompassed the entire patient group, followed by a sensitivity analysis conducted specifically on the compensated advanced chronic liver disease (cACLD) patient group. cACLD was defined as a patient having an LSM of 10 kPa or higher and no history of decompensation.

Prediction models for CSPH

CSPH was defined as patients with HVPG of at least 10 mm Hg. Regarding CSPH prediction, we compared the following previously published scoring systems: LSM evaluated by the FibroScan® Expert 630 model, SSM single criteria (SSM ≤ 40 kPa), Baveno VII model, Baveno VII-SSM single cut-off model, and Baveno VII-SSM dual cut-off model38. In the Baveno VII model, CSPH was ruled out for patients with an LSM of 15 kPa or lower and a platelet count of at least 150 k, and ruled in for those with an LSM over 25 kPa. In the Baveno VII-SSM single cut-off model, CSPH was ruled out if at least two of the following criteria were met: an LSM of 15 kPa or lower, a platelet count of 150 k or higher, and an SSM of 40 kPa or lower; it was ruled in if at least two of these conditions were present: an LSM greater than 25 kPa, a platelet count less than 150 k, and an SSM over 40 kPa. The Baveno VII-SSM dual cut-off model applied the same criteria, with a cutoff of less than 21 kPa for SSM to rule out, and more than 50 kPa to rule in, CSPH.

Statistical analysis

We assessed the effectiveness and performance of noninvasive tests using metrics including sensitivity, specificity, accuracy, positive and negative predictive values, AUC, missed VNT rate, and the rate of avoided endoscopies. The rate of missed VNT was defined as the number of patients having VNT divided by the number of patients who ruled out VNT.

Initially, we verified the efficacy of existing published parameters, such as the Baveno VII criteria, and subsequently introduced a new threshold aimed at enhancing diagnostic performance while keeping the risk of missed VNT below 5% (with a negative predictive value greater than 95%). The DeLong test was employed to compare the AUC across different prediction models. McNemar test was used to compare the rate of spared endoscopies by each criteria. Continuous baseline characteristics were presented as means (± standard deviations) and compared using Student’s t-test. Categorical characteristics were presented as counts and percentages and compared between groups using the chi-squared test. Logistic regression analysis was conducted to evaluate the factors related to SSM failure. Factors that showed significance in the univariate analysis were included in the multivariate analysis. Statistically significant differences were defined as P < 0.05. All statistical analyses were performed using R version 4.3.1 (The R Foundation for Statistical Computing, Vienna, Austria).

Ethics declarations and Informed consent statement

The study protocol was approved by the Institutional Review Boards of Soonchunhyang University Bucheon Hospital (IRB number: SCHBC 2023-01-015, Date of registration: 03-Feb-2023). The study adhered to the ethical guidelines of the World Medical Association Declaration of Helsinki. Written consents were waived by the IRB due to the retrospective nature of the study.