Introduction

Osteoporosis is a bone condition characterized by a decrease in bone density and worsening quality of bone microarchitecture. The most common screening modality for this disease is dual-energy X-ray absorptiometry (DEXA). However, the screening rate for osteoporosis remains low. DEXA is not routinely performed before spinal surgery in clinical practice due to its high medical costs and radiation exposure [1,2,3]. Osteoporosis may lead to severe complications after spinal surgery, such as fusion failure [1]. An increased screening rate may reduce osteoporosis-related complications.

CT is a common diagnostic imaging modality for the preoperative assessment in lumbar spine surgery. Using CT scans to measure vertebral HU values has shown promise in previous studies. A low HU value may be associated with complications, such as pseudoarthrosis after lumbar fusion surgery [4]. Schreiber et al [5] found that the HU value obtained from a routine CT scan significantly correlates with DEXA results, and Zou et al [6] found that the HU value is a good predictor of pedicle screw loosening after lumbar fixation. Furthermore, HU measurement allows clinicians to screen for osteoporosis preoperatively without additional radiation exposure and medical costs [7]. However, HU measurements did not include the cortical bone in the method described; therefore, in the real world, this assessment may lose a significant amount of information needed to assess bone quality [5]. The mean HU value of the vertebral region of interest (ROI) may differ according to different observers. Cancellous bone is heterogeneous; therefore, HU based on axial sections may not accurately reflect the bone quality [2, 5, 8, 9].

Radiomics features are large amounts (200+) of quantitative features mathematically extracted from medical images, believed to reflect intra-region heterogeneity [10]. Radiomics hypothesizes that these quantitative features may provide unknown information related to specific diseases [10,11,12,13]. In oncology, radiomics is mainly used for the non-invasive estimation of the clinical diagnosis and prognosis [14]. Recently, Wu et al [15] applied a CT-based radiomics method to predict the outcome of COVID-19 patients, and Mookiah et al [16] used the radiomics features derived from multidetector CT to evaluate the trabecular bone quality and osteoporosis in the laboratory. However, no study has assessed bone quality based on the high-throughput radiomic features extracted from routing CT scans. In this study, radiomic features were extracted from the 3D segmentation of the entire vertebral body, comprising both the cancellous and cortical bones to determine the efficacy of a radiomics model based on routine preoperative lumbar spine CT scans in screening for osteoporosis.

Materials and methods

Study participants

The local Ethics Committee of our institution approved this retrospective study (approval no. 2021KJCX038). Consecutive patients were included by reviewing the database of our department from January 2020 to June 2021. We identified 99 patients who had lumbar spine CT and DEXA within 14 days. Exclusion criteria were (1) related vertebral body tumor, (2) ankylosing spondylitis, and (3) diffuse idiopathic skeletal hyperostosis.

Using DEXA, 34 of the 99 patients in the study were diagnosed with osteoporosis. For each patient, the L1–L4 vertebral bodies were chosen for the analysis; thus, a total of 396 vertebral bodies were initially included in this study. However, 10 of these were excluded for severe metal artifacts affecting the image evaluation. Among the 386 vertebral bodies considered, 127 (32.9%) had osteoporosis, based on DEXA results. These vertebral bodies were randomly divided into two groups at a 7:3 ratio using a stratified random sampling method. Thus, the training cohort included 270 participants while the test cohort included 116. A pipeline depicting patient selection is displayed in Fig. 1. The clinical characteristics were obtained from the patient records in our hospital.

Fig. 1
figure 1

Flowchart for selecting the study population

Image acquisition

All CT images and DEXA results were collected from our department’s picture archiving and communication system (PACS). Each patient underwent a lumbar spine CT scan using a helical 64-channel CT scanner. The following CT scanning parameters were used: tube voltage of 120 kVp, tube current of 300 mA, and slice thickness of 1.25 mm at 0.625-mm intervals. The DEXA scans were performed in each patient’s spine and hip, and the lumbar vertebrae were divided into two groups based on diagnosis: osteoporosis and non-osteoporosis groups.

Image segmentation, radiomic feature extraction, and HU measurement

3D Slicer (\https://www.slicer.org), a free and open-source software, was used for the 3D segmentation of the vertebral bodies. The images were segmented semi-automatically in the vertebral region using the segmentation threshold and seed growing module. The posterior portion of the vertebral body was removed to alleviate the influence of the vertebral pedicle. An example of segmentation is presented in Fig. 2.

Fig. 2
figure 2

Semi-automatic segmentation of the vertebral body. a Segmentation on a sagittal slice. b Segmentation on a coronal slice. c 3D volumetric reconstruction. d The posterior potion of the vertebral body was excluded to reduce the influence of the vertebral pedicle

Among the features obtained from the segmented vertebral bodies, 1040 were automatically extracted by applying the 3D Slicer pyradiomics module. Resampling and z-score normalization were performed to guarantee the repeatability of the feature extraction. The extracted radiomic features are provided in Supplementary material S1.

The HU values were measured from PACS data using a technique previously described [5]. The ROIs were positioned on the midbody axial CT image for each vertebra. A ROI was defined as a single maximally sized ellipse including the maximum trabecular bone while avoiding the cortical bone of the vertebral body. The measurement was performed in one slice per vertebra. The mean HU value obtained was used for further investigation.

A total of 36 vertebrae were randomly selected to evaluate the reliability of HU measurements and radiomic features. One month after the first author (Y.W.J.) measured the HU values, the radiomic features of the vertebrae were extracted. The co-author (R.W.) repeated the measurement and extraction process to evaluate the inter-observer reliability. The two observers were blinded to the clinical information when performing the measurements. The intraclass correlation coefficient (ICC) was used to determine the intra- and inter-observer variability.

Establishment of radiomic signature and HU models

Dimension reduction was performed before the radiomic signature construction to eliminate overfitting or any type of bias in our radiomic model. First, the radiomic features with ICCs (both in inter- and intra-observer classes) > 0.75 were regarded as reliable and selected. The minimum redundancy-maximum relevance (mRMR) algorithm was used to determine the relevance and redundancy of each feature. The 30 features with the highest-ranked mRMR were selected to construct the model. The optimal feature subset was then chosen using the least absolute shrinkage and selection operator (LASSO) regression model (a selection method for linear regression). This method can shrink the regression coefficients and reduce some to zero; the remaining coefficients were applied to construct the final model. Tenfold cross-validation was used to tune the regularization parameter (λ). Using the following formula, the radiomics score (Rad score) of each vertebral body was calculated.

$$ Rad\ score={\sum}_{i=0}^n\mathrm{C}i\times \mathrm{X}i+b $$

where Xi represents the ith selected feature, Ci is the corresponding feature coefficient, and b is the intercept.

The radiomic signature model was constructed using the radiomic features extracted from the training set. The model’s capability to identify osteoporosis was assessed in both the training and test sets. In contrast, a ROC curve based on HU values was used to establish the separation criteria between osteoporosis and non-osteoporosis. Using the ROC curve, the most appropriate HU cutoff value was determined by Youden’s index. A logistic regression analysis based on HU was also conducted.

Evaluation of the HU and radiomics signature models’ performance

The performance of the HU and radiomic signature models was assessed according to the AUC in both training and test sets. We use decision curve analysis (DCA) to determine the clinical benefit of the two models proposed.

Statistical analysis

All statistical analyses were conducted using R statistical software (ver. 3.4.2, http:// www.r-project.org). The chi-squared test or Fisher’s exact test was used to compare the categorical data between the two groups. “mRMRe” package in R was used to implement the mRMR algorithm. The “glmnet” package was used to implement the LASSO algorithm. The ROC curves were plotted using the “pROC” package. The DeLong test was then used to compare the ROC curves of the two models. Finally, the “rmda” package was used to perform the DCA. Statistical significance was set at p < 0.05.

Results

Clinical characteristics

There were no differences in clinical characteristics between the training and test cohorts (Table 1).

Table 1 Clinical characteristics in the training and validation sets

Establishment of the radiomics signature

From the lumbar spine CT images, 1040 quantitative features were initially extracted, and 846 had an ICC > 0.75. The results of the inter- and intra-observer agreement analyses are provided in Supplementary material S2. Observer 1 performed the segmentation and radiomic extraction on the samples. After applying the mRMR algorithm, 30 features were retained for the LASSO regression method. Tenfold cross-validation was applied to select the tuning parameter (λ) in the LASSO regression (Fig. 3). With an optimal λ of 0.031 under the 1-SE criteria, the remaining 12 features (Table 2) were used to establish the radiomics signature. As Table 3 shows, the Rad score of the osteoporosis group (0.62 ± 0.18) significantly differed (p < 0.05) from that of the non-osteoporosis group (0.18 ± 0.22). The total Rad scores are shown in Supplementary material S3.

Fig. 3
figure 3

The LASSO regression model was used for the selection of radiomics features. a A 10-fold cross-validation was used to select the tuning parameter (λ). The y-axis correspond the binomial deviance while the x-axis correspond log (λ). The two vertical dotted lines represented the one standard error of the minimum (1-SE) and minimum criteria, respectively, for the specific values. b Thirty radiomic features coefficient profile versus the log (λ) sequence

Table 2 The 12 features chosen for the radiomics model
Table 3 Comparison between the HU and Rad score in the osteoporosis and non-osteoporosis subject groups

Evaluation of the HU and radiomics signature models’ performance

The ROC curves of the HU and radiomics signature models are illustrated in Fig. 4. The DeLong test showed that the radiomic signature model was more effective than the HU model in predicting osteoporosis. In the training set, the AUC values for the radiomic signature and HU models were 0.960 and 0.883, respectively (p < 0.05), while in the test set, they were 0.915 and 0.836, respectively (p < 0.05) (Table 4). Based on the DCA, the radiomics model outperformed the HU model (Fig. 5).

Fig. 4
figure 4

ROC analysis showing that the performance of the radiomics signature model (red line) was better than that of the HU model (blue line) in both the training (a) and test (b) cohorts

Table 4 Performance of the HU and radiomic signature models
Fig. 5
figure 5

DCA revealed that the Rad score model (blue line) was more advantageous than the HU model (red line). The x axis corresponds to the threshold probability while the y axis corresponds to the net benefit. The gray line represents the assumption that all lesions are due to osteoporosis. The black line indicates that no lesion is due to osteoporosis

Discussion

The present study demonstrates that a radiomic signature model based on preoperative lumbar spinal CT can be used to diagnose osteoporosis. In both training and test sets, the radiomic signature model showed excellent prediction performance. Moreover, the radiomic signature model was more efficient for detecting osteoporosis than the HU model, based on AUC and the DCA curve. A positive result from the radiomic model could alert physicians to perform additional DEXA to confirm the presence of osteoporosis. In contrast, further testing and treatment may be unnecessary when the radiomic model shows a low probability of osteoporosis. This method may decrease medical costs and radiation exposure.

In the radiomic signature model development, 846 candidate radiomic features with ICCs > 0.75 were reduced to 12 features by combining the mRMR and LASSO methods. Combining multiple imaging features in the radiomic signature model can successfully stratify patients into low- or high-score groups with significantly different probabilities of osteoporosis. Additional examinations or treatment should be considered in patients with a higher probability of osteoporosis. Some selected features, such as skewness in the first-order feature class, appear to be independently related to osteoporosis; however, it is challenging to reliably correlate a single feature with the pathological state [17]. Therefore, constructing a multi-feature model is a more feasible approach for osteoporosis screening [18].

Osteoporosis is one of the most important factors associated with certain complications after spinal fusion surgery, including screw loosening, non-union, and cage subsidence; surgeons should be aware of its presence in patients. Some elective spine surgeries could be postponed after pharmacological intervention for osteoporosis. Bone mineral density can be derived from quantitative CT (QCT), ultrasound, and DEXA. However, the World Health Organization defines osteoporosis solely based on DEXA measurements. According to the International Society for Clinical Densitometry (ISCD) recommendations, the DEXA measurement should include the spine and hips, and osteoporosis should be diagnosed based on the lowest T-score between the spine and hips. QCT, which focuses on the cancellous bone, can be used to determine bone quality. The clinical application of QCT is often hindered by high economic costs and the required specific training [19]. As QCT is not routinely applied in our hospital, we could not use QCT images in this retrospective study.

At present, HU measurement is still the most frequently used method for opportunistic osteoporosis screening; therefore, it is reasonable to compare the radiomic method to the HU method. Da Zou et al [20] used the HU value to screen osteoporosis in patients with lumbar degenerative disease with 88.5% specificity and 60.8% sensitivity. Cohen et al [21] performed a validation study of opportunistic osteoporosis screening with routine CT on a multiethnic Middle Eastern population. The sensitivity obtained was 76%, and the specificity was 74%. The efficacy of the HU model in this study is similar to that of previous studies. Furthermore, both the Delong test and DCA analysis showed that the radiomics model outperformed the HU model.

Radiomics analysis based on routine preoperative lumbar CT scans could provide an alternative for bone health screening. However, this method cannot replace DEXA, which is still the standard examination recommended by the ISCD. Nonetheless, our method could alert spine surgeons to investigate and possibly treat osteoporosis. Currently, the procedure for radiomics analysis may seem complex; however, the progress of automated segmentation might allow the integration of the feature extraction and calculation into a software program. The radiomics method will eventually become a one-button operation. The present study demonstrated that radiomics analysis based on lumbar spine CT scans might be an effective method to screen for osteoporosis before surgery.

The study has some limitations. First, it was a retrospective study performed at one institution. Further prospective studies with larger sample sizes are warranted. Second, we only focused on the correlation between the radiomic signature and DEXA results, which may not reflect the risk of osteoporosis-related complications, such as pseudarthrosis after lumbar spinal fusion. Finally, the radiomic method requires additional software operations compared to the HU method. Nonetheless, we believe that the radiomic method is worthwhile because it includes more imaging information, such as the cortical bone. The radiomic features extracted from vertebral bodies may be useful for analyzing other osteoporosis-related diseases.

In conclusion, we developed and validated a CT-based radiomic signature model to detect osteoporosis before lumbar spinal surgery. The radiomic method may provide valuable information and facilitate surgical decision-making without additional medical costs and radiation exposure.