Patients
This retrospective cross-sectional study was reviewed and approved by the Biomedical Research Ethics Committee of two institutions, and the requirement for patient consent was waived.
The patient enrollment process and excluded criterion for this study are showed in Fig. 1. We retrospectively analyzed 329 patients at two different hospitals in China. A total of 185 consecutive patients with COVID-19 pneumonia, 97 men (mean age, 53.9 years; age range, 26–88 years) and 88 women (mean age, 58.3 years; age range, 22–83 years), were included at Wuhan Huoshenshan hospital between February 11 and 25, 2020. Seven consecutive patients with COVID-19 pneumonia, 5 men (mean age, 47.8 years; age range, 28–75 years) and 2 women (mean age, 59 years; aged 47 and 71), were included at Changhai hospital between January 25 and February 9, 2020. One hundred thirty-seven consecutive patients with other types of viral pneumonia, 72 men (mean age, 54.9 years; age range, 19–92 years) and 65 women (mean age, 52.2 years; age range, 20–95 years), were included at Changhai hospital between April 2011 and January 2020. Viral nucleic acid detection was used to confirm COVID-19 pneumonia and the other types of viral pneumonia. All patients were divided into training set and validation set. The prediction model was developed for a training set that consisted of 136 patients with COVID-19 pneumonia who had positive initial result by viral nucleic acid detection, and 103 patients with the other types of viral pneumonia. Ninety consecutive patients constituted an independent validation sample of 56 patients with COVID-19 pneumonia and 34 patients with other types of viral pneumonia. Fifty-six patients with COVID-19 pneumonia included 49 patients who had initial negative result by viral nucleic acid detection but positive result on CT from Wuhan Huoshenshan hospital and 7 patients who came from Changhai hospital. All clinical results were extracted from the patients’ electronic medical records in the two-hospital information system. All patients with COVID-19 pneumonia were divided into four clinical types including mild, moderate, severe, and critical types based on the clinical classification of COVID-19 pneumonia from the 7th edition of the National Commission of China classification [18].
CT scanning
Pulmonary CT was performed using 64-, 256-, and 128-slice multidetector row CT scanners (64: Somatom, Siemens Healthcare; 256: Brilliance-16P, Philips Healthcare; 128: uCT 760, United Imaging Healthcare). CT scans were obtained with the following parameters: 120 kV, adaptive tube current, a matrix of 512 × 512, and a beam collimation of 64 × 0.6 mm2, 256 × 0.6 mm2, and 128 × 0.6 mm2. Non-enhanced CT at a slice thickness of 1.0 mm, 1.0 mm, and 0.625 mm was performed, respectively. Images were captured at window settings that allowed viewing of the lung parenchyma (window level and width, − 600 to − 700 HU and 1200–1500 HU, respectively) and the mediastinum (window level and width, 20–40 HU and 400 HU, respectively). The scanning range covered the area from the height level of the superior aperture of the thorax to the diaphragm.
Imaging analysis
We used the original cross-sectional images for analysis. All images were analyzed by two chest radiologists (reader 1 and reader 2, both with 8 years of experience) who were blinded to the clinical details. When their results were not consistent, the final results were determined by consensus.
All lesions were evaluated for the following parameters: (a) location: right, left, or bilateral lungs; (b) distribution: peripheral, central, or diffuse; (c) attenuation: ground glass attenuation including ground glass opacity (GGO) and crazy-paving pattern, consolidation, and mixed patterns of ground glass attenuation and consolidation [19, 20]; (d) maximum lesion range: ≤ 5 cm, 5–10 cm, > 10 cm, only for the biggest one; (e) lobe involvement: the five lung lobes were divided into categories of ≤ 2 lobes, 2–4 lobes, and = 5 lobes; (f) number of lesions: 1, 2, 3, or more; (g) air bronchogram; (h) hilar and mediastinal lymph nodes enlargement: short-axis diameter of a lymph node > 10 mm [21]; and (i) pleural effusion.
Radiomics workflow
The radiomics workflow included (a) image segmentation, (b) feature extraction, (c) feature reduction and selection, and (e) predictive model building.
In this study, we used the artificial intelligence software (uAI-Discover-NCP R001, United Imaging Healthcare, China) to segment images and extract the radiomics features from the lung parenchyma window. A total of 1409 2D and 3D features from primary lesion were extracted. Feature selection comprised three steps: variance analysis, spearman correlation analysis, and least absolute shrinkage and selection operator method (LASSO) logistic regression algorithm. This method has been shown to be effective in prior radiomics studies [22]. A retrospective power analysis was performed. The sequential method of Bonferroni correction was applied to adjust the baseline significance level (α = 0.05) for multiple testing bias [23, 24]. Finally, radiomics scores (Rad-scores) were calculated for each patient via a linear combination of selected features that were weighted by their respective coefficients. More information about radiomics feature extraction methodology is reported in Supplementary digital content 1.
We developed a clinical model using significantly associated CT characteristics including location, distribution, attenuation, maximum lesion range, lobe involvement, number of lesions, air bronchogram, hilar and mediastinal lymph node enlargement, and pleural effusion. Consequently, the mixed model, which combined the Rad-score and significantly associated CT characteristics, was developed and we hoped to improve the accuracy of predicting COVID-19 pneumonia.
Statistical analysis
Normal distribution and variance homogeneity tests were performed on all continuous variables; those with a normal distribution are expressed as the mean and standard deviation while those with non-normal distributions are expressed as medians and ranges. First, we examined group differences in all variables. The Kruskal-Wallis H test (skewed distribution) and chi-square tests (categorical variables) were used to determine statistical differences between the two groups. Second, univariable regression analysis was applied to estimate the effect size of the relationships between all variables and the two groups of viral pneumonia. The group with other types of viral pneumonia was considered as a reference group. Third, multivariable logistic regression analysis was conducted to develop a model for predicting COVID-19 pneumonia in the primary sample, and a nomogram was then constructed. The discrimination performance of established models was quantified by the receiver operating characteristic curve. Area under the curve (AUC) estimates in the prediction models were compared using the Delong non-parametric approach [25]. Calibration curves were plotted via bootstrapping with 500 resamples to assess the calibration of the radiomics model, accompanied by the Hosmer-Lemeshow goodness-of-fit test. The performance of the radiomics model was then tested in an independent validation sample by using the formula derived from the training set. Finally, to estimate the clinical usefulness of the nomogram, decision curve analysis (DCA) was performed by calculating the net benefits for a range of threshold probabilities.
A two-tailed p value less than 0.05 was considered statistically significant. All analyses were performed with R (R version 3.3.3; R Foundation for Statistical Computing; http://www.r-project.org).