Background

Inflammatory bowel disease (IBD) is a recurrent chronic intestinal inflammatory condition, mainly including Crohn's disease (CD) and ulcerative colitis (UC). The worldwide prevalence of IBD continues to rise, affecting an estimated seven million individuals [1]. Diagnosis difficulties, especially in early stages with intermittent symptoms, may lead to underestimation of actual cases. Delayed diagnosis is common, with reports indicating approximately 10% of patients endured gastrointestinal symptoms for at least five years before confirmation [2]. Long-term accumulation of inflammation can cause progressive intestinal damage, thereby elevating the risks of strictures, penetrating lesions, and even malignancy [3]. The association between delayed diagnosis and increased surgery risk has been demonstrated [4]. Moreover, since CD and UC differ in disease behavior and burden, CD patients usually face greater risks of complications and bowel resection, requiring longer hospital stays and higher medical costs [5, 6]. Therefore, timely and accurate diagnosis is crucial, as it could potentially lead to earlier initiation of biologic therapy and halting disease progression.

The diagnosis of IBD currently relies on a comprehensive assessment of the clinical data and various examination results by a multidisciplinary team. However, atypical symptoms, endoscopic limitations, and low histological positive rates reduce the reliability of these results [7,8,9]. "Endoscopic Skipping," as was previously described, occurs when endoscopy fails to detect proximal small bowel inflammation or inflammation confined to the intestinal wall [8, 10]. A study including 1,471 CD patients found that 8% had ileal active lesions detectable only by computed tomography and magnetic resonance enterography (CTE/MRE), suggesting that imaging severity should be considered when discrepancies exists between radiology and ileoscopy [11]. These findings demonstrate the value of radiological findings in the early detection of IBD. Notably, compared to endoscopic mucosal healing, achieving transmural healing as seen on cross-sectional imaging is increasingly recognized as a more important treatment goal in CD, reflecting a deeper and more comprehensive response to therapy [12]. CTE/MRE is routinely used in current guidelines to assess discontinuous segmental lesions, ulcers, and fistulas, complementing endoscopic and histological findings [13]. In addition to displaying intestinal and extraintestinal lesions, CTE/MRE also provides anatomic information that can be used for body composition analysis [14, 15]. Previous studies have successfully utilized radiological features extracted from visceral adipose tissue (VAT) to distinguish CD from UC [16]. Additionally, VAT quantitative parameters obtained by CTE are related to more complex behavior and worse prognosis in IBD [15, 17, 18]. Nonetheless, the previous complex analysis, especially in differentiating IBD subtypes, and the lack of interpretation of intestinal lesions in CT images, has hindered the adoption of the aforementioned conclusions in clinical practice.

Currently, established imaging reporting and grading systems, such as LI-RADS for the liver [19], BI-RADS for the breast [20], and PI-RADS for the prostate [21], have significantly enhanced disease recognition and diagnostic interpretation by radiologists. Developing a similar grading system for IBD patients, like IBD-RADS, would further leverage the strengths of cross-sectional imaging in detecting inflammation confined to the intestinal wall, aligning with current diagnostic and therapeutic goals. Additionally, integrating it into clinical workflows would enable radiologists to focus on critical signs and produce more targeted imaging reports for suspected cases. Gastroenterologists would also benefit from standardized reports, enhancing communication among departments and providing crucial evidence, especially in cases with atypical endoscopic results. However, such an approach is lagging in the diagnosis of IBD. Therefore, the purpose of this study was to establish an IBD imaging report and data system (IBD-RADS) for the identification of different subtypes using the information provided by baseline imaging data of IBD patients.

Methods

Ethics statement

The multicenter retrospective study followed the Declaration of Helsinki, analyzed patients with IBD at three hospitals in China (Tongji Hospital of Huazhong University of Science and Technology, West China Hospital of Sichuan University, and the First Affiliated Hospital of Fujian Medical University), and obtained the consent of the ethics committee of each institution, where informed consent was waived due to the anonymity of the data.

Patient cohorts

We collected two patient cohorts, including a derivation cohort for IBD-RADs construction and a subsequent validation cohort. The derivation cohort included IBD patients from three institutions between 2012 and 2021, and the inclusion criteria were: (a) diagnosis of CD or UC determined by a multidisciplinary team based on the latest guidelines at the time; (b) patients underwent CTE scans on admission; (c) age ≥ 18 years old. Patients with incomplete or poor-quality CT images or with history of previous bowel resection were excluded. Newly diagnosed IBD patients between January and May 2022 were included as the validation cohort, and the inclusion and exclusion criteria were consistent with the derivation cohort. The gender, age, height, weight, baseline laboratory indicators (C-reactive protein (CRP), erythrocyte sedimentation rate (ESR) and serum albumin (Alb)) and the distribution of intestinal lesions based on imaging, endoscopy and histological results were recorded. In a prior study [15], we reported on 316 patients in the current study. The prior report evaluated the discriminative ability of VAT radiomics. The current study included a larger dataset, and proposed a grading system with simplified quantitative parameters.

Image acquisition and evaluation

All CTE scans were performed following the latest imaging protocol [22]. Patient fasted for 4–6 h and avoided gas-producing liquids, then ingested 1000—1500 ml of 2.5% mannitol solution before the scan. After pre-contrast scanning, a contrast-enhanced CT was performed after a rapid bolus of iopromide (Ultravist 370, 370 mg/ mL, Bayer Schering Pharma, Berlin, Germany) (1.5 mL/ kg) at a rate of 3—5 mL/s, followed by a 20 mL saline flush using a power injector. Images were routinely acquired in the arterial, intestinal, or portal venous phases.

Two radiologists (Z.X. and Y.L., with 4 years of experience) independently evaluated images blinded to the clinical information. Radiologists evaluated the derivation cohort based on their own clinical experience. They assessed active inflammation of the proximal small intestine (including jejunum and proximal ileum), terminal ileum (TI), and colorectum on CTE, using signs such as intestinal wall thickening (≥ 3 mm), mucosal enhancement, intestinal stricture (with or without upstream intestinal dilatation), and intestinal fistula formation [23]. A lesion involving a single segment was defined as single-segmental lesion, whereas lesions involving multiple sites within a single segment or those involving multiple segments were defined as multi-segmental lesions. Active inflammatory regions on CT images that extend continuously along the bowel were defined as continuous lesions, and those distributed in multiple areas with normal segments in between were defined as skip lesions. Moreover, asymmetric enhancement (AE) of the intestinal wall along the mesenteric border, a characteristic finding of CD [23], along with the presence of intestinal fistula, were recorded as “present” or “absent”, with the assessment of AE referencing Figure S1 (Additional file 1: Fig. S1). Given the limited resolution of CT images, patients with suspected IBD at participating institutions have routinely undergone pelvic MRI upon admission to assess for perianal lesions since 2016. Therefore, for patients who have undergone MRI scans, pelvic MRI scans were used to confirm the presence of perianal fistula, characterized by a hyperintensity track on T2-weighted sequences, regardless of fat suppression, that is connected to the internal orifice of the anorectum [24, 25]. For patients from 2012 to 2015, their perianal fistulas were identified through a combination of CT images, previous perianal surgeries, clinical symptoms, and physical examinations. After finishing the evaluation of the derivation cohort, the two radiologists were trained in radiological diagnosis of IBD and evaluated the validation cohort using the same criteria three weeks later. The results of the two evaluators were compared, and any discrepancies were adjudicated by a senior radiologist (Y.S.), with 13 years of experience.

Image segmentation and fat parameter extraction

Based on a previously developed VAT automatic segmentation algorithm [26, 27], we incorporated new labels including subcutaneous adipose tissue (SAT) and skeletal muscle, obtaining a comprehensive segmentation model, as shown in Fig. 1. To ensure segmentation consistency, arterial-phase CT images of all patients were selected and input into the automated segmentation model, which labels and outputs various components. To minimize heterogeneity from different imaging sources, we normalized both spatial and intensity before feeding the images into the neural network for processing. Although the model has demonstrated high segmentation accuracy in previous studies [27], the output volumes of interest (VOIs) were individually reviewed and manually adjusted to ensure accuracy. The calibrated VOIs were used to extract parameter information. The current study focused only on the segmentation results of VAT and SAT using arterial phase CT images. The fat parameters related to VAT content, distribution and attenuation were calculated using the segmentation results. Due to incomplete SAT scans in obese patients within the pelvic region, we only analyzed the soft tissue in the lumbar region (the upper border of the first lumbar vertebra to the lower border of the fifth lumbar vertebra). The volume ratio of VAT and SAT, denoted as Visceral-to-Subcutaneous Fat Ratio (VSR), can be expressed as:

$$VSR\;=\frac{\mathrm{Number}\;\mathrm{of}\;\mathrm{SAT}\;\mathrm{pixels}}{\mathrm{Number}\;\mathrm{of}\;V\mathrm{AT}\;\mathrm{pixels}}$$
Fig. 1
figure 1

Overall flowchart of the patient inclusion and analysis process. CD, Crohn's disease; UC, Ulcerative colitis; ROC, Receiver operating characteristic curves

This formula calculates VSR by dividing the total count of pixels representing VAT by the total count of pixels representing SAT. Additionally, we calculated the VAT area in each slice within the lumbar region, and then calculated the mean, standard deviation (SD), and coefficient of variation (CV) of areas in the region to reflect the uniformity of VAT distribution. We also extracted CT attenuation values of all VAT pixels in the region and calculated the median, 25% quartile, and 75% quartile.

Statistical analysis

The automatic segmentation and quantitative analysis of fat parameters were performed using Python version 3.7, while the statistical analysis was conducted using SPSS 26.0 and MedCalc Statistical Software version 20.100. Based on the distribution of the data, we used Student's t-test or Mann–Whitney U test, chi-square test or Fisher's exact test to compare the differences in various indicators between IBD patients, derivation cohort and validation cohort. Binary logistic regression was used in the derivation cohort to select the predictive factors. Univariate analysis was first performed, and then variables with a P < 0.05 were included in the multivariate analysis. Based on the results of above analysis, the diagnostic performance of individual and combined indicators was evaluated using receiver operating characteristic curves (ROC) and Precision-Recall (PR) curve analysis. The cutoff value of VSR for distinguishing CD from UC was determined using the Youden index in the derivation cohort, and a value above this cutoff indicates that VAT constitutes the majority of the adipose tissue compared to SAT, which is defined as "Visceral Fat Predominance (VFP)." The development of IBD-RADS is based on key indicators identified from the derivation cohort analysis. These indicators were subsequently validated through diagnostic performance analysis in the validation cohort, including ROC and PR curve analysis. The comprehensive integrated discrimination index (IDI) was also calculated. In addition, the consistency of the two radiologists in image evaluation was evaluated by the Kappa coefficient (< 0.4: fair; 0.4–0.6: moderate; 0.6–0.8: substantial; > 0.8: almost perfect). Statistical significance was set at α = 0.05 (two-sided).

Results

Patient demographic characteristics

Figure 1 shows the inclusion flowchart. We excluded 98 patients with incomplete or poor image quality from 704 patients and eventually enrolled 365 CD and 241 UC patients for the construction of IBD-RADS (n = 606). For validation, we selected 155 newly diagnosed IBD patients in a subsequent period. Table 1 summarizes the baseline characteristics of all patients. The median age of CD patients was around 30 years, while that of UC patients was 40 years, consistent in both cohorts. CD patients were predominantly male and had higher levels of inflammatory markers, as well as lower body mass index and Alb levels. In CD patients, the lesions were mainly distributed in the ileocolon (56.2% in derivation and 66.0% in validation) followed by the ileum (27.1% in derivation and 22.7% in validation); while the majority of UC patients had diffuse lesions (53.1% in derivation and 53.4% in validation), followed by left-sided colon lesions (32.4% in derivation and 32.8% in validation). There was no statistical difference in lesion location between the two cohorts (P = 0.20 and 0.99).

Table 1 Baseline characteristics at enrollment of all patients

Comparison of radiological results between CD and UC

Comparisons of imaging signs and fat parameters between IBD patients in both cohorts are summarized in Table 2. The differences of imaging signs were consistent in the two cohorts. Although inflammation in the TI was predominantly in CD patients, it was also found in sixty-seven UC patients. Inflammation in the proximal small bowels and lesions confined to small bowels only existed in CD. Lesions in IBD patients were mainly multi-segmental, with CD primarily affecting the ileocolonic region and UC affecting the colon. The lesion distribution of UC patients was continuous, while most CD patients exhibited a skip pattern (76.2% and 82.5%). The proportion of AE was higher in CD patients, reaching 78.6% (287/365) and 86.6% (84/97) in the derivation and validation cohorts, respectively. Except for twenty patients with CD and fourteen patients with UC in the derivation cohort, all other patients underwent pelvic MRI scans. Perianal fistula was more common in CD patients, and intestinal fistula was only observed in CD. For quantitative parameters, CD patients had a greater VSR in both cohorts (P < 0.001 and P = 0.02); for VAT distribution, although VAT content was lower in CD, their CV was larger (P = 0.004 and P = 0.07); moreover, the attenuation values in CD were generally higher, especially 75% quartile (< 0.001 and P = 0.02).

Table 2 Radiological evaluation and analysis of all patients

Screening and validation of diagnostic indicators

Logistic regression analysis included clinical indicators, imaging signs (AE and perianal fistula), and quantitative parameters (VSR, CV and attenuation). Limited small bowel lesions, intestinal fistula, and skip distribution were excluded because they were not observed in UC cases, which prevented model convergence. Multivariate analysis revealed that age, AE, perianal fistula, and VSR were significant predictors of CD (Additional file 2: Table S1). ROC analysis showed that the specificity of limited small bowel lesions, intestinal fistula, and skip distribution reached 100%, and AE also showed good diagnostic performance in distinguishing between CD and UC (area under the curve, AUC, 0.877 [0.848, 0.902]). Perianal fistula (AUC, 0.639 [0.600, 0.678]) and VSR (AUC, 0.657 [0.617, 0.694]) showed similar diagnostic performance (P = 0.52), with a VSR cut-off value of 0.97 for defining “VFP”. The clinical indicators of age (AUC, 0.732 [0.695, 0.767]), BMI (AUC, 0.697 [0.650, 0.741]), and CRP (AUC, 0.670 [0.629, 0.709]) also performed well. Details are summarized in Table S2 (Additional file 2: Table S2).

We compared the diagnostic performance of single and combined indicators, as summarized in Table 3. The joint indicator performed better than a single indicator, regardless of the cohort. Figure 2 shows the ROC and PR curves for both cohorts. In the derivation cohort, the AUC of the joint indicator was greater than that of a single sign (P < 0.001), and IDI analysis showed the improvement in discrimination ability. There was no significant difference when AE was combined with perianal fistula or VSR (P = 0.27). The combination of AE, perianal fistula and VSR obtained the largest AUC, which was significantly larger than AE + perianal fistula (P < 0.001) and AE + VSR (P = 0.03). The same trend was also observed in the validation cohort. Additionally, the comparison of AUC among the inflammatory marker CRP, body mass index, and various imaging markers showed that CRP and body mass index were similar to perianal fistula (P = 0.43 and 0.21) and VSR (P = 0.68 and 0.32), but significantly lower than AE (both P < 0.001) and combined imaging markers (both P < 0.001).

Table 3 Diagnostic Performance Analysis of Single and Combined Indicators
Fig. 2
figure 2

The Receiver operating characteristic and Precision-Recall curves of derivation and validation cohorts. a-b In the derivation cohort, the AUC of AE was 0.877, and no significant difference was observed when AE combined with perianal fistula or VSR (AUC: 0.901 vs. 0.912; P = 0.27). The combination of three indicators obtained the largest AUC (0.929), significantly larger than AE + perianal fistula (0.901; P < 0.001) and AE + VSR (0.912; P = 0.03). The AUPRC of imaging signs combined with VSR was greater than that of combined imaging signs alone (0.961 vs. 0.946). c-d In the validation cohort, the AUC of AE was 0.916. The combination of three indicators also obtained the largest AUC (0.964) and AUPRC (0.983 vs. 0.981). AUC, area under the curve; AE, Asymmetric enhancement; VSR, Visceral-to-subcutaneous fat ratio; AUPRC, area under the precision-recall curve

Construction and validation of IBD-RADS

The IBD-RADS based on the above analysis are summarized in Fig. 3. In IBD-RADS, AE was the major criterion, and perianal fistula and VFP were auxiliary criteria. Limited small bowel disease, intestinal-fistula, and skip distribution were highly specific indicators, favoring the diagnosis of CD in particular. We proposed five grades according to the above indicators, and Fig. 3 shows the grade distribution in IBD patients, as detailed in Table 4. Grade 4 and 5 can be initially diagnosed as CD, with 94.3% (derivation) and 95.8% (validation) of CD patients being classified. Grade 3 met only the major criterion and was likely to be diagnosed with CD (57.1% (8/50) and. 50.0% (2/4)), with 2.2% (10/462) of CD patients classified into this category. Grade 1 and 2 did not meet the major criterion. Among them, Grade 1 did not have any imaging signs mentioned above, were less likely to be diagnosed with CD (3.6% (6/167) and 2.5% (1/40)) and preferred UC. Grade 2 met only the auxiliary criteria and also preferred UC. Figure 4 shows imaging and endoscopic images of three patients classified as Grade 5b (a-b) and Grade 1 (c), respectively.

Fig. 3
figure 3

Criterion, definitions, and interpretations of IBD-RADS, and its distribution in the two cohorts. IBD-RADS includes one major criterion, two auxiliary criteria, and three special support items. The grading system classifies patients into five grades, with Grade 4 and 5 further subdivided into three and two categories, respectively. Based on the distribution in the derivation and validation cohorts, we recommend that Grade 1 and 2 patients should be initially classified as UC, while Grade 3 and 4 are more indicative of CD. Patients classified as Grade 5 can be preliminarily classified as CD. CD, Crohn's disease; UC, Ulcerative colitis

Table 4 Distribution of patients in the Derivation and Validation cohorts
Fig. 4
figure 4

CTE, pelvic MRI and endoscopic images of two Grade 5 CD patients and one Grade 1 grade UC patient. a The CTE images of a 24-year-old male with CD showed asymmetric enhancement (green arrow) and intestinal fistula (yellow asterisk). The visceral and subcutaneous fat were marker as yellow and blue, respectively, and the VSR is 1.19 (> 0.97, visceral fat predominance). He was classified as Grade 5b. b The CTE, pelvic MRI and endoscopic images of a 24-year-old male patient with CD showed discontinuous asymmetric enhancement (green arrow), perianal fistula (yellow arrow) and longitudinal ulcer (green asterisk), and the VSR was 2.55 (> 0.97, visceral fat predominance). He was classified as Grade 5b. c The CTE image of a 55-year-old female patient with UC showed symmetric enhancement of the sigmoid colon with a VSR of 0.38 (< 0.97). She was classified as Grade 1. CTE, CT enterography; CD, Crohn's disease; UC, Ulcerative colitis

Interobserver reproducibility

The Kappa coefficients for each imaging sign are summarized in Table S3 (Additional file 2: Table S3). Whether in which cohort, the consistency of indicators was between substantial and almost perfect, with the consistency of UC generally higher than that of the CD group. After training, the consistency of the evaluators in assessing the validation cohort significantly improved, as reflected in each item.

Discussion

We identified discriminative imaging features and quantitative parameters from baseline radiological examinations of IBD patients upon admission. Subsequently, an imaging-based grading system, IBD-RADS, specifically designed for CD was constructed and validated. IBD-RADS highlights the importance of radiological findings and the complementary role of quantitative parameters to imaging signs. Notably, Grade 3 to 5 correctly classified most CD patients (derivation: 96.5%, validation: 98.0%), making it valuable for the initial stratification of clinically suspected IBD patients.

The IBD-RADS consists of five imaging signs and one quantitative parameter, with AE of the bowel wall along mesenteric border on CTE images serving as the major criterion due to its high sensitivity and specificity in both cohorts. Longitudinal ulcer along the mesenteric border is a recognized feature of CD, as the mesenteric and antimesenteric borders have different blood supply [28]. The mesenteric border side of TI, a susceptible target for CD, is supplied by short vessels, which act as effective terminal arteries and supply a mucosa more susceptible to ischemia, resulting in preferential ulceration [29]. Imaging signs corresponding to this feature (asymmetric bowel wall thickening, enhancement, and pseudodiverticula on the antimesenteric side) have been described and considered characteristic of active CD [23]. In this study, we investigated AE, and confirmed that it is a sign that occurrs in 80% of CD patients and is one of the independent predictors of CD. In addition, we also found that the presence of isolated small bowel disease, intestinal fistulas, and skip distribution in the CTE images were three favoring items for the diagnosis, occurring only in CD. However, since these signs can occur in non-IBD population, they are not used as major criteria. Favoring items maximize their diagnostic utility only when major criteria are met, as various infectious diseases and malignancies can lead to similar imaging findings [30, 31].

Previous guidelines have emphasized the importance of perianal lesions for CD diagnosis [13]. In the current study, perianal fistula confirmed by imaging was used as an auxiliary criterion, and its addition significantly improved the diagnostic performance of the major criterion alone. However, CTE only reveals extensive and obvious lesions, while pelvic MRI is the best method for visualizing the anatomy of fistulas, especially in clinically asymptomatic patients [23, 32]. Therefore, we recommend suspected IBD patients receive both CTE and pelvic MRI for initial imaging screening. Another auxiliary criterion is VFP, which means a higher proportion of VAT compared with SAT and significantly improves the diagnostic performance of using major criterion or imaging signs alone. The role of adipose tissue in IBD is constantly being interpreted, and relevant clinical, imaging and cytomolecular studies are gradually confirming that VAT is an endocrine organ that regulates immunity and inflammation rather than just an energy storage tissue [33]. Significant differences in VAT have been observed between patients with CD and UC. In CD patients, increased VAT is often associated with “creeping fat,” where mesenteric fat wraps around inflamed intestinal segments, a phenomenon that is uncommon in UC patients [34]. The presence of creeping fat in CD is linked to intestinal fibrosis and disease progression [35]. SAT levels are primarily linked to nutritional status, whereas VAT accumulation is affected by both nutritional factors and disease processes, underscoring its complex role. The VFP, derived from VSR, which accounts for both VAT and SAT, effectively highlights the critical role of VAT in disease pathology. Previous studies have extracted VAT texture features from CT images to identify IBD subtypes, but manual segmentation and a lack of exploration of intestinal lesions limit their clinical application [16]. Volume-based VSR was also calculated in a study using automatic segmentation, but its predictive ability for CD disease progression is limited [17]. We speculate that this index has greater diagnostic value, as VFP in a single slice has been proven useful in differentiating between CD and intestinal tuberculosis [36,37,38,39]. To avoid selection bias, we performed analysis of the entire lumbar region and automated the analysis process including segmentation and parameter extraction. Interestingly, despite lower BMI and poorer nutritional status, CD patients have a higher VAT proportion than UC patients, highlighting the crucial role of VFP in disease diagnosis. It is noteworthy that although the VAT distribution and attenuation parameters did not show statistical significance in multivariate analysis, the differences between CD and UC patients were not negligible, thus confirming the value of fat parameters in IBD studies.

Although multiple diagnostic indicators were proposed, our analysis showed that using auxiliary criteria alone cannot effectively stratify IBD patients. Maximum diagnostic efficacy can only be achieved by using a multiparametric apporach. Additionally, integrating IBD-RADS into current clinical workflows is essential to maximize its effectiveness. Radiologists can use this grading system to initially classify suspected IBD patients, expediting diagnosis by gastroenterologists and enabling earlier treatment planning. A standardized grading system can also enhance communication among various healthcare providers in a multidisciplinary team, including radiologists, gastroenterologists, and surgeons, ensuring that all team members share a common understanding of the disease status. Furthermore, it will facilitate data collection for follow-up and monitoring of patient treatment responses. This allows for the accumulation of standardized data across centers, supporting the continuous improvement of the grading system and uncovering its broader applications. The clinical integration of IBD-RADS requires meeting several conditions, with standardized training for radiologists being the most crucial component. Due to the subjectivity of imaging evaluation, even radiologists with similar levels of experience may differ in their interpretation of findings. Our study results indicate that the consistency of evaluations for all indicators improved among junior radiologists after training. Meanwhile, implementation requires time and resources, such as software updates for automated segmentation models and additional time for system integration into practice. Although an initial investment is necessary, standardized processes can significantly improve diagnostic efficiency and accuracy over the long term. Finally, we suggest establishing a continuous evaluation and feedback mechanism. Regularly updating the grading standards based on the latest research and clinical experience of the multidisciplinary team will help optimize the system's application. However, it is important to note that the imaging-based grading system is a stratification tool rather than a diagnostic standard; it may occasionally produce errors, and when necessary, should be used in conjunction with other assessments, such as endoscopic and pathological findings, to accurately evaluate disease activity.

Our study has several limitations. First, the cross-center study led to inevitable differences in equipment, so we selected indicators with greater consistency and accessibility to establish our grading system. Second, the retrospective design may introduce potential selection bias, as only patients who met the inclusion criteria and had radiological data were included, which may not represent the overall IBD population. Third, our study focused on the diagnostic value of radiology, and the concordance between IBD-RADS and endoscopy needs to be explored further. Fourth, although clinical indicators were compared, they were only used as a reference to the grading system, and were not our focus, but it is worth investigating in the follow-up research. Fifth, the current IBD-RADS requires further prospective validation, including its application to diverse ethnic groups from various regions and comparison with non-IBD intestinal diseases. Sixth, although the criteria were mainly based on CTE, they are equally applicable to MRE, which is debatable considering that MRE is not the preferred choice for the initial diagnosis. Finally, a comparison between intestinal ultrasound and the IBD-RADS system would provide a more comprehensive perspective for clinical practice. However, due to the limitations of the imaging data obtained in this study, such an analysis could not be conducted and warrants further prospective studies.

Conclusions

In conclusion, we propose a new imaging report and data system for clinically suspected IBD patients, aimed at improving the differentiation between CD and UC. This system utilizes easily assessable, consistent imaging signs and stable quantitative parameters obtained through automated calculations, making it well-suited for clinical application. Future research should focus on prospective validation and refining the grading system to optimize its diagnostic performance and explore its integration into routine clinical workflows.