Introduction

Intussusception is one of the most common acute abdominal emergencies in children, with the ileocolic type being the most prevalent; it is usually diagnosed urgently using ultrasound1. Nonsurgical enemas administered within 24 h of onset relieve symptoms in approximately 84% of patients2,3. Fewer than one-third of patients present with the classic triad of symptoms (abdominal pain, palpable mass, and blood-stained stools), and some pediatric diseases have similar clinical manifestations4. Thus, the diagnosis can easily be delayed or missed during the initial emergency visit, and delayed treatment can cause sepsis or even hypovolemic shock5. Patients with severe symptoms or failed enemas require prompt identification of surgical indications6,7.

Ultrasound achieves a sensitivity and specificity of 92–100% for diagnosing intussusception8,9. However, ultrasound scanning runs at more than 25 frames per second (FPS): a single scan comprises thousands of image frames, of which only a few with clear pathological features are useful for diagnosing intussusception, and these are easily missed by the human eye. In addition, noise, distortion, and artifacts degrade the quality of ultrasound images10. Therefore, specialized imaging knowledge and skilled sonographers are required to diagnose intussusception, and recognizing surgical indications to assess disease severity is even more challenging.

Artificial intelligence (AI) has demonstrated broad applicability in diagnosing pediatric diseases from medical images11,12. Previous studies have proposed deep-learning (DL) approaches to extract frames with complete pathological features from ultrasound videos for diagnosis13,14. AI that uses images labeled with a priori knowledge to diagnose intussusception has also been reported; however, these studies considered neither the algorithm’s speed nor surgical indications15,16. Furthermore, the internal decision-making process of DL is difficult to understand intuitively, which hinders its translation into clinical practice.

We aim to develop and validate a deep-learning pipeline for real-time navigation of diagnostic planes during ultrasound scanning to identify ileocolic intussusception and provide surgical indications, using heterogeneous multicenter datasets of images and videos for retrospective testing and external validation. The tool’s scalability is prospectively evaluated using a real-world clinical dataset. The performance of junior sonographers with AI-assistance is compared with that of junior, intermediate, and senior sonographers. We also attempt to visually interpret the “black box” of DL’s internal decision-making to boost sonographers’ confidence in the algorithm.

Results

Characteristics of patients

Figure 1 illustrates the flow diagram of this study. Epidemiological characteristics, medical image features, and final diagnoses for the three datasets are summarized in Table 1. In the retrospective datasets, comprising 14,085 images from 8,736 children with ileocolic intussusception, the median age was eight months (range, 3–36 months), 63.5% of patients were male, 8.4% required surgery, and the classic triad of symptoms was observed in fewer than one-third of patients.

Fig. 1: The AI system for diagnosing ileocolic intussusception and providing surgical indications.
figure 1

a Model development for the AI system and selecting the best-performing model. The system includes a pipeline consisting of an image normalization module, an image enhancement module, and an image analysis module. b Application and evaluation of the AI system.

Table 1 Datasets for training, validation, testing, and the prospective pilot study, and characteristics of patients.

Training, validating, and selecting the optimal deep-learning model

The three AI models (Faster R-CNN, YOLOv5, and modified YOLOv5) were trained using a training set of 9,750 images from 6,081 patients and a validation set of 3,249 images from 2,023 patients. On the internal test set of 1,086 images, the modified YOLOv5 outperformed Faster R-CNN (average-AUC: 0.976 [95% CI, 0.967–0.983] vs. 0.793 [95% CI, 0.782–0.805], P < 0.001; median FPS: 102 [range, 93–109] vs. 25 [range, 20–31], P < 0.001) and matched the AUC of YOLOv5 while processing frames faster (average-AUC: 0.976 [95% CI, 0.967–0.983] vs. 0.981 [95% CI, 0.972–0.986], P = 0.698; median FPS: 102 [range, 93–109] vs. 63 [range, 57–59], P < 0.001). Thus, we selected the best-performing modified YOLOv5.

Testing on the internal retrospective image test set

The deep-learning system was tested on the internal test set of 1,086 images from 632 patients. The three image types, showing nonsurgical doughnut, nonsurgical sleeve, and surgical signs, were identified with AUCs of 0.977 (95% CI, 0.968–0.986), 0.966 (95% CI, 0.949–0.983), and 0.973 (95% CI, 0.955–0.992), respectively, and an average-AUC of 0.972 (95% CI, 0.936–1.000); the corresponding confusion matrix is shown in Fig. 2. The AUC for identifying the patients with ileocolic intussusception requiring surgery was 0.962 (95% CI, 0.940–0.985) (Fig. 2 and Table 2).

Fig. 2: Prediction results of the deep-learning system on the internal test set of 1,086 ultrasound images in 632 patients.
figure 2

a Normalized confusion matrix of images. b AUCs of images. c Normalized confusion matrix of patients. d AUC of patients.

Table 2 Performance of the deep-learning system on the internal test set of 1,086 ultrasound images from 632 patients.

Model generalization to an external retrospective video dataset

We added a “Con-best.py” program to the YOLOv5 model to select the optimal standard diagnostic plane, i.e., the one with the highest Confidence, in each video. The AI system was then tested on 184 ultrasound videos from 184 patients (including cases with no intussusception, nonsurgical intussusception, and surgical intussusception), achieving AUCs of 0.958 (95% CI, 0.909–1.000), 0.953 (95% CI, 0.919–0.988), and 0.956 (95% CI, 0.902–1.000) for the three categories, respectively, with an average-AUC of 0.956 (95% CI, 0.961–0.991) and a median FPS of 91 (range, 83–101) (Table 3). If the model detected no nonsurgical doughnut, nonsurgical sleeve, or surgical sign in a video, the patient was classified as not having ileocolic intussusception.

Table 3 Performance of the deep-learning system on the external retrospective dataset of 184 videos from 184 patients.

Visual interpretation of deep-learning internal decision-making

Four images were cropped at random angles, their brightness and contrast were adjusted, and they were stitched into a single “Mosaic”-enhanced image (Fig. 3a). The learning effect of each convolutional layer can be explained using visualized feature maps. Here, the features of the last convolutional layer of YOLOv5 were mapped to the range 0–255 and converted into images. The binary grayscale image shows that the convolutional layer learned to recognize the doughnut sign (Fig. 3b). A heat map, which visualizes the activation of convolutional-layer features, further helps determine whether the model correctly identifies image features. The heat map was created by extracting the activations of the last convolutional layer of YOLOv5 and weighting them by the average gradient of the feature map. The red and yellow regions in Fig. 3c mark the lesion area identified by the model. The AI system automatically classified the images as nonsurgical doughnut, nonsurgical sleeve, and surgical signs and labeled the corresponding lesion areas with Confidences of 0.97, 0.97, and 0.98, respectively. The values in Fig. 3d–f represent the Confidence, calculated using Eq. (1), in which P(object) was assigned 1 when the category was accurately predicted and 0 otherwise.

$$\mathrm{Confidence}=P(\mathrm{object})\times \mathrm{IoU}\left(\mathrm{truth}\rightarrow \mathrm{pred}\right)$$
(1)
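As an illustration of how such a heat map can be derived, the following is a minimal PyTorch sketch of a Grad-CAM-style computation: the activations of a chosen convolutional layer are weighted by their channel-wise average gradients and mapped to 0–255. The hook-based helper is our own illustrative construction and assumes `model(image)` returns a tensor of detection scores; it is not the published pipeline.

```python
import torch
import torch.nn.functional as F

def gradcam_heatmap(model, image, target_layer):
    """Grad-CAM-style heat map: weight a conv layer's activations by the
    channel-wise average gradient of the top detection score (a sketch)."""
    acts, grads = [], []
    h1 = target_layer.register_forward_hook(lambda m, i, o: acts.append(o))
    h2 = target_layer.register_full_backward_hook(lambda m, gi, go: grads.append(go[0]))

    score = model(image).max()          # assumed: model returns a score tensor
    score.backward()                    # gradients are captured by the hooks
    h1.remove(); h2.remove()

    act, grad = acts[0], grads[0]
    weights = grad.mean(dim=(2, 3), keepdim=True)  # average gradient per channel
    cam = F.relu((weights * act).sum(dim=1))       # gradient-weighted activation map
    cam = (cam - cam.min()) / (cam.max() - cam.min() + 1e-8)
    return (cam * 255).byte()                      # mapped to 0-255, as in Fig. 3
```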
Fig. 3: Visual interpretation of deep-learning internal decision-making.
figure 3

a The result of “Mosaic” image enhancement. The “0,” “1,” and “2” represent the three types of images. The anchor boxes represent the labeling of the lesion regions. b The feature map. c The heat map was generated by extracting the feature map from the last convolutional layer; the red and yellow regions indicate the nonsurgical doughnut sign identified by the AI system. d, e, and f The AI system automatically identified the images as NSD, NSS, and SSI and labeled the lesion areas with Confidences of 0.97, 0.97, and 0.98, respectively. NSD Nonsurgical doughnut sign, NSS Nonsurgical sleeve sign, SSI Surgical sign.

Generalizing the model to real-world ultrasound diagnostic scenarios and comparing the performances of four groups of sonographers

We connected this tool to ultrasound machines to assess the performance of the deep-learning pipeline in a real-world ultrasound scanning scenario, and four sonographer groups with different skill levels performed replicate ultrasound examinations on each of the 242 patients suspected of having intussusception. Sonographers of equal skill level from the same department were assigned to the same group to ensure consistency among the observers within each group. The four groups were: two senior sonographers (13 and 15 years of experience); two intermediate sonographers (6 and 7 years of experience); two junior sonographers (2 years of experience each); and two junior sonographers with AI-assistance. The diagnostic results of the two sonographers within each group were averaged, yielding average-AUCs of 0.973 (95% CI, 0.938–1.000), 0.919 (95% CI, 0.862–0.979), 0.857 (95% CI, 0.808–0.912), and 0.966 (95% CI, 0.923–0.999), and median scanning times (in minutes) of 3.14 (interquartile range [IQR], 2.02–4.13), 6.98 (IQR, 5.89–8.03), 9.46 (IQR, 7.91–11.17), and 3.66 (IQR, 2.91–4.21), respectively (Fig. 4, Table 4, and Supplementary Fig. 1).

Fig. 4: The performance of the four groups of sonographers was compared among 242 volunteers.
figure 4

a Comparison of the scanning time among the four groups of sonographers. The central line marks the median scanning time, and the bounds of the box mark the first and third quartiles. b The average AUCs of the four groups of sonographers for diagnosing the three categories of patients, compared using R (version 4.1.1, R Core Team, 2021) with the “pROC” package and the DeLong test. c The AUCs of the four groups of sonographers for diagnosing the three categories of patients. P = P value.

Table 4 Performance of the four groups of sonographers in the diagnosis of 242 volunteers.

Discussion

In this study, we developed a real-time multiobject detection and tracking deep-learning model, trained on multicenter heterogeneous datasets and ultrasound imaging characteristics, to diagnose ileocolic intussusception and provide surgical indications, and we tested it both retrospectively and prospectively. The model achieved average-AUCs of 0.972 and 0.956 on the internal image test set and the external video dataset, respectively. In the real-world ultrasound diagnostic scenario with 242 volunteers, AI-assistance significantly improved the performance of junior sonographers (average-AUC: 0.966 vs. 0.857, P < 0.001; median scanning time: 3.66 min vs. 9.46 min, P < 0.001), whose assisted performance surpassed that of intermediate sonographers (average-AUC: 0.966 vs. 0.919, P = 0.039; median scanning time: 3.66 min vs. 6.98 min, P < 0.001) and was comparable to that of senior sonographers (average-AUC: 0.966 vs. 0.973, P = 0.600). Overall, this diagnostic tool can assist sonographers in managing children with ileocolic intussusception.

Accurate and timely diagnosis of ileocolic intussusception and recognition of surgical indications are critical for selecting treatment plans and achieving positive treatment outcomes1,2,3,7. Studies have proposed using deep learning to diagnose intussusception in children from plain abdominal radiographs15,16. However, X-rays are less sensitive and specific than ultrasound for diagnosing intussusception17. Our algorithm achieved a higher AUC using three ultrasound datasets. Additionally, the tool processed images at a median FPS of 91 (range, 83–101) during the ultrasound scan, displaying Confidence values and anchor boxes to guide the sonographer in adjusting the scan position in real time, thereby enhancing the diagnostic accuracy and efficiency of less experienced sonographers. The examination time of junior sonographers was reduced from 9.46 min (IQR, 7.91–11.17) to 3.66 min (IQR, 2.91–4.21), which is particularly valuable because children under 3 years of age often do not cooperate during ultrasound scans and tend to cry. Furthermore, our algorithm identified surgical indications, facilitating the assessment of disease severity and increasing confidence in selecting appropriate treatment options. Nevertheless, intussusception is a dynamic disease: even when an experienced sonographer diagnosed a surgical indication, a few patients who underwent laparotomy ultimately did not require partial bowel resection. Therefore, in clinical practice, even with surgical indications, preference is given to conservative, nonsurgical enemas to minimize surgical risk; surgery is considered only after 1–3 failed enemas, depending on the status of each patient’s surgical indication.

The proposed model is stable and compatible because it was trained using a multicenter, multi-device dataset and tested with image, video, and real-world clinical datasets. YOLOv5 has three sets of multiscale adaptive anchor boxes that can accommodate deviations in the physical size of images produced by different ultrasound systems18. The tool can be readily applied in clinical practice by connecting it to an ultrasound machine, which is particularly helpful for junior and intermediate sonographers. Furthermore, we added a “Con-best.py” file to YOLOv5 that selects the standard plane with the highest Confidence from a recorded ultrasound video. This addition further improves diagnostic accuracy, because even skilled senior sonographers find it challenging to select the standard plane with the highest confidence during a high-speed ultrasound scan.

AI systems can detect differential and subtle features of medical images, even beyond the observational ability and comprehension of clinicians19,20. Differences in ultrasound imaging between nonsurgical and surgical intussusception have also been studied21,22,23,24,25,26,27. Based on this combined evidence, we suggest that a deep-learning pipeline trained on a dataset labeled with a priori knowledge can diagnose intussusception and provide surgical indications.

Our study has some limitations. First, the model was trained and validated using ileocolic intussusception ultrasound datasets and did not cover other types of intussusception, such as the ileoileocolic, enteroenteric (including jejunojejunal and ileoileal), and colocolic types. Although over 90% of intussusceptions are ileocolic3,28, this limitation still affects the generalizability of the model: if used to diagnose all types of intussusception, false negatives may lead to delayed treatment, whereas false positives may result in unnecessary enemas or surgical interventions. Second, the small number of surgical intussusception samples might also have affected the model’s performance, despite our increasing the number of images using image-enhancement techniques. Third, the AI system cannot diagnose intussusception in adults because of the high gas and fat content of the adult intestine and the resulting poor quality of ultrasound images, necessitating computed tomography scans or X-rays.

In conclusion, a deep-learning pipeline based on heterogeneous multicenter ultrasound datasets and their imaging features can assist sonographers in diagnosing pediatric ileocolic intussusception and provide surgical indications for assessing disease severity. Further training and validation using datasets covering additional types of intussusception, together with new technologies, are needed to enhance the generalizability and performance of the model.

Methods

Datasets

The heterogeneous multicenter dataset included a retrospective dataset of images for training and internal testing, a retrospective dataset of videos for external testing, and a prospective dataset of volunteers to compare the performance of junior sonographers with AI-assistance against that of junior, intermediate, and senior sonographers (Fig. 1). After screening, the final eligible data were collected from six hospitals: three regional hospitals affiliated with the Guangzhou Women’s and Children’s Medical Center, namely the Children Branch (4781 images from 2842 patients, 61 videos from 61 patients, and 242 volunteers), the Zengcheng Branch (2360 images from 1409 patients and 27 videos from 27 patients), and the Zhujiang New Town Branch (1748 images from 1184 patients and 33 videos from 33 patients); the Children’s Hospital of Zhengzhou University (2791 images from 1651 patients and 25 videos from 25 patients); Kaifeng Children’s Hospital (1361 images from 981 patients and 17 videos from 17 patients); and Dongguan Children’s Hospital (1044 images from 669 patients and 25 videos from 25 patients).

All parents of the volunteers agreed to their children’s participation in the prospective study. In the retrospective dataset, parents were informed in the initial admission form that their children’s clinical data might be used for the study. Subsequently, the data of those whose parents did not object were included. The study was approved by the local ethics committee and institutional review board of each hospital (Guangzhou Women and Children’s Medical Center: [2021] No. 486B01; Children’s Hospital Affiliated to Zhengzhou University: 2022-H-K29; Kaifeng Children’s Hospital: [2021] 127; Dongguan Children’s Hospital: LL2022121501).

Retrospective image datasets

The initial image dataset included 15,776 images of ileocolic intussusception showing nonsurgical doughnut, nonsurgical sleeve, and surgical signs in 9725 patients aged 3–36 months who underwent ultrasound between January 2017 and December 2021, with 1–8 images with typical pathological features per child drawn from the electronic medical record systems. Inclusion was based on the patients’ discharge outcomes, image quality, and the consensus of three ultrasound experts who reviewed the data (Prof. Haiwei Cao, Prof. Hongkui Yu, and Prof. Hongying Wang, with 13, 15, and 21 years of experience, respectively). In total, 1,048 initially extracted diagnostic planes from 629 patients were excluded as incorrect: (1) 264 from 147 patients with incomplete or poor-quality pathological features; (2) 614 from 379 patients whose initial ultrasounds were misdiagnosed as other abdominal conditions but who were ultimately diagnosed with nonsurgical or surgical intussusception; (3) 143 from 91 patients initially diagnosed on ultrasound with nonsurgical or surgical intussusception but eventually diagnosed with other abdominal conditions; and (4) 27 from 12 patients misdiagnosed as requiring surgery for intussusception who did not need partial bowel resection at laparotomy.

Retrospective video datasets

The video dataset comprised patients first seen in the emergency department for acute abdominal conditions between October 2021 and June 2022; only those whom the emergency department physicians initially diagnosed with suspected intussusception, based on experience and rapid laboratory tests, were sent to our ultrasound department for further examination. According to the actual outcome at each patient’s final discharge, patients with other types of intussusception (n = 15) and other abdominal diseases (n = 7) were excluded, as were poor-quality videos (n = 11). The final 184 patients, with ultrasound diagnoses of no intussusception, nonsurgical ileocolic intussusception, or surgical ileocolic intussusception, were selected, with one video per patient.

Prospective volunteer datasets

Patients initially diagnosed with suspected intussusception in the emergency department and referred to our ultrasound department for further examination were recruited as volunteers between April 2023 and May 2023. Based on the patients’ final discharge records, those with other types of intussusception (n = 21) and other acute abdominal diseases (n = 27) were excluded from the statistical analysis. The final 242 patients, including those with no intussusception, nonsurgical ileocolic intussusception, and surgical ileocolic intussusception, were used to prospectively validate the scalability of the system and to compare the performance of the four sonographer groups with different skill levels in real-world scenarios.

Ultrasound equipment

The ultrasound equipment used was as follows: (1) LOGIQ E10 and E10 R7 (GE Healthcare, United States) with C2-9-D array probes. (2) EPIQ Elite (Philips Medical System, the Netherlands) with a C10-3v probe. (3) Acuson Sequoia 512 (Siemens Medical Solutions, United States) with a 4C1 vector transducer. (4) Acuson S3000 (Siemens Healthcare, Germany) with an EC9-4 probe. (5) RS85 Prestige (Samsung Medison Co., South Korea) with a C2-8 probe. (6) DC-80 and Resona Version 7.0 (Mindray Medical, China) with 6C2 probes. (7) Arietta 850 (Esaote, China) with a UST-9123 abdominal probe. (8) Aplio i800 (Toshiba Medical Systems Corp, Tokyo, Japan) with a P7-3 abdominal probe. (9) Edge II (FUJIFILM SonoSite Inc., United States) with a C35x convex array probe. (10) Aplio i900 (Canon Medical Systems, Japan) with a PVT-375BT convex array probe. (11) MyLab X8 (Esaote, Italy) with a C353 convex probe. (12) S50 (SonoScape Medical Co., China) with a C6-2E micro-convex array probe.

Ultrasound image analysis

According to the guidelines for managing intussusception in children29, ultrasound imaging of intussusception displays typical features: a doughnut sign in the transverse view and a sleeve sign in the longitudinal view. Images with clear and complete pathological features are standard planes8,9. Compared with temporary intussusception, surgical intussusception presents more coexisting features, such as a longer intussusception, a thicker edematous intestinal wall, a larger doughnut diameter, pneumoperitoneum, signs of peritonitis, peritoneal fluid, and “trapped” fluid between the intestinal walls, which may indicate a higher surgical risk (Fig. 5)21,22,23,24. In this study, the aforementioned evidence served as the ‘gold standard’, while the patient’s final discharge record and the ultrasound experts’ labeling of the images were employed as the ‘silver standard’ for diagnosis.

Fig. 5: Three types of ileocolic intussusception lesions were labeled by ultrasound experts.
figure 5

a NSD Nonsurgical doughnut sign. b NSS Nonsurgical sleeve sign. c, d SSI Surgical sign.

Two pediatric ultrasound experts (Prof. Haiwei Cao and Prof. Hongkui Yu) selected 14,085 standard planes from 8,736 patients with ileocolic intussusception by consensus. The three types of lesions (nonsurgical doughnut, nonsurgical sleeve, and surgical signs) were then labeled using their expertise, in conjunction with the patients’ discharge records, in an online image labeling and intelligence enhancement tool (Roboflow, https://www.roboflow.com). Because some ultrasound images of intussusception are not easily classified as surgical or nonsurgical, and some initially extracted standard planes were incorrect, the initial diagnostic results were sometimes inconsistent with the actual outcomes at final discharge; such cases were reviewed against the discharge records and excluded by the ultrasound experts. In cases of disagreement, a third pediatric ultrasound expert (Prof. Hongying Wang) was consulted.

AI Model

We modified the YOLOv5 algorithm (https://github.com/ultralytics/yolov5) for real-time “intelligent navigation” of the standard planes to diagnose ileocolic intussusception and provide surgical indications during ultrasound scanning (Fig. 6). Previous studies have also proposed the term “intelligent navigation”30,31. Here, “intelligent navigation” refers to the automatic recognition of standard planes during ultrasound scanning. The model displays anchor boxes and Confidence values on the lesions, prompting the sonographer to adjust the position and direction of the scan to capture the optimal standard plane.
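As a concrete illustration of this loop, the sketch below runs a trained checkpoint over a live video feed using the public YOLOv5 hub API and overlays anchor boxes and Confidence values on each frame. The checkpoint path “best.pt” and capture device index 0 are illustrative assumptions, not the deployed configuration.

```python
import cv2
import torch

# Load a trained YOLOv5 checkpoint (the path "best.pt" is illustrative).
model = torch.hub.load("ultralytics/yolov5", "custom", path="best.pt")
model.conf = 0.25                              # minimum Confidence to display

cap = cv2.VideoCapture(0)                      # assumed frame grabber of the scanner
while cap.isOpened():
    ok, frame = cap.read()
    if not ok:
        break
    rgb = cv2.cvtColor(frame, cv2.COLOR_BGR2RGB)   # OpenCV is BGR; model expects RGB
    results = model(rgb)                            # detect doughnut/sleeve/surgical signs
    annotated = cv2.cvtColor(results.render()[0], cv2.COLOR_RGB2BGR)
    cv2.imshow("Intelligent navigation", annotated)  # boxes + Confidence values
    if cv2.waitKey(1) & 0xFF == ord("q"):            # press q to stop scanning
        break
cap.release()
cv2.destroyAllWindows()
```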

Fig. 6: The modified YOLOv5 for diagnosis of pediatric intussusception and providing surgical indications.
figure 6

Its structure consists of a backbone network to extract features, a feature pyramid to obtain multiscale features, a prediction head to generate target predictions, and a loss function to optimize the model.

YOLOv5 predicts the boundaries of targets in images, classifies and localizes them using probability, and achieves end-to-end image detection, processing images at 45–155 FPS18. Within the YOLOv5 framework, ‘depth_multiple’ controls the model’s depth, determining the number of modules (number × depth), and ‘width_multiple’ controls the model’s width, regulating the number of convolutional channels (number × width). We selected ‘depth_multiple’ = 0.33 and ‘width_multiple’ = 0.50, reducing the network’s depth to one-third and halving the number of convolutional channels. This configuration increases image-processing speed and decreases reliance on high-end computer hardware.
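These two hyperparameters live in YOLOv5’s YAML model definition; a minimal sketch of applying the scaling, assuming the standard repository layout (file paths are illustrative):

```python
import yaml

# Read the stock model definition shipped with the YOLOv5 repository.
with open("models/yolov5s.yaml") as f:
    cfg = yaml.safe_load(f)

cfg["depth_multiple"] = 0.33   # modules instantiated = number x depth
cfg["width_multiple"] = 0.50   # conv channels kept   = number x width

# Save a custom definition to pass to train.py via --cfg.
with open("models/yolov5_intussusception.yaml", "w") as f:
    yaml.safe_dump(cfg, f)
```

In the stock repository these values correspond to the “small” variant (yolov5s); the sketch simply shows where the two scaling knobs are set.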

Data augmentation expands datasets to improve convolutional neural network performance, making models more robust and preventing overfitting32. Roboflow was used to pre-process, normalize, and label the three types of lesions in the selected standard planes. Optical features of the images (brightness, hue, and saturation) were adjusted, along with their geometric features. The “Mosaic” enhancement feature of YOLOv5 was used to stitch four images into a single image, which significantly improves the model’s ability to recognize images with weak features33.
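For illustration only, a minimal NumPy sketch of the “Mosaic” idea: four randomly cropped, brightness-jittered images are stitched into one 2 × 2 composite. YOLOv5’s actual implementation also remaps the bounding-box labels onto the composite, which is omitted here.

```python
import numpy as np

def mosaic(imgs, out_size=640, rng=None):
    """Stitch four H x W x 3 uint8 images into a 2 x 2 'Mosaic' composite
    (a sketch; label remapping is omitted for brevity)."""
    if rng is None:
        rng = np.random.default_rng()
    canvas = np.zeros((out_size, out_size, 3), dtype=np.uint8)
    half = out_size // 2
    for k, img in enumerate(imgs[:4]):
        h, w = img.shape[:2]
        y0 = rng.integers(0, max(1, h - half))          # random crop origin
        x0 = rng.integers(0, max(1, w - half))
        crop = img[y0:y0 + half, x0:x0 + half].astype(np.float32)
        crop *= rng.uniform(0.8, 1.2)                   # brightness jitter
        crop = np.clip(crop, 0, 255).astype(np.uint8)
        r, c = divmod(k, 2)                             # quadrant position
        canvas[r * half:r * half + crop.shape[0],
               c * half:c * half + crop.shape[1]] = crop
    return canvas
```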

The model outputs standard planes with different Confidence values during each ultrasound scan. We therefore added a “Con-best.py” file to select the standard plane with the highest Confidence value from each video after the scan.
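A minimal sketch of what such a selection step might look like, assuming a YOLOv5 model loaded through the hub API (the file names are illustrative; the actual “Con-best.py” is not reproduced here):

```python
import cv2
import torch

model = torch.hub.load("ultralytics/yolov5", "custom", path="best.pt")

def best_plane(video_path):
    """Return the frame of a recorded scan whose top detection has the
    highest Confidence, plus that Confidence (a sketch)."""
    best_conf, best_frame = 0.0, None
    cap = cv2.VideoCapture(video_path)
    while True:
        ok, frame = cap.read()
        if not ok:
            break
        rgb = cv2.cvtColor(frame, cv2.COLOR_BGR2RGB)
        det = model(rgb).xyxy[0]        # columns: x1, y1, x2, y2, conf, class
        if len(det) and float(det[:, 4].max()) > best_conf:
            best_conf, best_frame = float(det[:, 4].max()), frame
    cap.release()
    return best_frame, best_conf
```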

Statistical analysis

Normalized confusion matrices were used to depict the classification results for the three types of patients. AUCs with 95% confidence intervals (CIs) of the four sonographer groups (junior, intermediate, and senior sonographers, and junior sonographers with AI-assistance) and the three AI models (Faster R-CNN [https://github.com/rbgirshick/py-faster-rcnn], YOLOv5, and modified YOLOv5) were compared using the DeLong test in the “pROC” package. The Wilcoxon signed-rank test was used to compare the scanning times of the four observer groups and the median FPS of the three AI models, because the non-normality of these distributions was confirmed beforehand using the Kolmogorov–Smirnov test. The performances of the four sonographer groups were evaluated using accuracy, sensitivity, specificity, AUC, and Fleiss’ kappa. The shortest two-sided 95% CIs were reported for each experiment. Data were analyzed using R statistical software (version 4.1.1, R Core Team, 2021). P < 0.05 was considered indicative of a statistically significant difference.
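For reference, an illustrative Python sketch of the nonparametric comparisons on made-up toy numbers (the DeLong AUC comparisons and the shortest CIs were computed in R with pROC and are not reproduced here):

```python
import numpy as np
from scipy.stats import kstest, wilcoxon
from sklearn.metrics import roc_auc_score

# Toy paired scanning times (minutes): junior vs. junior with AI-assistance.
t_junior = np.array([9.1, 10.2, 8.7, 11.0, 9.8, 8.9, 10.6])
t_junior_ai = np.array([3.5, 4.0, 3.2, 4.3, 3.7, 3.9, 3.4])

# Screen the paired differences for non-normality, then test the difference.
print(kstest(t_junior - t_junior_ai, "norm"))   # Kolmogorov-Smirnov screen
print(wilcoxon(t_junior, t_junior_ai))          # Wilcoxon signed-rank test

# Per-group AUC from labels and predicted scores (toy values).
y_true = np.array([0, 1, 1, 0, 1, 0, 1])
y_score = np.array([0.2, 0.9, 0.7, 0.4, 0.8, 0.3, 0.6])
print(roc_auc_score(y_true, y_score))
```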