Introduction

Intervertebral Disc Herniation (IVDH) is a common and severe spinal disease in dogs, accounting for 2.3–3.7% of veterinary hospital admissions1,2,3. Its manifestation in canine patients varies, depending on the type and location of the herniation. Clinical symptoms can range from mild discomfort and pain to severe neurological deficits. In more severe instances, IVDH may cause muscle atrophy and paralysis of the hind limbs, significantly impacting the quality of life of the affected dogs2,4,5.

With the increasing availability of veterinary magnetic resonance imaging (MRI), the prospects for early detection of IVDH are promising6. However, a global shortage of radiologists skilled in veterinary MRI interpretation leads to diagnostic challenges7,8,9. While artificial intelligence (AI) and deep learning research has advanced IVDH detection in humans10,11,12,13,14, the anatomical differences between humans and animals, especially the smaller size of animal intervertebral discs, create unique challenges in adapting these methods for veterinary applications. Moreover, while a few studies15,16 have applied AI techniques to canine IVDH, their focus has predominantly been on image quality improvement15 or image-level disease classification16. The localization of IVDH lesions at the segment level, an arguably more challenging yet crucial task, remains largely unexplored.

Recent advances in deep learning-based object detection methods offer a way to address this gap. Object detection methods generally fall into two types: one-stage and two-stage detectors17. Two-stage methods, such as the influential Faster Region-based Convolutional Neural Network (R-CNN) method18, involve a two-step process: a Region Proposal Network (RPN) first identifies regions that may contain objects, and a detection network then classifies these regions and localizes them with refined bounding boxes. Although Faster R-CNN is indeed substantially “faster” than earlier two-stage methods, two-stage detectors still generally involve more computational steps and complexity than one-stage methods17.

In contrast, one-stage methods, such as the You Only Look Once (YOLO) algorithms19, adopt a more efficient approach. YOLO divides the image into a grid and predicts bounding boxes and class probabilities directly from this grid structure. Widely recognized for their high inference speed, these methods are particularly suitable for real-time applications on mobile devices. However, one-stage methods generally trade off some accuracy, especially in detecting smaller or more irregularly shaped objects, compared to their two-stage counterparts.

As one-stage detection methods continue to advance, they are increasingly seen as capable of matching the accuracy of two-stage methods20,21,22. However, there is still debate over whether state-of-the-art one-stage methods can fully replace two-stage methods, especially in specialized areas23,24. For IVDH detection on veterinary MRI, where the objects of interest, i.e., discs, are small and the difference between normal and herniated discs is subtle, the suitability of one-stage methods remains an open question.

To address the above research gap, this study aims to investigate the feasibility and methodology of AI-assisted detection of IVDH, with a specific focus on pet dogs. Our primary hypothesis is that simply adopting the latest object detection models from the computer vision field may not be the optimal solution for accurately detecting IVDH lesions in the context of veterinary radiology. Here, “accuracy” can be quantitatively assessed by the Average Precision (AP) metric. Our experiments reveal that more traditional two-stage detection models outperform more popular one-stage models in terms of IVDH detection accuracy. Furthermore, we propose a novel spine localization module and demonstrate that it can be used to enhance the IVDH detection accuracy of various models. Lastly, we show that it is possible to adapt the IVDH detection model to pet cats via transfer learning, potentially broadening the applicability of the proposed method.

Materials and methods

Dataset compilation

From September 2019 to August 2022, our study collected 487 mid-sagittal plane MRI images from 213 pet dogs. All pet owners were informed of the details of the study and signed a consent form before their dog participated in the experiments. All procedures were approved by the Ethics Committee of Shenzhen Technology University (reference number: SZTU20200208), and were carried out in accordance with relevant guidelines and regulations. All animal experiments complied with the ARRIVE guidelines (https://arriveguidelines.org). The dog samples represented a variety of breeds, sexes, ages, and weights. The most frequent breeds included Poodles (n = 55), Mixed breeds (n = 31), French Bulldogs (n = 18), Pomeranians (n = 16), and Welsh Corgis (n = 13). The age range of the dogs was from 0.3 to 18 years (mean value = 5.62, standard deviation = 3.94), and their weights varied from 1.5 to 46 kg (mean value = 9.45, standard deviation = 7.83).

MRI data acquisitions were performed on a super-conductive animal MRI scanner (1.5 T vPetMR, GSMED) at a local veterinary hospital (Pet Burgh YangZi Pet Hospital). A multi-slice 2D T2-weighted fast spin-echo sequence was used with the following imaging parameters: repetition time (TR) = 2895 ms, echo time (TE) = 110 ms, matrix size = 256 × 384, and slice thickness = 3.5 mm. For all dogs, anesthesia was induced intravenously with propofol (2.5 mg/kg), and maintained with inhaled isoflurane at a 1.5% concentration during imaging. For simplicity, only T2-weighted sagittal MRI images were used in this study.

The MRI images were initially in Digital Imaging and Communications in Medicine (DICOM) format and were later converted to .bmp files. Two veterinary radiologists interpreted the images and marked the spine region and intervertebral disc herniation (IVDH) lesions using the labelMe software (https://github.com/labelmeai/labelme). One radiologist holds a bachelor's degree in veterinary medicine and has 7 years of experience in veterinary radiology, while the other holds a master's degree in veterinary medicine and has 4 years of experience in veterinary radiology. Subsequently, the dataset was divided randomly into a training set (50%, 106 dogs) and a test set (50%, 107 dogs). Figure 1 shows the distributions of the subject weights, ages, and the number of annotations per subject. Out of the 213 dogs, 139 (64 in the training set and 75 in the test set) had at least one IVDH lesion, while 74 (42 in the training set and 32 in the test set) had none.
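As a rough illustration of this preprocessing step, the sketch below converts one DICOM slice to an 8-bit .bmp file and performs a random subject-level 50/50 split; the function names, directory layout, and random seed are hypothetical and not part of the original pipeline.

```python
import random
from pathlib import Path

import numpy as np
import pydicom
from PIL import Image


def dicom_to_bmp(dicom_path: Path, bmp_path: Path) -> None:
    """Convert a single DICOM slice to an 8-bit grayscale .bmp image."""
    pixels = pydicom.dcmread(str(dicom_path)).pixel_array.astype(np.float32)
    # Rescale intensities to the 0-255 range before saving as 8-bit .bmp.
    span = max(float(pixels.max() - pixels.min()), 1e-6)
    pixels = 255.0 * (pixels - pixels.min()) / span
    Image.fromarray(pixels.astype(np.uint8)).save(str(bmp_path))


def split_subjects(subject_ids, seed=0):
    """Randomly split subjects (not individual images) into 50% training and 50% test sets."""
    ids = sorted(subject_ids)
    random.Random(seed).shuffle(ids)
    half = len(ids) // 2
    return ids[:half], ids[half:]
```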

Figure 1

Distribution of subject ages, weights, and annotations per subject of the compiled canine IVDH dataset. The dataset represented a variety of breeds, sexes, ages, and weights, and was divided randomly into a training set (50%, 106 dogs) and a test set (50%, 107 dogs).

Proposed methodology

Figure 2 presents an overview of our proposed IVDH detection workflow where a coarse-to-fine strategy is employed. After obtaining annotations of the spine and IVDH lesions on MRI images, a preprocessing model, termed the spine localization module, is first trained to identify the spine regions. Once the spine regions are detected, they are cropped from the full-sized images, effectively eliminating irrelevant background tissues. The IVDH detection model (IVDH detection module) is subsequently trained on these cropped images, providing bounding boxes and confidence scores for IVDH lesions.

Figure 2

Illustration of the proposed IVDH detection workflow using deep learning models. After obtaining annotations of the spine and IVDH lesions on MRI images, the spine localization module is trained to effectively eliminate irrelevant background tissues. The IVDH detection model (IVDH detection module) is subsequently trained on the cropped images, providing bounding boxes and confidence scores for IVDH lesions. The inference (test) phase is similarly performed with this coarse-to-fine strategy.

During testing, the spine localization module first processes the raw images to identify the spine region and crop images. The cropped image is then fed into the IVDH detection module, which locates the IVDH lesions with bounding boxes and provides a confidence score for each detection.
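To make the coarse-to-fine inference flow concrete, a minimal sketch is given below; the detector interfaces, the crop margin, and the box format are illustrative assumptions rather than the exact implementation used in this study.

```python
import numpy as np


def detect_ivdh(image, spine_detector, ivdh_detector, margin=10):
    """Coarse-to-fine inference: locate the spine, crop it, then detect IVDH lesions.

    Both detectors are assumed to return (boxes, scores), with boxes given as
    [x1, y1, x2, y2] arrays in pixel coordinates.
    """
    # Coarse stage: keep the most confident spine box.
    spine_boxes, spine_scores = spine_detector(image)
    x1, y1, x2, y2 = spine_boxes[int(np.argmax(spine_scores))].astype(int)

    # Crop the spine region (with a small safety margin) to remove background tissue.
    h, w = image.shape[:2]
    x1, y1 = max(x1 - margin, 0), max(y1 - margin, 0)
    x2, y2 = min(x2 + margin, w), min(y2 + margin, h)
    crop = image[y1:y2, x1:x2]

    # Fine stage: detect IVDH lesions inside the cropped spine region.
    lesion_boxes, lesion_scores = ivdh_detector(crop)

    # Map lesion boxes back to the coordinate frame of the original full-sized image.
    lesion_boxes = lesion_boxes + np.array([x1, y1, x1, y1])
    return lesion_boxes, lesion_scores
```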

Implementation details

Since the spine detection task is relatively simple, we implemented only one model, Dynamic R-CNN25, for the spine localization module.

For the IVDH detection module, our experiments involved various well-known one-stage models, including YOLOv326, FCOS27, YOLOF21 and YOLOX20, as well as various two-stage models, including Faster R-CNN18, Cascade R-CNN28, Grid R-CNN29, Cascade Region Proposal Network (Cascade RPN)30, and Dynamic R-CNN25. These models were selected for their high impact in the field of computer vision. They encompass a broad spectrum from earlier methods such as Faster R-CNN (proposed in 2015) and YOLOv3 (proposed in 2018) to more recent approaches like Dynamic R-CNN (proposed in 2020) and YOLOX (proposed in 2021). Additionally, the YOLOX model was implemented in both a small (YOLOX-S) and a large (YOLOX-L) version.

We used the mmdetection framework31 to implement these models. Following standard configurations, the FCOS, YOLOF, and all two-stage models utilized ResNet5032 backbones pretrained on ImageNet33, and were trained on the IVDH dataset for 100 epochs. The YOLOv3 and YOLOX models, which do not have official ResNet50 backbones, had different configurations: YOLOv3 used a DarkNet5334 backbone pretrained on ImageNet, and was trained on the IVDH dataset for 273 epochs; YOLOX models used CSPDarkNet35 backbones without pretraining and were trained on the IVDH dataset for 300 epochs. Note that it is a common practice not to pretrain the backbones of YOLOX20. For comparative purposes, we also trained IVDH detection models using a more straightforward, end-to-end approach, i.e., directly on the full-sized images. All model training and testing were conducted on an Ubuntu 22.04 server equipped with two NVIDIA RTX 3090 GPU cards.
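As an illustration of how such a model can be configured in the mmdetection framework, a minimal config sketch for a two-stage detector is shown below; the base config path, annotation file names, and image directories are placeholders and do not correspond to the exact files used in this study.

```python
# Hypothetical mmdetection-style config sketch (all file paths are placeholders).
_base_ = './faster_rcnn_r50_fpn_1x_coco.py'  # ResNet50 backbone pretrained on ImageNet

# A single foreground class: the IVDH lesion.
model = dict(roi_head=dict(bbox_head=dict(num_classes=1)))

data = dict(
    train=dict(ann_file='annotations/ivdh_train.json', img_prefix='images/train/'),
    val=dict(ann_file='annotations/ivdh_test.json', img_prefix='images/test/'),
    test=dict(ann_file='annotations/ivdh_test.json', img_prefix='images/test/'),
)

# Two-stage models were trained for 100 epochs, as described above.
runner = dict(type='EpochBasedRunner', max_epochs=100)
evaluation = dict(metric='bbox')
```

Training and testing can then be launched with mmdetection's standard tools/train.py and tools/test.py scripts.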

Evaluation metrics

Two key evaluation metrics are employed to quantify the IVDH detection accuracy: the precision-recall (PR) curve and average precision (AP)36. Before defining the PR curve and AP, the Intersection over Union (IoU)36 must first be introduced, as it determines what counts as a true positive detection. IoU is calculated as follows:

$$\mathrm{IoU} = \frac{\mathrm{area}\left( B_{p} \cap B_{gt} \right)}{\mathrm{area}\left( B_{p} \cup B_{gt} \right)}$$
(1)

where \(B_{p}\) is the predicted bounding box and \(B_{gt}\) is the ground truth bounding box. The IoU evaluates how well the predicted bounding box overlaps with the ground truth box. In this study, we used an IoU threshold of 0.5, meaning that a predicted box must have an IoU of at least 0.5 with a ground truth box to be considered a true positive detection.
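For reference, Eq. (1) can be computed directly from box coordinates; a minimal implementation, assuming boxes in [x1, y1, x2, y2] format, is shown below.

```python
def iou(box_p, box_gt):
    """Intersection over Union of two boxes given as [x1, y1, x2, y2]."""
    # Coordinates of the intersection rectangle.
    ix1, iy1 = max(box_p[0], box_gt[0]), max(box_p[1], box_gt[1])
    ix2, iy2 = min(box_p[2], box_gt[2]), min(box_p[3], box_gt[3])
    inter = max(ix2 - ix1, 0) * max(iy2 - iy1, 0)

    # Union area = sum of both box areas minus the intersection area.
    area_p = (box_p[2] - box_p[0]) * (box_p[3] - box_p[1])
    area_gt = (box_gt[2] - box_gt[0]) * (box_gt[3] - box_gt[1])
    union = area_p + area_gt - inter
    return inter / union if union > 0 else 0.0


# A prediction counts as a true positive when iou(pred_box, gt_box) >= 0.5,
# the IoU threshold used throughout this study.
```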

Given a specific IoU threshold, a PR curve can be plotted as the graphical representation of model precision (the ratio of true positive predictions to the total positive predictions) and recall (the ratio of true positive predictions to all actual positive instances) at various confidence threshold levels. Then, the average precision (AP) score can be defined as the area under the PR curve:

$$AP = \int_{0}^{1} p(r)\,dr$$
(2)

In Eq. (2), p(r) is the precision as a function of recall r. The integral computes the area under the precision-recall curve from recall 0 to 1. In practice, since the PR curve is discrete, the AP score is calculated as the weighted mean of precisions achieved at each threshold, with the increase in recall from the previous threshold used as the weight37. The AP score is therefore a comprehensive metric that combines precision and recall, providing a holistic view of model performance in object detection tasks.
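A compact sketch of this discrete AP computation (precision weighted by recall increments) is given below; it assumes that each detection has already been matched to the ground truth at the 0.5 IoU threshold, and the function name and inputs are illustrative.

```python
import numpy as np


def average_precision(scores, is_true_positive, num_ground_truths):
    """AP as the weighted mean of precisions, weighted by the increase in recall.

    scores            : confidence score of each detection
    is_true_positive  : 1 if the detection matches a ground-truth box (IoU >= 0.5), else 0
    num_ground_truths : total number of ground-truth lesions in the test set
    """
    order = np.argsort(scores)[::-1]                # sort detections by descending confidence
    tp = np.asarray(is_true_positive, dtype=float)[order]

    cum_tp = np.cumsum(tp)
    precision = cum_tp / (np.arange(len(tp)) + 1)   # precision after each detection
    recall = cum_tp / num_ground_truths             # recall after each detection

    # Weight each precision by the recall increment since the previous threshold.
    prev_recall = np.concatenate(([0.0], recall[:-1]))
    return float(np.sum(precision * (recall - prev_recall)))
```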

Statistical analysis

Statistical analysis is conducted on the AP score difference between one-stage and two-stage methods, as well as on the impact of the spine localization module. A bootstrap-based hypothesis test method38 is employed. Specifically, the test set is resampled with replacement to its original size, and this process is repeated 10,000 times to create 10,000 bootstrap datasets. For each bootstrap dataset, the AP scores of the detection methods are recalculated, resulting in estimated AP score distributions. To assess the statistical significance of the observed AP differences, the bootstrap distributions of AP differences are shifted to have a mean of zero to approximate the null distributions38. Subsequently, the p-value is calculated as the proportion of samples with a value as large as or larger than the observed difference. Such hypothesis tests are performed between the top-ranked one-stage and two-stage methods, the mid-ranked one-stage and two-stage methods, and the bottom-ranked one-stage and two-stage methods. Additionally, hypothesis tests are conducted for each method before and after incorporating the spine localization module.
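The bootstrap procedure can be sketched as follows; the per-subject record structure and the AP-scoring callables are simplified placeholders for the actual evaluation pipeline.

```python
import numpy as np


def bootstrap_p_value(subjects, ap_method_a, ap_method_b, n_boot=10000, seed=0):
    """Shifted-null bootstrap test for the AP difference between two detection methods.

    subjects    : list of per-subject detection and ground-truth records (placeholder structure)
    ap_method_* : callables that compute an AP score from a list of subjects
    """
    rng = np.random.default_rng(seed)
    observed = ap_method_a(subjects) - ap_method_b(subjects)

    diffs = np.empty(n_boot)
    for i in range(n_boot):
        # Resample subjects with replacement to the original test-set size.
        sample = [subjects[j] for j in rng.integers(0, len(subjects), len(subjects))]
        diffs[i] = ap_method_a(sample) - ap_method_b(sample)

    # Shift the bootstrap distribution to mean zero to approximate the null distribution.
    null = diffs - diffs.mean()
    # p-value: proportion of null samples at least as large as the observed difference.
    return float(np.mean(null >= observed))
```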

Transfer learning to the feline IVDH detection

Considering that IVDH affects various animal species beyond dogs1,39,40, we conducted a pilot study extending our research to cats. A feline IVDH dataset was constructed, consisting of 111 images from 63 cats, using the same acquisition and annotation methods as for the canine dataset. This dataset was also collected from a local veterinary hospital with written consent from the pet owners, and was subsequently divided into training (n = 33) and test (n = 30) sets.

With limited training samples, this feline dataset was used to explore the adaptability of models across species. We focused primarily on the Dynamic R-CNN model and evaluated four training strategies: (1) directly applying the canine model without model retraining (no retraining), (2) retraining the model on the feline dataset (retraining on cats), (3) retraining the model on a combined dataset of both dogs and cats (retraining on dogs and cats), and (4) using the canine model weights as a starting point and fine-tuning on the feline dataset (transfer learning). For strategies (1), (2), and (3), the learning rate was set to 2.5 × 10−3. In contrast, for the transfer learning approach, the learning rate was 2.5 × 10−5. Other model parameters remained consistent across all four methods.
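A PyTorch-style sketch of the transfer learning setup (strategy 4) is shown below; the checkpoint file name and optimizer choice are illustrative assumptions, while the reduced learning rate mirrors the 2.5 × 10−5 setting described above.

```python
import torch


def build_finetune_optimizer(model, canine_checkpoint='dynamic_rcnn_canine.pth'):
    """Initialize from canine IVDH weights, then fine-tune on the feline dataset."""
    # Strategy 4: start from the canine Dynamic R-CNN weights (checkpoint name is illustrative).
    state = torch.load(canine_checkpoint, map_location='cpu')
    model.load_state_dict(state.get('state_dict', state))

    # Fine-tuning uses a learning rate 100x smaller than retraining from scratch
    # (2.5e-5 instead of 2.5e-3); other hyperparameters stay unchanged.
    return torch.optim.SGD(model.parameters(), lr=2.5e-5, momentum=0.9, weight_decay=1e-4)
```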

Results

Figure 3 illustrates typical results for canine IVDH detection without the use of the spine localization module. Due to space limitations, only four representative methods are presented: YOLOv3, YOLOX, Faster R-CNN, and Dynamic R-CNN. These methods are chosen as they represent the earliest and most recent advancements in one-stage and two-stage methods. The first three rows show typical examples where all models effectively identify and locate most of the IVDH lesions, albeit with some false positives or negatives. It is observed that two-stage models tend to have fewer incorrect predictions compared to one-stage models. The fourth row presents a more complex case, where the models are more likely to generate false positives outside the spinal area, emphasizing the need for developing a spine localization module.

Figure 3

Visualization of deep learning based IVDH detection without using the spine localization module. The green boxes are ground truth labeled by the radiologists, while the red boxes are detections by the deep learning models. (A–C) Typical detection results, where all models could locate most of the IVDH lesions annotated by the radiologists (green boxes), despite the fact that the one-stage models (YOLOv3 and YOLOX) resulted in more false negatives. (D) A more challenging case, where the models were prone to false positives outside the spinal area. A confidence score threshold of 0.4 was used for all methods.

Figure 4 shows typical spine localization results and the corresponding PR curve, demonstrating the high accuracy of the trained spine localization module, which achieves a notable AP of 99.8%. This makes the proposed spine localization a highly reliable and fully automatic preprocessing step. Table 1 compares the AP scores of all tested models, both with and without the spine localization module. The results show that two-stage models outperform one-stage models irrespective of the inclusion of the spine localization module. The incorporation of this module particularly benefits Faster R-CNN and Dynamic R-CNN, with AP score increases of 5.93% and 4.18%, respectively. The positive impact of the spine localization module is further visible in the PR curves shown in Fig. 5, where curves including the module generally surpass those without it at most recall levels.

Figure 4

Spine localization results using the trained deep learning model. (A) Visualization of spine localization results on nine dogs of different body sizes. The green boxes are ground truth labeled by the radiologists, while the red boxes are detections by the deep learning model. (B) The overall precision-recall curve of spine localization. The spine localization module was found to be highly accurate as a fully automatic preprocessing step, with average precision reaching 99.8%.

Table 1 Average precision (AP) of the trained models with and without spine localization. The two-stage models outperformed the one-stage models, and the spine localization module improved IVDH detection accuracy for nearly all models.
Figure 5

Precision-recall (PR) curves of the IVDH detection models with and without the spine localization module. The spine localization module led to higher detection accuracy at nearly all recall levels.

The statistical analysis reveals that two-stage models consistently achieve significantly better AP scores than their one-stage counterparts using a p-value threshold of 0.05. Specifically, without the spine localization module, the top-ranked two-stage model significantly outperforms the top-ranked one-stage model (Cascade RPN versus YOLOX-S, p < 0.05), and similar significant differences are observed for the mid-ranked (Grid R-CNN versus FCOS, p < 0.05) and bottom-ranked (Faster R-CNN versus YOLOF, p < 0.01) models. With the spine localization module, the top-ranked two-stage model again significantly outperforms the top-ranked one-stage model (Dynamic R-CNN versus YOLOX-L, p < 0.05), and the same pattern holds for the mid-ranked (Cascade RPN versus FCOS, p < 0.05) and bottom-ranked (Grid R-CNN versus YOLOF, p < 0.0001) models. Furthermore, the AP improvements brought by the spine localization module are statistically significant (p < 0.05) for four models: YOLOX-L, Faster R-CNN, Cascade R-CNN, and Dynamic R-CNN. In addition, the p-value for FCOS is 0.054, which is close to the significance threshold.

Figure 6A presents a typical image with annotations from the feline dataset. Figure 6B shows the precision-recall (PR) curves for the four training strategies evaluated on the feline dataset, with the corresponding AP scores. The results highlight the difficulty of training an IVDH detection model using only the limited feline data, which leads to a low AP score of 29.82%. The model trained exclusively with canine data yields satisfactory results, achieving an AP score of 63.40%, suggesting certain similarities between the anatomical structures of the two species. Interestingly, training on a combined dataset of both cats and dogs results in a slightly lower score of 60.53%. This could be due to the complexity of learning the distinct image features of both species simultaneously. In contrast, the efficacy of our transfer learning strategy is evident, leading to the highest AP score of 67.65%. This underscores the effectiveness of transfer learning in adapting models for successful application to smaller, species-specific datasets.

Figure 6

Pilot test to adapt the IVDH detection model to the feline dataset. (A) A typical image from the feline IVDH dataset. (B) Precision-recall (PR) curves obtained using the four training strategies. The transfer learning strategy led to the best PR curve and the highest average precision among the four strategies. Average precision scores (in percent) are shown in brackets.

Discussion

In this study, we explored the capability of AI-assisted intervertebral disc herniation (IVDH) detection in veterinary medicine. A key development is the use of a spine localization module in the preprocessing phase. This module not only effectively removes false positives from outside the spinal area but also concentrates the model's attention on the spine region. Our internal tests indicate that this approach is more effective than directly applying models to full-sized images and subsequently using a spine localization module for false positive removal. These results highlight that in medical applications, tailored preprocessing/postprocessing strategies can be essential for pursuing high accuracy.

Another key insight from our study is the superior performance of two-stage models. Despite the rising popularity of one-stage detectors, even the relatively old two-stage method Faster R-CNN outperforms the recent one-stage model YOLOX. Two-stage models, which first generate region proposals and then classify these regions, are more effective for handling small target lesions. Also, the relatively slow inference speed of two-stage models is not a concern in most medical imaging applications like ours, since the imaging process itself takes minutes, whereas running the two-stage model, even with the extra spine localization module, takes less than 1 s per subject. This finding cautions against the blanket application of the newest computer vision models to medical contexts without considering their suitability. Indeed, Fig. 7 shows that models with higher accuracy on COCO41, a natural image detection dataset, do not necessarily achieve higher accuracy on animal IVDH detection.

Figure 7

Detection accuracy on the COCO dataset versus on the canine IVDH dataset for various models. Because the COCO dataset contains multiple classes of objects, mean average precision (mAP) is used as its accuracy metric. Higher mAP on COCO does not necessarily lead to higher average precision (AP) on IVDH detection.

The detection accuracy achieved in our study (AP score up to 75.32%) generally falls below that reported for human IVDH detection; for example, AP scores of 89.3%14 and 92.4%12 have been demonstrated for human lumbar disc herniation. Several factors likely contribute to this discrepancy. Firstly, the absolute size of spinal segments in dogs and cats is small. Secondly, a large imaging field-of-view (FOV), typically covering over 16 spinal segments, was used in this study. A large FOV is necessary for veterinary MRI due to the varied anatomy of animals and their inability to communicate specific pain points. This, however, leads to relatively large voxel sizes and even fewer voxels per spinal segment. In contrast, similar human studies10,11,12,13 often focus on imaging only a few spinal segments and thus have substantially more voxels per segment. Finally, due to the absence of standardized diagnostic criteria for animal IVDH, the annotations on samples presenting borderline conditions can be subjective and somewhat inconsistent, which further complicates the training of AI models. These factors together highlight the existing challenges and the importance of further research in AI-assisted veterinary medicine.

Our study also has limitations that should be addressed in future research. The absence of evaluations for other diseases means our findings may not fully reflect clinical accuracy. Additionally, we did not differentiate between acute and chronic intervertebral disc herniations, which is an important aspect of clinical diagnosis. Lastly, it is important to acknowledge that all images in this study were obtained from a single institution. Future work should explore multi-institutional, multi-vendor datasets to ensure broader applicability and reliability in different clinical settings.

The development of automated segment-level lesion localization, as in this study, is not in competition with image-level spinal disease classification16; rather, it serves as a complement, providing more detailed spatial information and additional interpretability of the image-level classification results. Combining these two methods could lead to a more comprehensive diagnostic approach, where the high-level perspective of image-level classification is merged with the detailed insights from segment-level analysis. Such integration has the potential to significantly improve diagnostic accuracy and inform more precise treatment strategies. From another perspective, optimizing imaging protocols and enhancing image reconstruction quality15 can also play a crucial role in supporting further improvements of animal IVDH diagnosis. For example, refining MRI sequences and applying advanced reconstruction algorithms for better contrast and spatial resolution could lead to clearer visualization of IVDH lesions, thereby aiding AI models in more reliable detection. One promising direction here is to incorporate 3D imaging techniques42 that can provide thinner slices to reveal subtle structural changes.

In summary, this study has demonstrated the feasibility of AI-assisted intervertebral disc herniation detection for veterinary care and has explored various strategies in preprocessing, model design, and training to effectively improve detection accuracy. Our proposed strategies, supported by experimental results, offer valuable guidelines for future research. They emphasize the importance of creating methodologies that go beyond merely replicating the latest AI advancements, focusing instead on addressing the specific challenges and needs of the target domain.