Deep learning based diagnosis for cysts and tumors of jaw with massive healthy samples

Yu, Dan; Hu, Jiacong; Feng, Zunlei; Song, Mingli; Zhu, Huiyong

doi:10.1038/s41598-022-05913-5

Deep learning based diagnosis for cysts and tumors of jaw with massive healthy samples

Article
Open access
Published: 03 February 2022

Volume 12, article number 1855, (2022)
Cite this article

Download PDF

You have full access to this open access article

Scientific Reports

Deep learning based diagnosis for cysts and tumors of jaw with massive healthy samples

Download PDF

Dan Yu¹,
Jiacong Hu²,
Zunlei Feng²,
Mingli Song² &
…
Huiyong Zhu¹

2730 Accesses
12 Citations
1 Altmetric
Explore all metrics

Abstract

We aimed to develop an explainable and reliable method to diagnose cysts and tumors of the jaw with massive panoramic radiographs of healthy peoples based on deep learning, since collecting and labeling massive lesion samples are time-consuming, and existing deep learning-based methods lack explainability. Based on the collected 872 lesion samples and 10,000 healthy samples, a two-branch network was proposed for classifying the cysts and tumors of the jaw. The two-branch network is firstly pretrained on massive panoramic radiographs of healthy peoples, then is trained for classifying the sample categories and segmenting the lesion area. Totally, 200 healthy samples and 87 lesion samples were included in the testing stage. The average accuracy, precision, sensitivity, specificity, and F1 score of classification are 88.72%, 65.81%, 66.56%, 92.66%, and 66.14%, respectively. The average accuracy, precision, sensitivity, specificity, and F1 score of classification will reach 90.66%, 85.23%, 84.27%, 93.50%, and 84.74%, if only classifying the lesion samples and healthy samples. The proposed method showed encouraging performance in the diagnosis of cysts and tumors of the jaw. The classified categories and segmented lesion areas serve as the diagnostic basis for further diagnosis, which provides a reliable tool for diagnosing jaw tumors and cysts.

A Location Constrained Dual-Branch Network for Reliable Diagnosis of Jaw Tumors and Cysts

Differential diagnosis of ameloblastoma and odontogenic keratocyst by machine learning of panoramic radiographs

Article Open access 06 February 2021

Deep learning object detection of maxillary cyst-like lesions on panoramic radiographs: preliminary study

Article 19 September 2020

Introduction

Odontogenic cysts and tumors of the jaw are the second most common disease after tooth impaction in the oral and maxillofacial areas. The cysts and tumors of the jaw are usually painless and asymptomatic unless they grow so large as to involve the entire jawbone, causing noticeable swelling or weakening it to cause pathologic fractures^1,2. Those manifested symptoms pose a severe threat to patient life quality. The majority of these cyst and tumor lesions can be identified at an earlier stage through a routine radiographic exam called the panoramic radiograph or orthopantomogram³. The treatment modalities for different types of cysts and tumors are different. Keratocystic odontogenic tumors (KCOTs), like other cystic lesions, are usually enucleated without radical jaw segmentation. Ameloblastomas (ABs), on the other hand, require more radical surgical removal than KCOTs, which drastically affects patients’ lives, causing facial deformity and subsequent social and emotional incompetence^4,5.

Accurate diagnosis of different types of cysts and tumors is a challenging task. Some cysts and tumors have very similar radiological characteristics. In the orofacial region, accurate differentiation between various cystic lesions and ABs can be challenging. This is mainly due to their common presentations on conventional dental panoramic radiographs⁶. The main differentiating feature in diagnosing KCOT from ABs is the significant anteroposterior extension of the unilocular radiolucent lesion in the posterior mandible marrow space. KCOTs can sometimes also present as multilocular lesions, which further complicate its differentiation from ameloblastomas⁴. Misdiagnosis between these two lesions is therefore a common clinical pitfall. So, an auxiliary diagnostic method for cysts and tumors is significant.

Recently, deep learning approaches have achieved promising results in the medical image analysis area⁷. Inspired by the successful application of deep learning, several works^{8,9,10,11,12,13,14,15,16,17,18,19,20,21} adopted Convolutional Neural Network (CNN) to diagnose radiolucent lesions in the oral and maxillofacial area. Lee et al.¹¹ adopted deep convolutional neural networks to screen orthognathic surgery, where the Grad-CAM²² is adopted to visualize whether the deep learning AI model considered and evaluated the correct region. However, the performance of deep learning-based methods heavily relies on a large number of labeled datasets. Existing CNN-based methods^18,23,24 first pre-train the whole classification network on ImageNet²⁵, and then finetune the network on several hundreds of lesion samples. The domain difference between normal image dataset ImageNet²⁵ and medical radiographs is huge, which heavily reduces the robustness and performance of the above transfer learning-based methods^18,23. On the other hand, the explainability of auxiliary diagnostic methods is an essential factor for diagnosing cysts and tumors. However, existing deep learning-based methods lack explainability, which is a disadvantage for diagnosing cysts and tumors. Furthermore, sufficient labeled samples can effectively improve the performance of deep learning-based methods. However, collecting and labeling massive lesion samples are time-consuming and heavily relies on the professional doctor’s experience. On the contrary, collecting massive healthy panoramic radiographs is more accessible and does not require a professional doctor’s annotation.

Therefore, the aim of this study is to develop an explainable and reliable method to diagnose cysts and tumors of the jaw with massive panoramic radiographs of healthy people based on deep learning. We develop a two-branch framework for diagnosing cysts and tumors of the jaw, where the position consistency constraint between the segmentation results and the response maps of classification is adopted to improve the reliability and explainability of the predicted results. Experiments show that the proposed two-branch network can simultaneously predict the category and area of lesion samples, which can serve as the diagnostic reference for further diagnosis of doctors.

Related works

Recently, the deep learning technique has achieved promising results in tumor image analysis tasks¹², such as brain tumor image analysis¹³, breast tumor analysis¹⁴, and liver tumor analysis¹⁵. Inspired by the successful application of deep learning techniques, several works are proposed to diagnose radiolucent lesions in the oral and maxillofacial area, which can be divided into classification methods and detection methods.

For the former category, Poedjiastoeti et al.²³ adopted the VGG-16 network for classifying the ameloblastomas and KCOTs. The VGG-16 network is pretrained on the ImageNet dataset and finetuned with 400 image samples. Lee et al.¹⁸ adopted the pretrained GoogLeNet Inception-v3 architecture to classify odontogenic keratocysts, dentigerous cysts, and periapical cysts with 1140 panoramic and 986 CBCT images. For the latter category, Ariji et al.¹⁶ proposed the first object detection framework (a pre-trained fully convolutional network) to detect the lesion area and classify them. Kwon et al.¹⁷ developed a deep CNN modified from YOLOv3 for detecting and classifying odontogenic cysts and tumors of the jaw with 1282 panoramic radiographs. Yang et al.²⁶ adopted the YOLOv2 for detecting and classifying dentigerous cyst, odontogenic keratocyst (OKC), and ameloblastoma with 1603 panoramic radiograph samples.

However, due to limited lesion samples, most above deep networks are firstly pretrained on other datasets such as ImageNet, then are finetuned with the jaw panoramic radiographs. The domain differences have severe limitations on the robustness and performance of those pre-trained networks. What’s more, deep learning based diagnosis methods for cysts and tumors of the jaw have a potential deficiency noninterpretability, which severely constrains the application of existing deep learning based methods. In several works^11,23, the Grad-CAM²² technique is adopted to visualize the category-related areas, the reliability of which will be disturbed by the inaccurate prediction.

Materials and methods

Dataset

This study was conducted at the First Affiliated Hospital, Zhejiang University School of Medicine. Waiver of informed consent for data collection was approved by the Clinical Research Ethics Committee of the First Affiliated Hospital, Zhejiang University School of Medicine (IIT20200430A-R2). Since 2005, the World Health Organization (WHO) has labeled OKCs as keratocystic odontogenic tumors (KCOTs) and has classified OKCs as tumors according to their behavior. Based on histopathological examinations by a board-certified oral pathologist at First Affiliated Hospital, Zhejiang University School of Medicine, we collected 10,000 panoramic radiographs of healthy peoples and 872 lesion samples, which contains 356 dentigerous cysts (DC) samples, 292 periapical cysts (PC) samples, and 94 ameloblastoma (AB) samples, 130 keratocystic odontogenic tumor (KCOT) samples. Those samples were acquired between December 2018 and February 2020. Even if histologically confirmed, all difficult to distinguish cases because of the severe distortion, artificial noise, blur, and poor quality in the radiographic image were excluded. For each lesion sample, an experimental dentist annotates the lesion area mask and lesion category. In our experiment, healthy panoramic radiographs are split into 9500 samples for pretraining and training, 300 samples for validation, and 200 samples for testing. For lesion samples, 70%, 20%, and 10% samples are used for training, validation, and testing. Figure 1 summarizes more details about the collected dataset.

Image preprocessing and augmentation

The size of the original panoramic radiograph is about 3000 × 1500, which is too large for the normal deep network. What’s more, through statistical calculation of lesion area position, we find that peripheral areas don’t contain lesions. So, we get the common center areas by throwing away useless peripheral areas, which can maintain lesion-related patches and remove useless parts as much as possible. In the experiment, the cropped patches are resized into 512 × 256. The data augmentation strategies we adopted include horizontal flipping, cut-and-pasting, and patch-covering based on the characteristics of medical images. The cut-and-pasting strategy denotes cutting the lesion area and pasting it on a healthy sample. The patch-covering denotes covering the lesion area and healthy area with a gray patch of lesion samples to augment lesion samples’ diversity. We survey lesion area size on all lesion samples, which gives the minimal and maximal size of the lesion area. For the lesion area of a lesion sample, the cover-patch size is randomly generated between the size of the lesion area and the maximal size. For the healthy area of a lesion sample, the cover-patch size is randomly generated between the minimal and the maximal sizes. In our experiment, a lesion sample will be covered with 20 patches, where half the patches are generated for covering the lesion area and the rest half the patches are used for covering the healthy area of the lesion sample. It’s worth noting that the patch-covering can generate very similar samples for the same lesion sample, which is an advantage for enhancing the reliability of predicted output’s interpretability.

Model architecture

The performance of deep learning-based methods highly relies on the number of training samples. The human can observe abnormity through mass observation of massive healthy samples. Inspired by the above fact, we propose a deep learning-based diagnosis method for cysts and tumors of the jaw with massive healthy samples. The proposed framework is composed of two parts: a self-supervised network and a two-branch network. The self-supervised network is adopted to learn basic knowledge from massive healthy samples. Then, the knowledge of the self-supervised network is used in the two-branch network by replacing the encoder of the two-branch network with the pretrained encoder of the self-supervised network.

For improving the reliability and explainability of diagnosis results, the two-branch network is devised to be composed of a classification sub-branch and segmentation sub-branch. The segmentation sub-branch will predict the lesion area, which can serve as the diagnosis reference for dentists and oral surgeons to diagnose the jaw tumors and cysts further.

In the experiment, the self-supervised network we adopted is MoCoV2²⁷. We adopted Unet²⁸ as the segmentation sub-branch. The classification sub-network, the self-supervised network, and the segmentation sub-branch share the same encoder. The remaining part of the classification sub-branch contains an average pooling layer, 2048 fully connection layers. The two-branch network architecture is given in Fig. 2.

Model training and inference

In the experiment, the whole model is trained in two stages. The self-supervised network is firstly trained on 9500 healthy samples with the default parameter setting in the work of Chen et al.²⁷. Then, the pre-trained encoder is used to initialize the encoder of the two-branch network. Next, the classification sub-branch and segmentation sub-branch are trained with CrossEntropy loss ${L}_{CE}$ and Mean Squared Error on 872 lesion samples and 500 healthy samples as follows:

$${L}_{CE}=\frac{1}{K}\sum_{k=1}^{K}{y}_{k}\;{\text{log}}\;{p}_{k},$$

$${L}_{MSE}=\frac{1}{K}\sum_{k=1}^{K}{\Vert \overline{{M }_{k}}-{M}_{k}\Vert }_{2}^{2},$$

where, $K$ is the number of training samples, ${y}_{k}$ and ${p}_{k}$ denote the ground-truth and predicted probability of $k$-th sample, ${M}_{k}$ and $\overline{{M }_{k}}$ denote the ground-truth and predicted mask of the $k$-th sample. For the classification and segmentation sub-branches, the learning rates are ${1e}^{-3}$ and ${1e}^{-2}$, respectively. In the training stage, the weights for the classification loss ${L}_{CE}$ and segmentation loss ${L}_{MSE}$ are set to 1:1. 872 lesion samples are composed of 648 cyst samples (DCs: 356, PCs:292) and 224 tumor samples (ABs: 94, KCOTs: 130). Healthy samples used in the training stage are also used in the pretrain stage.

To improve the reliability of the predicted results, we adopted the annotated segmentation mask to constrain the consistency between the segmentation results and classification results. Grad-CAM²² can visualize the high response to the final predicted probability. For the lesion samples, the feature of the lesion area should have a major contribution to the final classification, while the healthy areas should have no contribution. So, we adopted the annotated lesion mask to constrain the lesion area and healthy area with high gradient responses and no gradient responses regarding the final classification label. The constrain ${L}_{constrain}$ is implemented by maximizing the responses around the lesion areas and minimizing the responses in the unrelated background area as follows:

$${L}_{constrain}=\frac{1}{K}\sum_{k=1}^{K}\left(\sum_{n=1}^{N}(1-\overline{{M }_{k}^{d}}\left[n\right])*{R}_{k}\left[n\right]-\sum_{n=1}^{N}\overline{{M }_{k}^{d}}\left[n\right]*{R}_{k}\left[n\right]\right),$$

where, $N$ is the multiplication of width and height of the last layer feature map, ${R}_{k}\left[n\right]$ is ${R}_{k}$, $\overline{{M }_{k}^{d}}$ denotes the dilated lesion mask with disk strel of radius d (a random value between 6 and 12). For the healthy samples, the constraint will be omitted. Only the two-branch network is adopted to classify the lesion category and segment the lesion area in the testing stage. We can get the predicted lesion category (DC, PC, AB, KCOT, and healthy) and the predicted lesion area mask for each input panoramic radiograph. Meanwhile, Grad-CAM²² can visualize the high response to the final predicted lesion category. The predicted lesion area mask and high response map visualized by Grad-CAM can be used as the diagnosis reference for the doctor to diagnose the cysts and tumors of the jaw.

Results

Lesion classification performance

In the experiment, the numbers of training, validation, and testing samples for each category are given in Fig. 1. The diagnosis of jaw cysts and tumors contains the binary classification and five-class classification. The five-class classification distinguishes the detailed category of the cyst, tumor, and healthy sample. Table 1 shows the five-class classification performance. The proposed method’s average accuracy, precision, sensitivity, specificity, and F1 score are 88.72%, 65.81%, 66.56%, 92.66%, and 66.14%, respectively. Our method achieves better classification performance than the existing three methods. Figure 3 shows the ROCs and AUC scores of the five-class classification, where we can see that the AUC scores of DCs, PCs, ABs, KCOTs, and healthy samples are 0.83, 0.81, 0.81, 0.82, and 0.84, respectively.

Table 1 The five-class classification performance of cysts, tumors and healthy samples.

Full size table

Setting the ground-truth label as binary allows the two-branch network to be changed into the classifier for lesion and healthy samples. In the binary classification setting, the classification branch only classifies the lesion samples and healthy samples. Table 2 gives binary classification results of our method and other three works (Ariji et al.¹⁶, Kwon et al.¹⁷, and Yang et al.²⁶). For our method, lesion and healthy samples both achieve 90.66% accuracy, which is higher than the average accuracy score of the five-class classification. Our method still achieves better binary classification performance than the existing three methods. Figure 4 shows the ROCs and AUC scores of the binary-class classification, where we can see that the AUC scores of the lesion and healthy samples are both 0.89.

Table 2 The binary classification performance of lesion and healthy samples.

Full size table

Lesion area segmentation performance

Except for the classification performance, we give the segmentation performance of lesion samples. Furthermore, detection results of lesion areas are calculated by comparing the bounding boxes of predicted lesion masks between the ground-truth bounding boxes. Table 4 gives the segmentation and detection results of different lesion categories. For the binary classification and segmentation network, the segmentation and detection performance of lesion samples are given in Tables 5 and 6, where 85% of lesion samples can be detected by the proposed two-branch networks.

Explainable results

Deep learning-based methods usually lack explainability, which is the primary drawback of deep learning-based methods. Medical image analysis requires that the predicted results are reliable and explainable. The proposed method can simultaneously predict the lesion category and area, increasing the reliability and explainability of the predicted results. Meanwhile, a position constraint is proposed to constrain the consistency between the segmented results and the response map of classification in the proposed method. Figure 5 gives the visual results of the original input, segmentation results, response map w/o the constraint, and response map with the constraint. The response map is visualized by the Grad-CAM²². For response maps w/o constraint, there are some inaccurate response areas, which will disturb the final classification and diagnosis. The response maps with the constraint have more concentrated response values than response maps w/o the constraint. There are some slight responses in the left part of the image in the first row and the last column. The contrast between the slight responses and the high responses is large. The constrain can reduce the disturbance of unrelated responses dramatically. With the accurate segmentation results and response maps as references, doctors can further confirm the diagnosis results.

Discussion

From Table 1, we can see that all cysts and tumors have superior accuracy and specificity. What’s more, cysts (DCs and PCs) achieve higher sensitivity/recall scores than tumors (ABs and KCOTs), which means that tumors are more likely to be misclassified. From Tables 3 and 4, we can see that cysts (DCs and PCs) have better segmentation and detection performance than tumors (ABs and KCOTs), which means that cysts have easily identifiable features for the deep model. This is consistent with the classification performance in Table 1 that cysts (DCs and PCs) achieve higher sensitivity/recall scores than tumors (ABs and KCOTs).

Table 3 The segmentation performance of different cysts and tumors.

Full size table

Table 4 The detection performance of different cysts and tumors.

Full size table

Tables 1 and 2 show that the binary classification achieves higher accuracy (90.66%) than the average accuracy score (88.72%) of the five-class classification, which means that the binary classification network is more suitable for distinguishing lesions from healthy samples. In Table 2, healthy samples achieve lower sensitivity/recall scores than lesion samples, which indicates that part of healthy samples tends to be classified as lesion samples. Furthermore, Tables 5 and 6 show that only distinguishing the lesion samples from the healthy samples can achieve more accurate segmentation and detection performance. Lesion samples have higher sensitivity/recall scores than healthy samples. For the healthy samples misclassified as lesion samples, the doctor can further verify the diagnosis results. This is an advantage of the binary classification network.

Table 5 The detection performance of lesion samples for the binary classification task.

Full size table

Table 6 The segmentation performance of lesion samples for the binary classification task.

Full size table

In total, the binary classification achieves about 5–10% improvement than the five-class classification. We find that most misclassified lesion samples are classified into other kinds of lesions through statistics of misclassified samples. The tumors and cysts are easily misclassified, which is consistent with the clinical diagnosis. Odontogenic tumors and cysts do not reveal their distinct radiological characteristics until they reach a certain size. Early radiological appearances of odontogenic cysts and tumors are so indistinguishable from each other that even experienced oral and maxillofacial specialists are unable to guarantee their diagnosis results. In consideration of the better performance of the binary classification network, the results of the binary classification network can be used as the primary diagnostic reference. The predicted results of the five-class classification can be used for further diagnosis references. In clinical diagnosis, overall consideration of predicted binary and five-class classification networks results will achieve more reliable results.

Deep learning-based methods have achieved promising results in the medical image analysis area^7,8,9. However, the deep learning-based methods have a severe deficiency that the inference process and predicted results are not unexplainable. Medical image analysis is a special scene that requires the diagnosis results have high reliability and explainability. For increasing the reliability and explainability, we add the segmentation branch in the proposed method. Meanwhile, the proposed position constraint, which constrains the consistency between the segmented results and the response map of classification, also improves the reliability and explainability of the predicted results. Figure 5 intuitively visualize the segmentation and response map results. The segmentation and response maps are an essential reference for the further diagnosis of doctors, which is the advantage of the proposed two-branch network. There are two factors for increasing reliability and explainability. Firstly, the segmented result of the lesion sample can be used as the diagnostic basis for the doctor to make further verification. Secondly, the patch-cover strategy is adopted to cover the random area of the lesion sample, which can increase the reliability of the prediction. For the lesion sample, the network should predict it as healthy if the lesion area is covered with a patch. On the contrary, the lesion sample is still predicted as a lesion if only the healthy area is covered with a patch. From Fig. 5, we can see that the lesion areas are accurately segmented. Table 7 shows that the classification performance will drop by about 5% accuracy without the segmentation branch, which indicates that the segmentation improves the explainability and the classification performance.

Table 7 The performance without pretrain and segmentation.

Full size table

Another deficiency of deep learning-based methods is that the performance of deep learning-based methods is very dependent on massive samples. However, collecting and annotating massive lesion samples is time-consuming and relies on the specialized knowledge of doctors. In this paper, we proposed a deep learning-based diagnosis method for cysts and tumors of the jaw with massive healthy samples. Table 7 gives the performance without pretrain on massive healthy samples, where we can see that the classification results will drop by about 13% accuracy without the pretrain on massive healthy samples. Like humans can learn prior knowledge from the normal samples, it is verified that deep learning-based methods can also learn some necessary knowledge from healthy samples. It’s an inspiration for further study on deep learning-based medical image analysis.

Overall, with massive healthy samples, the two-branch network achieves promising results for diagnosing cysts and tumors of the jaw. Except for the predicted categories, the two-branch network provides the segmentation results of lesion samples, which significantly improves the reliability and explainability of results predicted by deep learning-based methods. However, the proposed method can’t give the symptom causes why the lesion sample is classified as the specific lesion category, which is a significant research direction. In the future, we will focus on mining the symptom features of jaw cysts and tumors by adding attention mechanism.

Conclusion

The cysts and tumors of the jawbone are usually painless and asymptomatic, which poses a serious threat to patient life quality. Proper and accurate detection at the early stage will effectively relieve patients’ pain and avoid radical segmentation surgery. Similar radiological characteristics of some cysts and tumors pose a severe challenge for the accurate diagnosis of cysts and tumors.

In this paper, we propose a deep learning-based method for diagnosing the cysts and tumors of the jaw. Unlike existing transfer learning-based methods, our proposed method can achieve promising diagnosis performance with massive healthy samples. We firstly collect 872 lesion panoramic radiographs and 10,000 healthy panoramic radiographs. Some data augmentation strategies are adopted to increase the diversity of training samples. Then, an encoder is pretrained on those massive healthy panoramic radiographs with self-supervised learning. Next, based on the pretrained encoder, a two-branch network is devised to classify the lesion category and segment the lesion area simultaneously. In the two-branch framework, the segmentation sub-network can effectively improve the classification performance and enhance the model’s explainability, which is advantageous for doctors to confirm the diagnosis result further. Further, the location consistency constraint is devised for constraining the consistency of predicted results between the segmentation sub-network and classification sub-network, which can effectively enhance the reliability and explainability of models. Exhaustive experiments demonstrate that the deep learning-based method achieves excellent results. The segmentation results can be served as reliable references for further diagnosis. It provides an effective tool for diagnosing cysts and tumors of the jaw. It’s worth noting that the proposed consistency constraint can be extended to other medical analysis areas, such as breast cancer analysis, hepatocellular carcinoma grading, brain diseases diagnosis. The predicted results with the consistency constraint are interpretable, which is more suitable for real medical diagnosis applications. The experiment results verify that the pretraining way effectively relieves the deep learning-based diagnosis method from relying on massive lesion samples, inspiring for future medical diagnosis tasks. Furthermore, we will focus on studying more techniques for improving the explainability of the medical diagnosis model in the future.

Data availability

The dental panoramic radiographs in the dataset used to develop the method and analyze the findings of this study are not publicly available due to the restriction by the First Affiliated Hospital, Zhejiang University School of Medicine in order to protect patients’ privacy.

References

González-Alva, P. et al. Keratocystic odontogenic tumor: A retrospective study of 183 cases. J. Oral Sci. 50, 205–212 (2008).
Article PubMed Google Scholar
Meara, J. G., Shah, S., Li, K. K. & Cunningham, M. J. The odontogenic keratocyst: A 20-year clinicopathologic review. Laryngoscope 108, 280–283 (1998).
Article CAS PubMed Google Scholar
Choi, J.-W. Assessment of panoramic radiography as a national oral examination tool: Review of the literature. Imaging Sci. Dent. 41, 1 (2011).
Article PubMed PubMed Central Google Scholar
Ariji, Y. et al. Imaging features contributing to the diagnosis of ameloblastomas and keratocystic odontogenic tumours: Logistic regression analysis. Dentomaxillofac. Radiol. 40, 133–140 (2011).
Article CAS PubMed PubMed Central Google Scholar
Vinci, R., Teté, G., Lucchetti, F. R., Capparé, P. & Gherlone, E. F. Implant survival rate in calvarial bone grafts: A retrospective clinical study with 10 year follow-up. Clin. Implant Dent. Relat. Res. 21, 662–668 (2019).
PubMed Google Scholar
Apajalahti, S., Kelppe, J., Kontio, R. & Hagström, J. Imaging characteristics of ameloblastomas and diagnostic value of computed tomography and magnetic resonance imaging in a series of 26 patients. Oral Surg. Oral Med. Oral Pathol. Oral Radiol. 120, e118–e130 (2015).
Article PubMed Google Scholar
Altaf, F., Islam, S., Akhtar, N. & Janjua, N. K. Going deep in medical image analysis: Concepts, methods, challenges, and future directions. IEEE Access 7, 99540–99572 (2019).
Article Google Scholar
Pauwels, R., Araki, K., Siewerdsen, J. H. & Thongvigitmanee, S. S. Technical aspects of dental CBCT: State of the art. Dentomaxillofac. Radiol. 44, 20140224 (2015).
Article CAS PubMed Google Scholar
Lee, J. S. et al. Osteoporosis detection in panoramic radiographs using a deep convolutional neural network-based computer-assisted diagnosis system: A preliminary study. Dentomaxillofac. Radiol. 48, 20170344 (2018).
Article PubMed PubMed Central Google Scholar
Devlin, H. & Yuan, J. Object position and image magnification in dental panoramic radiography: A theoretical analysis. Dentomaxillofac. Radiol. 42, 29951683 (2013).
Article CAS PubMed PubMed Central Google Scholar
Lee, K. et al. Deep convolutional neural networks based analysis of cephalometric radiographs for differential diagnosis of orthognathic surgery indications. Appl. Sci. 10, 2124. https://doi.org/10.3390/app10062124 (2020).
Article CAS Google Scholar
Hu, Z. et al. Deep learning for image-based cancer detection and diagnosis—A survey. Pattern Recogn. 83(83), 134–149 (2018).
Article ADS Google Scholar
Liu, Z. et al. Deep Learning Based Brain Tumor Segmentation: A Survey. ArXiv: Image and Video Processing (2020).
Jannesari, M. et al. Breast cancer histopathological image classification: a deep learning approach. In 2018 IEEE International Conference on Bioinformatics and Biomedicine, 2405–2412 (2018).
Yu, X. et al. Tendentious Noise-rectifying Framework for Pathological HCC Grading, British Machine Vision Conference (2021).
Ariji, Y. et al. Automatic detection and classification of radiolucent lesions in the mandible on panoramic radiographs using a deep learning object detection technique. Oral Surg. Oral Med. Oral Pathol. Oral Radiol. 128, 424–430 (2019).
Article PubMed Google Scholar
Kwon, O. et al. Automatic diagnosis for cysts and tumors of both jaws on panoramic radiographs using a deep convolution neural network. Dentomaxillofac. Radiol. 49, 20200185 (2020).
Article PubMed PubMed Central Google Scholar
Lee, J.-H., Kim, D.-H. & Jeong, S.-N. Diagnosis of cystic lesions using panoramic and cone beam computed tomographic images based on deep learning neural network. Oral Dis. 26, 152–158 (2020).
Article PubMed Google Scholar
Vinayahalingam, S., Kempers, S., Limon, L., Deibel, D. & Xi, T. Classification of caries in third molars on panoramic radiographs using deep learning. Sci. Rep. 11, 1–7 (2021).
Article Google Scholar
Wang, M. & Chen, H. Chaotic multi-swarm whale optimizer boosted support vector machine for medical diagnosis. Appl. Soft Comput. 88, 105946 (2020).
Article Google Scholar
Pei, H., Yang, B., Liu, J. & Chang, K. Active surveillance via group sparse Bayesian learning. In IEEE Transactions on Pattern Analysis and Machine Intelligence, 3023092 https://doi.org/10.1109/TPAMI.2020.
Selvaraju, R. R. et al. Grad-cam: visual explanations from deep networks via gradient-based localization. In IEEE International Conference on Computer Vision, 618–626 (2017).
Poedjiastoeti, W. & Suebnukarn, S. Application of convolutional neural network in the diagnosis of jaw tumors. Healthc. Inform. Res. 24, 236 (2018).
Article PubMed PubMed Central Google Scholar
Kurita, Y. et al. Diagnostic ability of artificial intelligence using deep learning analysis of cyst fluid in differentiating malignant from benign pancreatic cystic lesions. Sci. Rep. 9, 1–9 (2019).
Article Google Scholar
Deng, J. et al. Imagenet: a large-scale hierarchical image database. In 2009 IEEE Conference on Computer Vision and Pattern Recognition, 248–255 (2009).
Yang, H. et al. Deep learning for automated detection of cyst and tumors of the jaw in panoramic radiographs. J. Clin. Med. 9, 1839 (2020).
Article PubMed Central Google Scholar
Chen, X., Fan, H., Girshick, R. & He, K. Improved baselines with momentum contrastive learning. arXiv preprint http://arxiv.org/abs/2003.04297 (2020).
Ronneberger, O., Fischer, P. & Brox, T. U-net: Convolutional networks for biomedical image segmentation. In Med. Image Comput. Comput. Assist. Interv. Soc. 234–241 (2015).
Mahdi, F. P., Motoki, K. & Kobashi, S. Optimization technique combined with deep learning method for teeth recognition in dental panoramic radiographs. Sci. Rep. 10, 1–12 (2020).
Article Google Scholar

Download references

Funding

This work is supported by the National Natural Science Foundation of China (No.62002318), the Major Scientific Research Project of Zhejiang Lab (No. 2019KD0AC01).

Author information

Authors and Affiliations

Department of Oral and Maxillofacial Surgery, The First Affiliated Hospital, Zhejiang University School of Medicine, 79# Qingchun Road, Hangzhou, 310003, People’s Republic of China
Dan Yu & Huiyong Zhu
Computer Science and Technology, Zhejiang University, 38# Zheda Road, Hangzhou, 310027, People’s Republic of China
Jiacong Hu, Zunlei Feng & Mingli Song

Authors

Dan Yu
View author publications
You can also search for this author in PubMed Google Scholar
Jiacong Hu
View author publications
You can also search for this author in PubMed Google Scholar
Zunlei Feng
View author publications
You can also search for this author in PubMed Google Scholar
Mingli Song
View author publications
You can also search for this author in PubMed Google Scholar
Huiyong Zhu
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

The authors contributed to this study as D.Y. collected and annotated all the dataset, and wrote manuscript; J.H. did the experiments and analysis; Z.F. developed the method; M.S. and H.Z. revised and edited final manuscript. All authors reivewed the manuscript. All authors have read and agreed to the published version of the manuscript.

Corresponding author

Correspondence to Huiyong Zhu.

Ethics declarations

Competing interests

The authors declare no competing interests.

Additional information

Publisher's note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Yu, D., Hu, J., Feng, Z. et al. Deep learning based diagnosis for cysts and tumors of jaw with massive healthy samples. Sci Rep 12, 1855 (2022). https://doi.org/10.1038/s41598-022-05913-5

Download citation

Received: 15 July 2021
Accepted: 14 January 2022
Published: 03 February 2022
DOI: https://doi.org/10.1038/s41598-022-05913-5
Springer Nature Limited

This article is cited by

Classification of Caries Based on CBCT: A Deep Learning Network Interpretability Study
- Surong Chen
- Yan Yang
- Jingzhi Ma
Journal of Imaging Informatics in Medicine (2024)
Accuracy of machine learning in the diagnosis of odontogenic cysts and tumors: a systematic review and meta-analysis
- Priyanshu Kumar Shrivastava
- Shamimul Hasan
- Deborah Sybil
Oral Radiology (2024)

Deep learning based diagnosis for cysts and tumors of jaw with massive healthy samples

Abstract

Similar content being viewed by others

A Location Constrained Dual-Branch Network for Reliable Diagnosis of Jaw Tumors and Cysts

Differential diagnosis of ameloblastoma and odontogenic keratocyst by machine learning of panoramic radiographs

Deep learning object detection of maxillary cyst-like lesions on panoramic radiographs: preliminary study

Introduction

Related works