Introduction

Mycobacterium tuberculosis (MTB) is the most common cause of mortality among infections of the respiratory tract, with a reported 1.4 million deaths in 2019 [1]. However, much attention has been focused on Mycobacterium tuberculosis lung disease (MTB-LD), which has led to underestimation of the prevalence of nontuberculous mycobacterium lung disease (NTM-LD) [2]. According to epidemiologic data, an increase in the prevalence of NTM-LD has been observed in the USA [3] and South Korea [4]; however, no large-sample NTM-LD epidemiologic studies have been conducted in China [5, 6]. The isolation rate of NTM in different parts of China is generally increasing [7,8,9], and this indicates that the prevalence of NTM-LD in China is on the rise [10]. The main route of transmission of MTB is through inhalation of aerosols from patients with active MTB-LD. When managing patients with MTB-LD, it is necessary to consider the control of aerosol-borne infection. On the other hand, NTM-LD is an opportunistic pathogen that is commonly found in the environment [11]; however, it rarely causes human diseases unless the host is immunocompromised [12]. Therefore, control of aerosol generation by patients with NTM-LD is unnecessary [13]. Considering these issues, accurate and timely diagnosis of the two diseases can contribute to the management of patients.

The differential diagnosis of MTB-LD and NTM-LD poses a challenge to clinicians. The purified protein derivative test is the most common screening method for patients with suspected MTB; however, this test cannot distinguish between MTB-LD and NTM-LD [14]. Furthermore, acid-fast bacilli (AFB) smears of both diseases show positive results [15], while the use of bacterial cultures and strain identification is time-consuming. In many tuberculosis-endemic areas, patients who have a positive AFB smear receive empirical anti-tuberculosis treatment while waiting for the result of bacterial culture. However, patients with NTM-LD will not benefit from this management and can have drug-related adverse effects and incur unnecessary medical expenses. The unresponsiveness of patients with NTM-LD to anti-tuberculosis medications indicates a difference in drug sensitivity between NTM-LD and MTB-LD [16]. Delayed diagnosis of NTM-LD leads to a longer treatment time, prolonged disease course, poor prognosis, and treatment failure. Therefore, accurate diagnosis determines correct treatment, which could help avoid unnecessary drug-related adverse reactions and medical expenses.

Chest radiography is a common screening method for inchoate tuberculosis; however, it cannot be used to distinguish between NTM-LD and MTB-LD [17]. Compared to chest radiography, chest computed tomography (CT) scan can provide more information and clearly show changes in lung areas including cavity formation, parenchymal lesion, bronchiectasis, and tree-in-bud patterns[18] (Fig. 1). However, these changes can be seen in both NTM-LD and MTB-LD; therefore, CT scan alone is insufficient in differentiating between both diseases [19]. Genetic studies [20, 21] have indicated that similarities in the manifestations and imaging results of the two diseases are caused by their similar gene strains. Bronchiectasis in the right middle lobe and left upper lobe lingual segment is considered an important feature of NTM-LD [22, 23]; however, this feature is not rare in patients with MTB-LD [24]. On the other hand, pleural effusion is rare in the CT images of patients with NTM-LD [25]. A 2014 study found that 13.3% of patients with NTM-LD had pleural effusion [26], which may lead to the misdiagnosis of NTM-LD. The difficulty in the distinguishing between the CT images of the two diseases is caused by factors including the experience of the physician, time, and effort [19, 22, 27].

Fig. 1
figure 1

a Correct diagnosis NTM-LD: female, 50 years old, hemoptysis for 3 days, bronchiectasis in right middle lobe, left upper lobe lingual segment and left lower lobe (arrows) with multiple central lobular nodules and tree-in-bud around. b Correct diagnosis MTB-LD: male, 24 years old, cough and expectoration for 6 days, thick-walled cavity (arrow) in right upper lobe with smooth inner wall and local calcification of cavity wall, adjacent pulmonary nodules and tree-in-bud around. c NTM-LD misdiagnosed as MTB-LD: female, 30 years old, cough and expectoration for more than one year, irregular consolidation in left upper lobe with bronchiectasis (arrow). d MTB-LD misdiagnosed as NTM-LD: male, 58 years old, cough, expectoration and fever for 2 weeks, multiple consolidation and bronchiectasis in right middle lobe, left upper lobe lingual segment and lower lobe of both lungs (arrows), and multiple nodules in right lung.

Although chest CT findings are not reliable for distinguishing between NTM-LD and MTB-LD [19], CT aids in diagnosing NTM-LD and MTB-LD before definite mycobacterial culture because of its great availability and short examination time [28,29,30]. CT images could show the extent and severity of lesion involvement, and evaluate the efficacy of medication through re-examination.

In recent years, artificial intelligence (AI) technology represented by machine learning has increasingly gained popularity in medical imaging. The use of AI technology is expected to meet the challenge posed by the discrimination of NTM-LD and MTB-LD by improving diagnostic accuracy, and sensitivity, leading to a reduction in the time taken to make the correct diagnosis. Three dimensions-residential network (3D-ResNet) is a classic deep learning framework that is highly recognised for its performance and efficiency. Previous studies have shown that 3D-ResNet achieved surprising results in various medical scenarios, such as recessive ischemic stroke recognition [25], Alzheimer’s disease classification [31], thoracic vertebral segmentation [32], and colonoscopic polyp detection [33], with an accuracy range of 0.83 from 0.97 in the test set. Additionally, 3D-ResNet has contributed significantly in the diagnosis of lung diseases, such as corona virus disease 2019 (COVID-19) diagnosis using CT scan or chest X-ray (CXR) [34, 35], malignancy risk classification of lung nodules using CT scan [36], non-small cell lung cancer [37], lung nodules detection [38], and lung CT image classification [39], with an accuracy ranging from 0.80 to 0.99 on the test set. Moreover, to the best of our knowledge, this is the first attempt that makes use of deep learning based on 3D-ResNet to distinguish between NTM-LD and MTB-LD. At the same time, we also supplemented a man–machine experiment to evaluate the clinical applicability of the deep learning framework.

In our study, a deep learning framework of NTM-LD and MTB-LD based on chest CT images was developed to assist radiologists in diagnosing these diseases early in order to allocate appropriate treatment strategies to patients with these diseases.

Materials and methods

Patients and dataset

This study retrospectively collected data from patients with pulmonary infection that were admitted to Tianjin Haihe Hospital (Tianjin University, China) between January 2014 and January 2020, who underwent non-contrast CT examination and were infected by nontuberculous mycobacterium (NTM) or MTB.

Pathogenic microbiological examination was used as the reference in the diagnosis of NTM or MTB infection. Sputum samples were collected, and an AFB stain was performed. Mycobacterium culture was grown using Löwenstein-Jensen medium. NTM species were identified using matrix-assisted laser desorption ionisation-time of flight (MALDI-TOF) mass spectrometry. Specifically, a diagnosis of NTM-LD was based on the treatment of nontuberculous mycobacterial pulmonary disease: an official ATS/ERS/ESCMID/IDSA clinical practice guideline (2020) [40], and a diagnosis of MTB-LD patients was based on the National Health Commission of the People’s Republic of China. In: diagnostic criteria for pulmonary tuberculosis (WS 288–2017) [41].

We excluded patients who were diagnosed with both diseases, patients with other lung diseases (including infectious diseases, tumours, and interstitial lesions), patients with a history of lung surgery, or patients with poor CT image quality caused by respiratory motion and metal artefacts.

A complete flow chart of the data collection is shown in Fig. 2. A total of 1105 patients were included in this study, comprising 301 patients with NTM-LD and 804 patients with MTB-LD. The patients were randomly divided into three datasets for training, validating, and testing of a deep learning model (3D-ResNet) at a ratio of 8:1:1. Clinical information was collected from each patient, including age, sex, and the presence of symptoms, such as chest pain, cough, expectoration, fever, chest tightness, haemoptysis, gasp, dyspnoea, chills, fatigue, night sweats, and weight loss. CT images and clinical manifestations were collected within 1 month before samples were collected for pathogenic microbiological examination.

Fig. 2
figure 2

Flow chart of included and excluded data

We also supplemented the experimental results of the external test set. Following the data inclusion criteria of this study, we retrospectively collected 40 NTM-LD patients and 40 MTB-LD patients that were admitted to Xi’an Chest Hospital (Shanxi, China) between September 2019 and March 2021 to form the external test set.

Image data

The patients in this study were scanned using two spiral CT scanners (BrightSpeed, GE, America and Aquilion Prime 128, Canon Medical Systems, Otawara, Japan) under the same protocol. Prior to scanning, patients assumed a supine position and were instructed to inhale maximally and hold their breath to ensure the accuracy of the data during breathing phase. Afterwards, both lungs were scanned from the apices to the bases. FOV was adjusted according to the patient’s body shape. The CT scanning parameters of BrightSpeed with in-plane size of 512 × 512 were as follows: 120 kV, automatic tube current, 0.75 s/r, collimator width = 16 × 0.625 mm, which were reconstructed based on the standard and lung algorithm (thickness of 1.25 mm, interval of 1.25 mm). The voxel size of BrightSpeed was 0.31–1.14 mm3. The CT scanning parameters of Aquilion Prime 128 with an in-plane size of 512 × 512 were as follows: 120 kV, automatic tube current, 0.5 s/r, collimator width = 64 × 0.5 mm, which were reconstructed based on the FC 30 and FC 52 algorithm (thickness of 1.0 mm, interval of 0.8 mm). The voxel size of Aquilion Prime 128 was 0.33–0.71 mm3. The voxel sizes of all cases are shown in detail in supplementary material SS5.

Meanwhile, in order to compare the man–machine efficiency in the diagnosis of NTM-LD and MTB-LD, the results of the pathogenic microbiological examination were used to train the deep learning net, and an independent test set (n = 106) was randomly selected. Double-blind reading was employed by three radiologists with different experiences (senior experience: 10 years, reader 1; middle experience: 5 years, reader 2; junior experience: 3 years, reader 3) to determine whether the obtained CT images showed changes consistent with NTM-LD or MTB-LD. The average reading time of the learning model and radiologists for a single CT data was recorded.

Deep learning algorithm

The 3D-ResNet is a three dimensions modified version of the traditional two dimensions-residential network (2D-ResNet)[42], where 2D convolution is replaced by 3D convolution and pooling operation is replaced by 3D pooling, while the BatchNorm and activation function (ReLu) remained unchanged. The development of the 3D-ResNet-based NTM-LD/MTB-LD classifier in this study is shown in Fig. 3. Considering computation constraints, we normalised all CT data to a fixed scale of 64 × 256 × 256. For data preprocessing, the window width and level were adjusted to [1600, − 600], and the image intensity was normalised to [0, 1]. During training, 3D translation, rotation, Gaussian blur, and Gaussian noise were employed as data augmentation. To account for the smaller z-direction size of the input, we designed the 3D-ResNet to only have three stages, which consisted of four, six, and nine residual blocks, respectively. As a result, the feature map outputs by the final stage were down-sampled three times with a 1/8 resolution of the original input. For model training, the batch size was set to eight, and the initial learning rate was set to 0.1, which decayed by half after every 1000 iterations. We trained the model on four GeForce RTX 2080 Ti GPUs with the PyTorch framework for 10,000 iterations at maximum and would stop the training if the loss on the validation set failed to improve for 300 consecutive iterations.

Fig. 3
figure 3

Flow chart of the study. The first part was the training of 3D-ResNet, including 3D image preprocessing and 3D-ResNet construction of NTM-LD/MTB-LD classification network. The second part was the evaluation of deep learning model and man–machine comparison

Performance of the deep learning model

To contextualise the performance of the deep learning model, we invited three radiologists to blindly identify the pathogen that infected the patients in the testing dataset by only reading the CT scans. These readers were informed that the patients were only infected by one type of pathogen, either NTM or MTB. The reading time was also recorded for each radiologist during the interpretation of each CT scan.

To quantify the performance of the deep learning model and human experts, we computed different measures for accuracy and one measure for efficiency. The accuracy measures included area under the receiver operating curve (AUC), accuracy, sensitivity, specificity, precision, and F1 score. Efficiency was the average duration it took to classify each patient.

The deep learning model was an end-to-end predictive model, which directly learned the abstract mapping between lung CT images and resolving NTM-LD/MTB-LD. In order to demonstrate the reasoning process of the deep learning model more visually, we used the class activation map (CAM) to show the suspicious lung area detected by the model and reveal its potential value in making a diagnosis of NTM-LD and MTB-LD.

The network structure for 3D-ResNet is shown in Fig. 4, and it consists of four, six, and nine residual blocks, respectively.

Fig. 4
figure 4

Network structure of 3D-ResNet

Statistical analysis

SPSS version 22.0 (IBM Corp., Armonk, NY) was used to analyse the data. To analyse the difference between the clinical characteristics of the NTM-LD and the MTB-LD groups, an independent sample t test was used for numerical variables, and Pearson’s chi-square analysis was used for categorical variables. This study was a two-tailed significance test, and p-values of < 0.05 were considered statistically significant. Python 3.0 (Python Software Foundation, https://www.python.org/) was used to draw receiver operating characteristic curve (ROC), confusion matrix, and data visualisation. The Delong test was used to analyse ROC.

Results

Basic clinical characteristics

A total of 1105 patients were included in this study, and this consisted of 301 patients in the NTM-LD group (age: 53.57 ± 17.21, 96 females, 205 males), and 804 patients in the MTB-LD group (age 47.11 ± 18.42, 214 females; 590 males). Table 1 displays the demographic characteristics of the two groups. Univariate analysis of 14 clinical characteristics using independent sample t test and Pearson’s chi-squared test revealed that there among the demographic characteristics, patients with NTM-LD were older than those with MTB-LD (p < 0.001). Patients with MTB-LD had more cough (p = 0.024), and those with NTM-LD had more dyspnoea (p = 0.011), and the results were statistically significant.

Table 1 Baseline characteristics of datasets

Performance and evaluation of deep learning model

Using random assignment, the training set consisted of 886 patients (244 NTM-LD patients and 642 MTB-LD patients), the validation set had 113 patients (29 NTM-LD patients and 84 MTB-LD patients), and the testing set had 106 patients (28 NTM-LD patients and 78 MTB-LD patients). Finally, we had a training dataset of 18,080 [(244 × 25) + (642 × 24)] images and a validation dataset of 2272 [(29 × 25) + (84 × 24)] images. The test set data remained unchanged without data augmentation.

The external test set contained 40 NTM-LD patients and 40 MTB-LD patients.

Training process and convergence of deep learning model

The loss curve of the model training is shown in Fig. 5a. The loss curve showed that when the iteration was at 0–4000 times, the model was under fitting; on the other hand, when the iteration was at 4000–6500 times, the model was in ideal training. Figure 5b shows the ROC and AUC of the classifier at different iterations on the validation set. The model trained on four GeForce RTX 2080 Ti GPUs with the PyTorch framework for 10,000 iterations, and the training time lasted for 3 h and 7 min. The 3D-ResNet trained for 6000 iterations was selected as the final classifier for testing.

Fig. 5
figure 5

a Loss curves of training set and validation set under different iterations. b ROC and AUC of the validation set under different iterations.

Diagnostic performances of the deep learning model

The diagnostic performances of the selected classifier on the training, validation, and testing sets are shown in Table 2. The model fitted the training set well (AUC = 0.90, accuracy = 0.87, sensitivity = 0.98, specificity = 0.56) with good generalisability for the validation set (AUC = 0.88, accuracy = 0.88, sensitivity = 1, specificity = 0.52). When evaluated on the testing set, the overall model performed well (AUC = 0.86, accuracy = 0.83, sensitivity = 0.92, specificity = 0.57). In the external test set from Xi’an Chest Hospital, the model performed well (AUC = 0.78, accuracy = 0.69, sensitivity = 0.75, specificity = 0.63).

Table 2 Performance of 3D-ResNet on testing set, validation set, training set, and external test set

The ROCs of our model for the four datasets are uniformly plotted in Fig. 6. In Table 2, we list the performance of our model for the NTM-LD group and the MTB-LD group. We defined MTB-LD as a positive sample and NTM-LD as a negative sample.

Fig. 6
figure 6

The ROCs of 3D-ResNet on training set, validation set, test set, and external test set

Deep learning model identified suspicious lung areas

The CAM analysis demonstrated that our model after training was responsive to the suspicious lung areas that exhibited changes, including ground glass opacity, consolidation, nodule, tree-in-bud, and pleural effusion, which were also identified by the radiologists. Figure 7 shows the suspicious lung areas identified by the deep learning model in 3 NTM-LD patients (a–c) and 3 MTB-LD patients (d–f). Although the input region of interest (ROI) included both pulmonary and non-pulmonary areas, such as muscle and bone, without any manual annotation, the deep learning model was consistently focused on the inner lung region for prediction and was less affected by other tissue regions.

Fig. 7
figure 7

CT images and class activation maps of 6 patients. a NTM-LD, consolidation with bronchiectasis in right middle lobe, fibrous cord in left upper lobe lingual segment. b NTM-LD, multiple bronchiectasis with central lobular nodules and tree-in-bud in right middle lobe and left lung. c NTM-LD, nodules and fibrous cord in right upper lobe, which sticked with pleura. d MTB-LD, consolidation in right upper lobe, right pleural effusion. e MTB-LD, consolidation and fibrous cord in right upper lobe, which sticked with pleura. f MTB-LD, multiple consolidation, nodules, tree-in-bud and ground glass opacity in left upper lobe

In addition, the deep learning model found that the suspected lung area overlapped highly with the actual inflammatory areas. Without manual annotation, the deep learning model could automatically focus on the changes in the bilateral lungs, such as consolidation, bronchiectasis, tree-in-bud, nodule, ground glass opacity, and fibrous cord, for identification and inference. We found high overlap and consistency between the radiologists and deep learning models. The results of the quantitative assessment of CAM and CT findings are presented in supplementary material SS2.

Comparison between man and machine

With the testing set, three radiologists with different experiences and the 3D-ResNet were used for man–machine comparison. The confusion matrix of the three radiologists and our model in distinguishing the NTM-LD from MTB-LD is shown in Fig. 8, and their diagnostic performance is detailed in Table 3. The discerning ability of the radiologists increased with experience, as shown by most accuracy metrics. For recalling NTM-LD, the 5-year experienced radiologist was not superior to the 3-year experienced radiologist (specificity = 0.39). The deep learning model performed better than all three radiologists in almost all the metrics, except for the recall of MTB-LD, where our model was equivalent to the senior radiologist (sensitivity = 0.92). The area under the ROC of our model was found to be significantly different from those of the three radiologists (Fig. 9) by the Delong test (p < 0.05). In addition, the deep learning model was significantly more efficient than the radiologists, requiring only 0.04 ~ 0.06 s/CT, which was nearly 1000 times faster than the radiologists.

Fig. 8
figure 8

Confusion matrix on the test set between the 3D-ResNet and three radiologists with different experiences (Reader1_10y, Reader2_5y, and Reader3_3y)

Table 3 Comparison of discriminant performance between 3D-ResNet and 3 radiologists in the test set
Fig. 9
figure 9

ROC of 3D-ResNet and three radiologists (Reader1, Reader2, and Reader3) in the test set

Discussion

In this study, we developed a deep learning method based on chest CT images that showed good accuracy in distinguishing between NTM-LD and MTB-LD. The automatic activation area of the deep learning model was highly consistent with the area determined by radiologists, which suggests that deep learning automatically detects abnormalities in the lungs (consolidation, bronchiectasis, tree-in-bud, nodule, ground glass opacity, fibrous cord). This diagnostic method had the advantage of not requiring manual drawing ROI. Additionally, our model achieved a higher diagnostic accuracy with a higher efficiency rate than the radiologists.

This study collected the basic clinical datas and clinical manifestations of all patients included in the study. We found that patients with NTM-LD were older than those with MTB-LD. This is consistent with the findings of a previous study that found that NTM-LD was more common in the elderly than in young people [43]. Chiang et al. [44] showed that the proportion of females infected with NTM-LD was significantly higher than that of males. However, there is no difference in gender between NTM-LD and MTB-LD in this study, and this is consistent with the findings of Kim [26]. It was found that only two clinical symptoms, dyspnoea and cough, showed significant differences between the two groups. In this study, more patients with NTM-LD had dyspnoea compared to patients with MTB-LD, and this was consistent with the results of Klann [45]. However, Kim [30] reported that dyspnoea was more common in patients with MTB-LD. Therefore, we believe that patient characteristics and clinical manifestations alone are insufficient in differentiating between NTM-LD and MTB-LD.

Sensitivity and specificity served as important indices to evaluate the performance of the deep learning model and the radiologists to help improve the detection of NTM-LD and MTB-LD. Inverse relationships were observed between sensitivity and the number of misdiagnosed patients with MTB-LD, and between specificity and the number of misdiagnosed patients with NTM-LD. Our model showed higher sensitivity and specificity than the radiologists, which indicated that our model had a lower misdiagnosis rate for NTM-LD and MTB-LD compared to that of the radiologists. Moreover, the model’s diagnostic speed was > 1000 times faster than that of the radiologists, which can be considered an automatic rapid diagnosis. Deep learning models, when calibrated, would have the natural advantages of stability, repeatability, and ease of migration. As more chest CT images from other institutions are supplemented for training and testing, our model is expected to be applied to different institutions and would achieve better generalising capabilities.

According to previous studies, the use of deep learning models in the diagnosis of MTB has promising results. Lakhani [46]used 1007 chest radiographs from four datasets to construct deep convolutional neural networks (DCNNs) combined with AlexNet and GoogLeNet to classify patients as healthy or with tuberculosis infection (AUC = 0.99). Hwang [47] developed a deep learning automatic detection (DAD) algorithm using 5421 normal CXRs and 6768 active pulmonary MTB CXRs. The sensitivity and specificity of the algorithm were 94.3% and 91.1%, respectively, and the classification performance was significantly higher than that of the 15 physicians included in their study. Hwang [48]also designed a convolutional neural network-based (CNN-based) automated MTB screening computer-aided diagnosis (CAD) system based on many CXR images with transfer learning across three datasets. Qin [49] compared three deep learning (DL) systems (CAD4TB, Luna Insight, and QXR) based on CXRs to detect MTB in outpatients in Nepal and Cameroon. Seok [50] used 1000 CXR images to compare the performance of the I-CNN and the D-CNN from both image and demographic variables, and the DCNN was found to be better. Tawsifur [51]used multiple public databases to build a database containing 3500 MTB infections and 3500 normal chest radiographs that used nine different depth CNNs (RESNET18, RESNET50, RESNET101, CHEXNET, INCEPtionV3, VG19, DENSENET201, SQUEEZENET, and MobileNet) for transfer learning. The accuracy, precision, sensitivity, F1 score, and specificity of chest X-ray images were 96.47%, 96.62%, 96.47%, 96.47%, and 96.51%, respectively.

There are some differences between our study and previous studies. First, our objective was to distinguish NTM-LD from MTB-LD based on CT images, since the CT manifestations of these two diseases were similar making differentiation difficult even for senior radiologists. Compared to previous studies, our objective was relatively more difficult; however, our model still performed excellently. Second, we compared our model to radiologists that have 10, 5, and 3 years of experience on the test set, and the results showed that 3D-ResNet was significantly better than all radiologists in terms of accuracy, sensitivity, specificity, and AUC, which indicates good clinical applicability. Finally, we were able to easily collect 1105 chest CT images, suggesting that the model can be popularised and used in multiple institutions equipped with CT equipment. Additionally, the reference used for the diagnosis of the diseases was pathogenic microbiological examinations which were more time-consuming. The less time of diagnosis of our model further signifies the advantages of using the model.

Our study had several limitations. First, in the data collection, patients who were receiving specific drug therapy were not excluded. No study has shown whether clinical manifestations and chest CT findings are affected by specific drug therapy. However, the CT data of patients who received or did not receive specific drug treatment are prevalent in clinical scenarios, and this suggests that our model is suitable for practical medical scenarios. Second, the ratio of NTM-LD cases to MTB-LD cases was close to 3:8, suggesting that the sample was not sufficiently balanced. Although during the image preprocessing stage of model training, we used data resampling, data enhancement, and weight adjustment of loss function for optimisation, the performance of the model did not improve significantly (p > 0.05) (Supplementary material SS4 shows this in detail). This is directly related to the fact that the detection ability of the model in MTB-LD cases was far greater than that in NTM-LD cases (F1 score: 0.89 vs. 0.64). A more balanced sample ratio (1:1) will train the classifier to have a balanced detection ability. This also suggests that it is of definitive significance to supplement more NTM-LD cases in order to improve the model’s ability to detect NTM-LD. Third, although the training, testing, and validation data were obtained from a single institution, we collected external test data for external validation of the model, and the model performed well; therefore, the addition of multicentre, multi-device, and multi-mode CT data will undoubtedly be of great significance to this study, which is also the direction of development of future work.

In conclusion, this study developed a personalised deep learning framework that uses conventional chest CT images to classify nontuberculous mycobacteria infection and Mycobacterium tuberculosis infection with greater accuracy, sensitivity, and specificity compared to those of radiologists. This model has the potential to be a screening tool for patients with pulmonary mycobacterium and will lead to early detection and selection of appropriate clinical treatment strategies.