Introduction

Chest radiography is a commonly used medical imaging and diagnostic technique for initial screening of patients due to its low cost and easy accessibility [1]. It plays an important role in detecting lung diseases such as lung cancer and tuberculosis [2, 3].

However, accurate interpretation of chest radiographs (CRs) can be challenging for doctors. Approximately 90% of missed lung cancer cases involve CR assessments [4], and the reported miss rate of lung cancers on CRs is 19–22% [5, 6]. The characteristics of abnormal lesions, such as size, conspicuity, and location, influence detection accuracy [4, 7]. Reader proficiency is another important factor: while expert observers establish specific scanning patterns for radiographs, non-expert observers generally search the radiograph without a systematic order, which can cause them to overlook subtle abnormal findings [4]. In Japan, doctors who do not specialize in pulmonology often read CRs in daily practice. Approximately 50% of the doctors who interpret screening CRs are not lung disease experts (e.g., non-pulmonology physicians) [8], and this may reduce the abnormality detection rate on CRs. Thus, there is a demand for detection-support tools for non-experts.

In recent years, computer-aided detection (CAD) systems that use deep learning algorithms have been developed [9, 10]. Some studies have shown that observer performance in detecting abnormal thoracic lesions is significantly better with CAD than without it [11, 12]. However, these studies used algorithms that the researchers developed from scratch; few studies have used software products developed by vendors. Several CAD software packages for CRs are already commercially available, and their diagnostic performance has been reported (e.g., EIRL X-ray Lung nodule, Lpixel, Tokyo, Japan). Moreover, few studies have analyzed which data characteristics affect the improvement in readers' performance and final diagnoses with CAD.

This study aimed to compare doctors' performance in interpreting CRs with and without CAD. We also examined the effect of readers’ experience and data characteristics on detection of abnormal pulmonary lesions with CAD. Lung nodules, masses, and consolidation on CRs were targeted because the CAD software used in this study was designed to detect these lesions.

Materials and methods

This retrospective multicenter study was approved by the institutional review board, and anonymized data were shared through a data-sharing agreement between the institutions. The requirement for written informed consent was waived because the data were collected retrospectively. This study was supported by Konica Minolta.

Data collection

Anonymized CRs (posteroanterior view) of 453 patients were retrospectively selected from two institutions in Japan. Only patients older than 19 years were included. Institution A, a university hospital, supplied the data of 238 patients who presented for physical examination between January 2012 and December 2019. Institution B, a health screening center, supplied the data of 215 patients who underwent routine health screening between January 2016 and December 2018. One CR image was acquired from each patient; thus, 453 images were collected. The images were acquired using the Aero DR system (Konica Minolta, Tokyo, Japan) or the FUJIFILM DR PRELIO U (Fujifilm, Tokyo, Japan) with a tube voltage of 120–130 kVp and a tube current-time product of 1–8 mAs. The only exclusion criterion was poor image quality; no CR was excluded on this basis. Pulmonary nodules/masses and consolidation on the CRs were considered abnormal findings, while extrapulmonary abnormal findings, such as cardiomegaly and rib fractures, were considered normal for the purpose of this study. Nodules and masses were defined as focal lung opacities with smooth borders, measuring ≤ 3 cm and > 3 cm in diameter, respectively. Consolidation was defined as lung opacities other than nodules and masses. A total of 194 images (43%) included abnormal findings, while 259 (57%) were normal. For the observer-performance test, 60 images with and 140 images without abnormal findings were randomly selected. Two board-certified radiologists (with 6 and 14 years of experience) reviewed the images and recorded the area, lesion type (nodule, mass, or consolidation), and degree of subtlety of each lesion by mutual agreement. The degree of subtlety was rated on a five-point scale as follows: level 1, extremely subtle (detection is extremely difficult); level 2, very subtle (detection is very difficult); level 3, subtle (detection is difficult); level 4, relatively obvious (detection is relatively easy); and level 5, obvious (detection is easy) [13].

Ground truth

All images were independently reviewed by three board-certified radiologists (with 14, 25, and 33 years of experience) to establish the ground truth. The radiologists confirmed the presence of abnormal findings on the images and marked the lesion locations. Areas annotated by at least two of the three radiologists with an intersection over union (IoU) greater than a finding-specific threshold were adopted as abnormal lesions. The thresholds for nodule/mass and consolidation were set at 0.5 and 0.0, respectively.
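
As an illustration of this consensus rule, the following Python sketch checks whether any pair of the three radiologists' bounding boxes overlaps with an IoU above the finding-specific threshold; the box representation and function names are our own assumptions, not the study's actual tooling:

    def iou(box_a, box_b):
        # Intersection over union of two axis-aligned boxes (x1, y1, x2, y2).
        x1 = max(box_a[0], box_b[0])
        y1 = max(box_a[1], box_b[1])
        x2 = min(box_a[2], box_b[2])
        y2 = min(box_a[3], box_b[3])
        inter = max(0.0, x2 - x1) * max(0.0, y2 - y1)
        area_a = (box_a[2] - box_a[0]) * (box_a[3] - box_a[1])
        area_b = (box_b[2] - box_b[0]) * (box_b[3] - box_b[1])
        union = area_a + area_b - inter
        return inter / union if union > 0 else 0.0

    def is_consensus_lesion(boxes, threshold):
        # Adopted if at least two of the three readers' marks agree, i.e.,
        # some pair has IoU above the threshold (0.5 for nodule/mass,
        # 0.0 for consolidation, where any overlap counts).
        return any(iou(a, b) > threshold
                   for i, a in enumerate(boxes)
                   for b in boxes[i + 1:])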

Software

CAD-Chest X-ray (Konica Minolta, Inc. and Enlitic, Inc.) software was used for this study. It is currently commercially available in Japan (Approval No. 30300BZX00271000) and was designed as a second-reader type of CAD. It automatically detects pulmonary nodules/masses and consolidation and marks the lesion areas.

Detection performance test of the CAD software

First, a performance test of the CAD software alone (standalone test) was conducted, in which all 453 images in the dataset were interpreted by the software alone. Then, an observer-performance test was performed to assess whether the software improves doctors' performance. This test had a sequential-only design and was conducted in accordance with the US Food and Drug Administration guideline [14]. The CRs of the 200 selected patients were interpreted by 12 doctors: three radiologists, three pulmonologists, three non-pulmonology physicians, and three junior residents, with 2–12 years of experience. The readers were blinded to the patients' clinical information, and the radiologists who defined the reference standard did not participate in the test. The test consisted of two sessions. In the first session, the readers were asked to determine whether each CR showed any nodule, mass, or consolidation; if any was present, the readers marked the center of the lesion. All procedures were performed without the CAD software. The readers were also asked to assign each annotation a continuous confidence score between zero and one. In the second session, the readers re-evaluated every CR with the assistance of the CAD software and could modify their original decisions and confidence scores.
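
Conceptually, each reader mark in this sequential design carries both an unaided and an aided decision; a minimal sketch of such a record (field names are illustrative assumptions, not the study's data schema):

    from dataclasses import dataclass
    from typing import Optional, Tuple

    @dataclass
    class ReaderMark:
        image_id: str
        lesion_center: Tuple[float, float]        # (x, y) marked by the reader
        confidence_unaided: float                 # session 1, in [0, 1]
        confidence_aided: Optional[float] = None  # session 2, revised with CAD

The JAFROC analysis described below compares the unaided and aided marks and confidence scores.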

Statistical analyses

In the standalone test, the sensitivity and specificity of the CAD software for detecting pulmonary nodules, masses, and consolidation were analyzed; per-lesion sensitivity and per-patient specificity were calculated. In the observer-performance test, the detection performances of the readers with and without CAD were compared. Jackknife alternative free-response receiver operating characteristic (JAFROC) analyses were performed using R statistical software version 4.0.2 (R Project for Statistical Computing, Vienna, Austria) and RJafroc version 2.0.1. Both the readers and the CRs were treated as random effects. The weighted alternative free-response receiver operating characteristic (wAFROC) figure of merit (FOM) score was used as the performance measure, with lesion weights equally divided by the number of lesions on each image. Statistical significance was evaluated using the Dorfman-Berbaum-Metz method [15]. The results were stratified according to the readers' specialty and years of experience, and the mean FOM scores with and without CAD in each group were compared. For the analyses of the sensitivity of the CAD software and of the adoption rate of lesions that readers initially missed but CAD detected, the lesions were grouped by anatomic location and degree of subtlety, and Fisher's exact test was used. Statistical significance was set at P < 0.05.
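
To make the weighting concrete: on an image with K lesions, each lesion receives weight 1/K, so every abnormal image contributes equally to the weighted lesion localization fraction (wLLF) regardless of its lesion count. A minimal Python sketch of this scheme (function names are ours; the actual analysis used RJafroc):

    def lesion_weights(num_lesions):
        # Equal weights that sum to 1 for each abnormal image.
        return [1.0 / num_lesions] * num_lesions

    def weighted_llf(hits, weights):
        # wLLF contribution of one image at a given confidence threshold:
        # the summed weights of its correctly localized lesions.
        return sum(w for hit, w in zip(hits, weights) if hit)

The wAFROC FOM is then the area under the curve of mean wLLF plotted against the false-positive fraction (FPF).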

Results

Among the 453 images used for the standalone test, 194 showed abnormal findings: nodules/masses in 101 (52%) images, consolidation in 91 (47%) images, and both in 2 (1%) images. Sixty images with abnormal findings were selected for the observer-performance test; these included 36 (53%) nodules/masses and 29 (47%) consolidations. Three images showed multiple nodules/masses, and two showed multiple areas of consolidation. The demographic features of each dataset are shown in Table 1.

Table 1 Patients’ demographics in datasets in the standalone and observer-performance tests

Detection performance of the CAD software

In the standalone test using 453 images, the CAD software achieved sensitivities of 83% (85/103) and 80% (74/93) for nodules/masses and consolidation, respectively. Thus, its sensitivity for detecting abnormal lesions was 81% (159/196), and its specificity was 62% (160/259).
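
As a quick arithmetic check of these pooled figures (per-lesion counts for sensitivity, per-patient counts for specificity):

    tp_nodule, n_nodule = 85, 103   # nodules/masses: 85/103 = 83%
    tp_consol, n_consol = 74, 93    # consolidation:  74/93  = 80%
    sensitivity = (tp_nodule + tp_consol) / (n_nodule + n_consol)
    specificity = 160 / 259         # normal images correctly left unmarked
    print(f"sensitivity {sensitivity:.0%}, specificity {specificity:.0%}")
    # -> sensitivity 81%, specificity 62%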

Comparison of detection performances with and without the CAD software

For the 200 images used in the observer-performance test, the CAD software achieved sensitivities of 79% (26/33) and 81% (26/32) for nodules/masses and consolidation, respectively. Thus, its sensitivity for detecting abnormal lesions was 80% (52/65), and its specificity was 63% (88/140). The mean wAFROC FOM scores of all readers without and with CAD were 0.746 (95% confidence interval [CI] 0.668–0.823) and 0.810 (95% CI 0.746–0.873), respectively; thus, the CAD software increased the wAFROC FOM score by 0.064, and the mean score with CAD was significantly higher than that without CAD (P = 0.007). Fig. 1 shows the wAFROC curves with and without CAD. When stratified by reader specialty, the mean wAFROC FOM scores for radiologists, pulmonologists, non-pulmonology physicians, and junior residents without CAD were 0.806 (95% CI 0.699–0.913), 0.817 (95% CI 0.714–0.920), 0.746 (95% CI 0.677–0.815), and 0.613 (95% CI 0.535–0.692), respectively. With the CAD software, these scores increased to 0.835 (95% CI 0.765–0.904), 0.839 (95% CI 0.763–0.915), 0.815 (95% CI 0.747–0.884), and 0.749 (95% CI 0.634–0.861), respectively; the increments were 0.029, 0.022, 0.069, and 0.136. The mean wAFROC FOM score of non-expert doctors (< 6 years of experience) improved significantly with CAD, from 0.680 (95% CI 0.586–0.773) to 0.779 (95% CI 0.705–0.853) (P = 0.011). The score of experts (≥ 6 years of experience) also improved with CAD, from 0.811 (95% CI 0.745–0.877) to 0.841 (95% CI 0.782–0.899), but the difference was not significant (P = 0.12). The increments for the two groups were 0.099 and 0.030, respectively. Table 2 summarizes these results.

Fig. 1
figure 1

Weighted alternative free-response receiver operating characteristic curves for the observer-performance test with and without computer-aided detection (CAD) software, in which the weighted lesion localization fraction (wLLF) is plotted against the false-positive fraction (FPF). The total result for all readers is shown. FOM Figure of merit

Table 2 Figure of merit scores for the detection of abnormal findings on chest radiographs with and without the computer-aided detection (CAD) software

Analyses of factors influencing sensitivity and final diagnosis with CAD assistance

Non-experts and experts detected 66% (256/390) and 78% (305/390) of all abnormal lesions, respectively, without using CAD. Table 3 shows the sensitivities of the CAD software and the human readers, stratified by anatomic location and degree of subtlety of the lesions. Experts had higher sensitivity than non-experts in all groups. The sensitivity of the CAD software for obscure lesions (subtlety level 1, 2, or 3) was significantly lower than that for distinct lesions (subtlety level 4 or 5) [50% (10/20) vs 93% (42/45); P < 0.001]. The human readers initially missed 117 lesions without CAD; for 78 (67%) of these, the readers accepted the CAD software's detection. Figures 2 and 3 show true-positive examples of a pulmonary nodule and consolidation that some readers missed without CAD but acknowledged after using the software. Table 4 describes the accepted lesions, stratified by their characteristics. Of the initially missed lesions, 67% (55/82) and 66% (23/35) were corrected with CAD in the non-expert and expert groups, respectively. When stratified by degree of subtlety, the adoption rate for obscure lesions was significantly lower than that for distinct lesions [55% (24/44) vs 74% (54/73); P = 0.04]. The CAD software failed to detect 13 lesions, 12 of which were detected by at least one human reader.
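
For reference, the subtlety-stratified comparison above can be reproduced from the reported counts with Fisher's exact test; a Python sketch using scipy.stats.fisher_exact (the study's analysis itself was run in R):

    from scipy.stats import fisher_exact

    # 2x2 table for CAD sensitivity by subtlety, from the counts above:
    # rows = obscure (levels 1-3) vs distinct (levels 4-5) lesions,
    # columns = detected vs missed by the CAD software.
    table = [[10, 10],   # obscure: 10/20 detected
             [42, 3]]    # distinct: 42/45 detected
    _, p_value = fisher_exact(table)
    print(f"P = {p_value:.2g}")  # consistent with the reported P < 0.001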

Table 3 Characteristics of abnormal lesions detected by the computer-aided detection (CAD) software alone and by human readers alone in the observer-performance test
Fig. 2
figure 2

Example of a true-positive pulmonary nodule. Four readers missed the lesion but corrected their decisions using the computer-aided detection (CAD) software. a The radiograph shows a nodule in the right upper field, overlapped by the mediastinum. The yellow circle shows the ground truth. b The white circle shows the output of the CAD software, which correctly detected the lesion

Fig. 3
figure 3

Example of a true-positive pulmonary consolidation. Five readers missed the lesion but corrected their decisions using the computer-aided detection (CAD) software. a The radiograph shows consolidation in the right lower field. The yellow circle shows the ground truth. b The white circle shows the output of the CAD software, which correctly detected the lesion

Table 4 Adoption rate of abnormal lesions that were initially missed by readers but detected by computer-aided detection software

Discussion

This study compared doctors’ performances in interpreting CRs with and without using CAD. The CAD software achieved about 80% sensitivity for detecting pulmonary nodules, masses, and consolidation on CRs in the standalone test. The observer-performance test showed that using the CAD software significantly increased the wAFROC FOM scores for these lesions.

Several studies have demonstrated that the assistance of deep learning-based algorithms yields higher detection performance than that achieved by human readers alone [11, 12]. The results of this study are consistent with these previous findings. Hwang et al. showed a significant improvement in the area under the curve for lesion-wise localization in various reader groups (from 0.781–0.907 to 0.873–0.938) [11]. Choi et al. reported that the assistance of a deep learning-based algorithm improved the FOM score from 0.843 to 0.911 [12]. However, these studies used algorithms developed from scratch for academic purposes.

In contrast, this study used a commercially available CAD software package to demonstrate the utility of CAD. The FOM scores recorded in this study (0.746 without CAD and 0.810 with CAD) were lower than those reported in the previous studies. However, the mean increments in FOM scores with CAD in those studies were 0.057 and 0.068, respectively, almost the same as the increment obtained in our study (0.064). Thus, while the lower FOM scores in this study may be attributable to the difficulty of the dataset used, the contribution of the software is comparable to that reported in previous studies.

The increment in FOM scores with CAD was higher for non-pulmonology physicians and junior residents than for pulmonologists and radiologists. It was also higher for doctors with < 6 years of experience than for doctors with ≥ 6 years of experience. Thus, this study shows that CAD software is more useful for non-expert doctors than for experts. In support of this finding, previous studies have also reported that CAD software was more beneficial for non-expert readers than for expert readers [11, 12, 16].

In contrast, few studies have analyzed the factors that influence readers' performance and final diagnoses with the use of CAD. In our study, the CAD software was significantly less sensitive for obscure lesions than for distinct lesions. The adoption rate of obscure lesions detected by the CAD software was also significantly lower than that of distinct lesions. These results indicate that detection of obscure lesions contributed less to the improvement in readers' performance with CAD than detection of distinct lesions. Furthermore, the adoption rates of CAD detections for lesions initially missed by non-experts and by experts were approximately the same (67% and 66%, respectively). Therefore, this study suggests that the CAD software was more effective for non-experts than for experts because non-experts missed more distinct lesions than experts did, and those lesions were detected by the CAD software.

The use of deep learning-based detection algorithms as second readers has already been described [11, 12, 17]. Such software packages can be adopted as second readers in daily practice, such as during medical checkups. The software automatically marks regions where abnormal findings are suspected; thus, even non-expert doctors can recognize the lesions. Approximately 50% of doctors in Japan who read CRs for screening are not experienced readers [8]. Additionally, visual and mental fatigue caused by heavy workloads can increase the chance of perceptual errors [18]. The use of CAD software in institutions can therefore help reduce misdiagnoses caused by these factors. However, the disadvantage of second-reader CAD is that reading with CAD takes longer than reading without it because two reading passes are required [19]. Therefore, some studies have highlighted the potential of using CAD software as a concurrent reader for CRs [11, 12]. On the other hand, using CAD software as a concurrent reader carries the risk that human readers may not attend to lesions that the software fails to detect. In our study, 20% of the abnormal findings were not detected by the CAD software, and 92% of those lesions were detected by at least one reader; using this software as a concurrent reader might therefore lead to such lesions being missed. To the best of our knowledge, no study has validated the effect of deep learning-based algorithms on CRs as concurrent readers. Further study is therefore required to determine which reader type is more suitable for routine clinical practice.

This study has several limitations. First, validation was performed using small, purposively designed datasets. In our study, 30% of the images in the observer-performance test showed pulmonary nodules/masses or consolidation, whereas one study reported that only 8% of CRs taken for mandated health examinations showed any abnormal finding [20]. Thus, the prevalence of abnormal findings in this study was considerably higher than that usually seen in routine practice, which may have affected the adoption rate of CAD software detections. Second, CT was not used for ground-truth labeling; although the images were reviewed by three board-certified radiologists, some lesions might have been missed. Finally, this study was conducted in accordance with the US Food and Drug Administration guideline rather than the Japanese guidelines, so its performance could not be accurately compared with that of other products marketed in Japan.

In summary, this sequential evaluation study showed that the CAD software improved doctors' performance in detecting nodules/masses and consolidation on CRs, particularly for non-expert doctors, by preventing them from missing distinct lesions rather than by helping them detect obscure lesions. In clinical practice, this software may prevent doctors from missing incidental lung abnormalities, such as lung cancers, due to inexperience or carelessness. Further prospective studies using multicenter data are required to validate the contribution of CAD software packages to clinical practice.