Skip to main content

A new breast tumor ultrasonography CAD system based on decision tree and BI-RADS features


In this paper, we present a novel computer-aided diagnostic (CAD) system based on the Breast Imaging Reporting and Data System (BI-RADS) terminology scores of screening ultrasonography (US). The decision tree algorithm is adopted to analyze the BI-RADS information to differentiate between the malignant and benign breast tumors. Although many ultrasonography CAD systems have been developed for decades, there are still some problems in clinical practice. Previous CAD systems are opaque for clinicians and cannot process the ultrasound image from different ultrasound machines. This study proposes a novel CAD system utilizing BI-RADS scoring standard and Classification and Regression Tree (CART) algorithm to overcome the two problems. The original dataset consists of 1300 ultrasound breast images. Three well-experienced clinicians evaluated all of the images according to the BI-RADS feature scoring standard. Subsequently, each image could be transformed into a 25 × 1 vector. The CART algorithm was finally used to classify these vectors. In the experiments, we used the oversampling method to balance the number of malignant samples and benign samples. The 5-fold cross validation was employed to evaluate the performance of the system. The accuracy reached 94.58%, the specificity was 98.84%, the sensitivity was 90.80%, the positive predictive value (PPV) was 98.91% and the negative predictive value (NVP) was 90.56%. The experiment results show that the proposed system can obtain a sufficient performance in the breast diagnosis and can effectively recognize the benign breast tumors in BI-RADS 3.

This is a preview of subscription content, access via your institution.

Figure 1
Figure 2
Figure 3


  1. Brinton, J. T., et al. Breast Cancer Facts & Figures 2011–2012. American Cancer Society (2011)

  2. American College of Radiology: ACR BI-RADS–US Lexicon Classification Form. (2013)

  3. Loh, W.Y.: Classification and regression trees. Wiley Interdisciplinary Reviews: Data Mining and Knowledge Discovery, 1(1), 14–23 (2011)

    Google Scholar 

  4. Chen, C., Chou, Y., Han, K., Hung, G., Tiu, C., Chiou, H., Chiou, S.: Breast lesions on sonograms: computer-aided diagnosis with nearly setting-independent features and artificial neural networks 1. Radiology. 226(2), 504–514 (2003)

    Article  Google Scholar 

  5. Committee, A.C.O.R., Radiology, A.C.O.: Breast imaging reporting and data system. American College of Radiology. (1998)

  6. DeSantis, C., Ma, J., Bryan, L., Jemal, A.: Breast cancer statistics, 2013. CA Cancer J. Clin. 64(1), 52–62 (2014)

    Article  Google Scholar 

  7. Drukker, K., Giger, M.L., Horsch, K., Kupinski, M.A., Vyborny, C.J., Mendelson, E.B.: Computerized lesion detection on breast ultrasound. Med. Phys. 29(7), 1438–1446 (2002)

    Article  Google Scholar 

  8. Estabrooks, A., Jo, T., Japkowicz, N.: A multiple resampling method for learning from imbalanced data sets. Comput Intell-US. 20(1), 18–36 (2004)

    Article  MathSciNet  Google Scholar 

  9. Feng, X., Guo, X., Huang, Q.: Systematic evaluation on speckle suppression methods in examination of ultrasound breast images. Appl. Sci. 7(1), 37 (2016)

    Article  Google Scholar 

  10. Hamy, A.S., Giacchetti, S., Albiter, M., Bazelaire, C.D., Cuvier, C., Perret, F., Bonfils, S., Charvériat, P., Hocini, H., Roquancourt, A.D.: BI-RADS categorisation of 2708 consecutive nonpalpable breast lesions in patients referred to a dedicated breast care unit. Eur. Radiol. 22(1), 9–17 (2012)

    Article  Google Scholar 

  11. He, W., Zhu, X., Cheng, D., Hu, R., Zhang, S.: Unsupervised feature selection for visual classification via feature-representation property. Neuro Comput. 236(C), 5–13 (2017)

    Google Scholar 

  12. Heinig, J., Witteler, R., Schmitz, R., Kiesel, L., Steinhard, J.: Accuracy of classification of breast ultrasound findings based on criteria used for BI-RADS. Breast Diseases A Year Book Quarterly, 32(4), 573–578 (2008)

    Article  Google Scholar 

  13. Huang, Y.L., Kuo, S.J., Chang, C.S., Liu, Y.K., Moon, W.K., Chen, D.R.: Image retrieval with principal component analysis for breast cancer diagnosis on various ultrasonic systems. Ultrasound Obst Gyn. 26(5), 558–566 (2005)

    Article  Google Scholar 

  14. Huang, Q., Yang, F., Liu, L., Li, X.: Automatic segmentation of breast lesions for interaction in ultrasonic computer-aided diagnosis. Inf. Sci. 314, 293–310 (2015)

    Article  Google Scholar 

  15. Huang, Q., Luo, Y., Zhang, Q.: Breast ultrasound image segmentation: a survey. Int. J. Comput. Assist. Radiol. Surg. 12(3), 493–507 (2017)

    Article  Google Scholar 

  16. Ikedo, Y., Morita, T., Fukuoka, D., Hara, T., Lee, G., Fujita, H., Takada, E., Endo, T.: Automated analysis of breast parenchymal patterns in whole breast ultrasound images: preliminary experience. Int J Comput Ass Rad. 4(3), 299–306 (2009)

    Google Scholar 

  17. Jesneck, J.L., Lo, J.Y., Baker, J.A.: Breast mass lesions: computer-aided diagnosis models with mammographic and sonographic descriptors 1. Radiology. 244(2), 390–398 (2007)

    Article  Google Scholar 

  18. Lu, X., Li, X., Mou, L.: Semi-supervised multitask learning for scene recognition. IEEE Trans. Cybern. 45(9), 1967–1976 (2015)

    Article  Google Scholar 

  19. Luo, Y., Liu, L., Huang, Q., Li, X.: A novel segmentation approach combining region-and edge-based information for ultrasound images. Biomed. Res. Int. 2017, (2017)

  20. Mendez, A., Cabanillas, F., Echenique, M., Malekshamran, K., Perez, I., Ramos, E.: Mammographic features and correlation with biopsy findings using 11-gauge stereotactic vacuum-assisted breast biopsy (SVABB). Ann. Oncol. 15(3), 450 (2004)

    Article  Google Scholar 

  21. Nothacker, M., Duda, V., Hahn, M., Warm, M., Degenhardt, F., Madjar, H., Weinbrenner, S., Albert, U.S.: Early detection of breast cancer: benefits and risks of supplemental breast ultrasound in asymptomatic women with mammographically dense breast tissue. A systematic review. BMC Cancer. 9(1), 335 (2009)

    Article  Google Scholar 

  22. Prabusankarlal, K.M., Thirumoorthy, P., Manavalan, R.: Assessment of combined textural and morphological features for diagnosis of breast masses in ultrasound. HCIS. 5(1), 1–17 (2015)

    Google Scholar 

  23. Quinlan, J.R.: Induction of decision trees. Mach. Learn. 1(1), 81–106 (1986)

    Google Scholar 

  24. Huang, Q., Huang, X., Liu, L., Lin, Y., Long, X., Li, X.: A Case-oriented web-based training system for breast cancer diagnosis. Comput. Methods Prog. Biomed. 156, 73–83 (2017)

    Article  Google Scholar 

  25. Quinlan, J.R.: C4. 5: programs for machine learning. Morgan Kaufmann Series in Machine Learning, Morgan Kaufman Publishers, San Francisco (1993)

  26. Saunders, J.B., Aasland, O.G., Babor, T.F., De la Fuente, J.R., Grant, M.: Development of the alcohol use disorders identification test (AUDIT): WHO collaborative project on early detection of persons with harmful alcohol consumption-II. Addiction. 88(6), 791–804 (1993)

    Article  Google Scholar 

  27. Shan, J., Alam, S.K., Garra, B., Zhang, Y., Ahmed, T.: Computer-aided diagnosis for breast ultrasound using computerized BI-RADS features and machine learning methods. Ultrasound Med. Biol. 42(4), 980–988 (2016)

    Article  Google Scholar 

  28. Shen, W., Chang, R., Moon, W.K., Chou, Y., Huang, C.: Breast ultrasound computer-aided diagnosis using BI-RADS features. Acad. Radiol. 14(8), 928–939 (2007)

    Article  Google Scholar 

  29. Su, Y., Wang, Y.: Computer-aided classification of breast tumors using the affinity propagation clustering 2010 4th International Conference on IEEE in Bioinformatics and Biomedical Engineering, pp. 1–4. IEEE (2010)

  30. Wang, S., Pan, P., Long, G., Chen, W., Li, X., Sheng, Q.Z.: Compact representation for large-scale unconstrained video analysis. World Wide Web Internet Web Inf. Syst. 19(2), 231–246 (2016)

    Article  Google Scholar 

  31. Weiss, S.M., Kapouleas, I.: An empirical comparison of pattern recognition, neural nets, and machine learning classification methods. The 11th Proceedings of the Eleventh International Joint Conference on Artificial Intelligence, pp. 781–787. Morgan Kaufmann, San Mateo, CA (1989)

  32. Ye, R., Li, X.: Collective representation for abnormal event detection. J. Comput. Sci. Technol. 32(3), 470–479 (2017)

    Article  MathSciNet  Google Scholar 

  33. Zhang, H., Gao, X., Wu, P., Xu, X.: A cross-media distance metric learning framework based on multi-view correlation mining and matching. World Wide Web Internet Web Inf. Syst. 19(2), 181–197 (2016)

    Article  Google Scholar 

  34. Zhou, S., Shi, J., Zhu, J., Cai, Y., Wang, R.: Shearlet-based texture feature extraction for classification of breast tumor in ultrasound image. Biomed. Signal Process. Control. 8(6), 688–696 (2013)

    Article  Google Scholar 

  35. Zhu, X., Li, X., Zhang, S., Ju, C., Wu, X.: Robust joint graph sparse coding for unsupervised spectral feature selection. IEEE Trans. Neural Netw. Learn. Syst., PP (99), 1–13 (2016)

Download references


This work was partially supported by National Natural Science Foundation of China (Nos. 61372007 and 61571193), Guangzhou Key Lab of Body Data Science (No.201605030011), Natural Science Foundation of Guangdong Province, China (Nos. 2017A030312006 and 2015A030313210), Fundamental Research Funds for the Central Universities (NO.2015ZM138), Project of Science and Technology Department of Guangdong province (2014A050503020, 2016A010101021, 2016A010101022 and 2016A010101023), Science and Technology Program of Guangzhou (no. 201704020134).

Author information

Authors and Affiliations


Corresponding author

Correspondence to Qinghua Huang.

Additional information

This article belongs to the Topical Collection: Special issue on Deep Mining Big Social Data

Guest Editors: Xiaofeng Zhu, Gerard Sanroma, Jilian Zhang, and Brent C. Munsell

Rights and permissions

Reprints and Permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Huang, Q., Zhang, F. & Li, X. A new breast tumor ultrasonography CAD system based on decision tree and BI-RADS features. World Wide Web 21, 1491–1504 (2018).

Download citation

  • Received:

  • Revised:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: