Abstract
In this paper, we present a novel computer-aided diagnostic (CAD) system based on the Breast Imaging Reporting and Data System (BI-RADS) terminology scores of screening ultrasonography (US). The decision tree algorithm is adopted to analyze the BI-RADS information to differentiate between the malignant and benign breast tumors. Although many ultrasonography CAD systems have been developed for decades, there are still some problems in clinical practice. Previous CAD systems are opaque for clinicians and cannot process the ultrasound image from different ultrasound machines. This study proposes a novel CAD system utilizing BI-RADS scoring standard and Classification and Regression Tree (CART) algorithm to overcome the two problems. The original dataset consists of 1300 ultrasound breast images. Three well-experienced clinicians evaluated all of the images according to the BI-RADS feature scoring standard. Subsequently, each image could be transformed into a 25 × 1 vector. The CART algorithm was finally used to classify these vectors. In the experiments, we used the oversampling method to balance the number of malignant samples and benign samples. The 5-fold cross validation was employed to evaluate the performance of the system. The accuracy reached 94.58%, the specificity was 98.84%, the sensitivity was 90.80%, the positive predictive value (PPV) was 98.91% and the negative predictive value (NVP) was 90.56%. The experiment results show that the proposed system can obtain a sufficient performance in the breast diagnosis and can effectively recognize the benign breast tumors in BI-RADS 3.
This is a preview of subscription content, access via your institution.



References
Brinton, J. T., et al. Breast Cancer Facts & Figures 2011–2012. American Cancer Society (2011)
American College of Radiology: ACR BI-RADS–US Lexicon Classification Form. (2013)
Loh, W.Y.: Classification and regression trees. Wiley Interdisciplinary Reviews: Data Mining and Knowledge Discovery, 1(1), 14–23 (2011)
Chen, C., Chou, Y., Han, K., Hung, G., Tiu, C., Chiou, H., Chiou, S.: Breast lesions on sonograms: computer-aided diagnosis with nearly setting-independent features and artificial neural networks 1. Radiology. 226(2), 504–514 (2003)
Committee, A.C.O.R., Radiology, A.C.O.: Breast imaging reporting and data system. American College of Radiology. (1998)
DeSantis, C., Ma, J., Bryan, L., Jemal, A.: Breast cancer statistics, 2013. CA Cancer J. Clin. 64(1), 52–62 (2014)
Drukker, K., Giger, M.L., Horsch, K., Kupinski, M.A., Vyborny, C.J., Mendelson, E.B.: Computerized lesion detection on breast ultrasound. Med. Phys. 29(7), 1438–1446 (2002)
Estabrooks, A., Jo, T., Japkowicz, N.: A multiple resampling method for learning from imbalanced data sets. Comput Intell-US. 20(1), 18–36 (2004)
Feng, X., Guo, X., Huang, Q.: Systematic evaluation on speckle suppression methods in examination of ultrasound breast images. Appl. Sci. 7(1), 37 (2016)
Hamy, A.S., Giacchetti, S., Albiter, M., Bazelaire, C.D., Cuvier, C., Perret, F., Bonfils, S., Charvériat, P., Hocini, H., Roquancourt, A.D.: BI-RADS categorisation of 2708 consecutive nonpalpable breast lesions in patients referred to a dedicated breast care unit. Eur. Radiol. 22(1), 9–17 (2012)
He, W., Zhu, X., Cheng, D., Hu, R., Zhang, S.: Unsupervised feature selection for visual classification via feature-representation property. Neuro Comput. 236(C), 5–13 (2017)
Heinig, J., Witteler, R., Schmitz, R., Kiesel, L., Steinhard, J.: Accuracy of classification of breast ultrasound findings based on criteria used for BI-RADS. Breast Diseases A Year Book Quarterly, 32(4), 573–578 (2008)
Huang, Y.L., Kuo, S.J., Chang, C.S., Liu, Y.K., Moon, W.K., Chen, D.R.: Image retrieval with principal component analysis for breast cancer diagnosis on various ultrasonic systems. Ultrasound Obst Gyn. 26(5), 558–566 (2005)
Huang, Q., Yang, F., Liu, L., Li, X.: Automatic segmentation of breast lesions for interaction in ultrasonic computer-aided diagnosis. Inf. Sci. 314, 293–310 (2015)
Huang, Q., Luo, Y., Zhang, Q.: Breast ultrasound image segmentation: a survey. Int. J. Comput. Assist. Radiol. Surg. 12(3), 493–507 (2017)
Ikedo, Y., Morita, T., Fukuoka, D., Hara, T., Lee, G., Fujita, H., Takada, E., Endo, T.: Automated analysis of breast parenchymal patterns in whole breast ultrasound images: preliminary experience. Int J Comput Ass Rad. 4(3), 299–306 (2009)
Jesneck, J.L., Lo, J.Y., Baker, J.A.: Breast mass lesions: computer-aided diagnosis models with mammographic and sonographic descriptors 1. Radiology. 244(2), 390–398 (2007)
Lu, X., Li, X., Mou, L.: Semi-supervised multitask learning for scene recognition. IEEE Trans. Cybern. 45(9), 1967–1976 (2015)
Luo, Y., Liu, L., Huang, Q., Li, X.: A novel segmentation approach combining region-and edge-based information for ultrasound images. Biomed. Res. Int. 2017, (2017)
Mendez, A., Cabanillas, F., Echenique, M., Malekshamran, K., Perez, I., Ramos, E.: Mammographic features and correlation with biopsy findings using 11-gauge stereotactic vacuum-assisted breast biopsy (SVABB). Ann. Oncol. 15(3), 450 (2004)
Nothacker, M., Duda, V., Hahn, M., Warm, M., Degenhardt, F., Madjar, H., Weinbrenner, S., Albert, U.S.: Early detection of breast cancer: benefits and risks of supplemental breast ultrasound in asymptomatic women with mammographically dense breast tissue. A systematic review. BMC Cancer. 9(1), 335 (2009)
Prabusankarlal, K.M., Thirumoorthy, P., Manavalan, R.: Assessment of combined textural and morphological features for diagnosis of breast masses in ultrasound. HCIS. 5(1), 1–17 (2015)
Quinlan, J.R.: Induction of decision trees. Mach. Learn. 1(1), 81–106 (1986)
Huang, Q., Huang, X., Liu, L., Lin, Y., Long, X., Li, X.: A Case-oriented web-based training system for breast cancer diagnosis. Comput. Methods Prog. Biomed. 156, 73–83 (2017)
Quinlan, J.R.: C4. 5: programs for machine learning. Morgan Kaufmann Series in Machine Learning, Morgan Kaufman Publishers, San Francisco (1993)
Saunders, J.B., Aasland, O.G., Babor, T.F., De la Fuente, J.R., Grant, M.: Development of the alcohol use disorders identification test (AUDIT): WHO collaborative project on early detection of persons with harmful alcohol consumption-II. Addiction. 88(6), 791–804 (1993)
Shan, J., Alam, S.K., Garra, B., Zhang, Y., Ahmed, T.: Computer-aided diagnosis for breast ultrasound using computerized BI-RADS features and machine learning methods. Ultrasound Med. Biol. 42(4), 980–988 (2016)
Shen, W., Chang, R., Moon, W.K., Chou, Y., Huang, C.: Breast ultrasound computer-aided diagnosis using BI-RADS features. Acad. Radiol. 14(8), 928–939 (2007)
Su, Y., Wang, Y.: Computer-aided classification of breast tumors using the affinity propagation clustering 2010 4th International Conference on IEEE in Bioinformatics and Biomedical Engineering, pp. 1–4. IEEE (2010)
Wang, S., Pan, P., Long, G., Chen, W., Li, X., Sheng, Q.Z.: Compact representation for large-scale unconstrained video analysis. World Wide Web Internet Web Inf. Syst. 19(2), 231–246 (2016)
Weiss, S.M., Kapouleas, I.: An empirical comparison of pattern recognition, neural nets, and machine learning classification methods. The 11th Proceedings of the Eleventh International Joint Conference on Artificial Intelligence, pp. 781–787. Morgan Kaufmann, San Mateo, CA (1989)
Ye, R., Li, X.: Collective representation for abnormal event detection. J. Comput. Sci. Technol. 32(3), 470–479 (2017)
Zhang, H., Gao, X., Wu, P., Xu, X.: A cross-media distance metric learning framework based on multi-view correlation mining and matching. World Wide Web Internet Web Inf. Syst. 19(2), 181–197 (2016)
Zhou, S., Shi, J., Zhu, J., Cai, Y., Wang, R.: Shearlet-based texture feature extraction for classification of breast tumor in ultrasound image. Biomed. Signal Process. Control. 8(6), 688–696 (2013)
Zhu, X., Li, X., Zhang, S., Ju, C., Wu, X.: Robust joint graph sparse coding for unsupervised spectral feature selection. IEEE Trans. Neural Netw. Learn. Syst., PP (99), 1–13 (2016)
Acknowledgements
This work was partially supported by National Natural Science Foundation of China (Nos. 61372007 and 61571193), Guangzhou Key Lab of Body Data Science (No.201605030011), Natural Science Foundation of Guangdong Province, China (Nos. 2017A030312006 and 2015A030313210), Fundamental Research Funds for the Central Universities (NO.2015ZM138), Project of Science and Technology Department of Guangdong province (2014A050503020, 2016A010101021, 2016A010101022 and 2016A010101023), Science and Technology Program of Guangzhou (no. 201704020134).
Author information
Authors and Affiliations
Corresponding author
Additional information
This article belongs to the Topical Collection: Special issue on Deep Mining Big Social Data
Guest Editors: Xiaofeng Zhu, Gerard Sanroma, Jilian Zhang, and Brent C. Munsell
Rights and permissions
About this article
Cite this article
Huang, Q., Zhang, F. & Li, X. A new breast tumor ultrasonography CAD system based on decision tree and BI-RADS features. World Wide Web 21, 1491–1504 (2018). https://doi.org/10.1007/s11280-017-0522-5
Received:
Revised:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s11280-017-0522-5