Abstract
Breast cancer has become one of the most common and deadly cancers in the world, and its treatment has been the focus of research. In the search for breast cancer drug candidate compounds, it is important to establish an effective quantitative structure-activity relationship for drug research and development. Neural networks have achieved high accuracy in this field, but with shortcomings of a large number of parameters, high model complexity, and poor interpretability. Therefore, a Serial Fuzzy System built by Subtractive clustering and ANFIS (SFSSA) layer by layer is proposed to explore a solution with better interpretability. Through the experiment in the bioactivity data set of candidate compounds with several models, the following conclusions are found: 1) The precision of SFSSA is better than that of classic linear regression; 2) SFSSA has fewer parameters and rules, and has better interpretability and generalization ability than classic neural network algorithms; 3) SFSSA has less training time and higher prediction accuracy than optimized TSK fuzzy system algorithm MBGD-RDA (Minibatch Gradient Descent with Regularization, DropRule, and AdaBound); 4) SFSSA’s subsystem with 15 inputs achieved best prediction effect. In short, SFSSA provides a new way to apply fuzzy systems for high-dimensional regression problems.
Similar content being viewed by others
References
Bray F, Laversanne M, Weiderpass E, Soerjomataram I (2021) The ever-increasing importance of cancer as a leading cause of premature death worldwide. Cancer 127(16):3029–3030. https://doi.org/10.1002/cncr.33587
Sung H, Ferlay J, Siegel RL, Laversanne M, Soerjomataram I, Jemal A, Bray F (2021) Global cancer statistics 2020: Globocan estimates of incidence and mortality worldwide for 36 cancers in 185 countries. CA Cancer J Clin 71(3):209–249. https://doi.org/10.3322/caac.21660
Germain D (2011) Estrogen carcinogenesis in breast cancer. Endocrinol Metab Clin North Am 40(3):473–484. https://doi.org/10.1016/j.ecl.2011.05.009, Hormones and Cancer: Breast and Prostate
Kumar N, Gulati HK, Sharma A, Heer S, Jassal AK, Arora L, Kaur S, Singh A, Bhagat K, Kaur A, Singh H, Singh JV, Bedi PMS (2021) Most recent strategies targeting estrogen receptor alpha for the treatment of breast cancer. Mol Divers 25:603–624. https://doi.org/10.1007/s11030-020-10133-y
Tecalco-Cruz CA, Ramírez-Jarquín OJ, Cruz-Ramos E (2019) Estrogen receptor alpha and its ubiquitination in breast cancer cells. Curr Drug Targets 20(6):690–704. https://doi.org/10.2174/1389450119666181015114041
Achary GRP (2020) Applications of quantitative structure-activity relationships (qsar) based virtual screening in drug design: a review. Mini-Rev Med Chem 20(14):1375–1388. https://doi.org/10.2174/1389557520666200429102334
Krizhevsky A, Sutskever I, Hinton GE (2017) Imagenet classification with deep convolutional neural networks. Commun ACM 60(6):84–90. https://doi.org/10.1145/3065386
Szegedy C, Liu W, Jia Y, Sermanet P, Reed S, Anguelov D, Erhan D, Vanhoucke V, Rabinovich A (2015) Going deeper with convolutions. In: 2015 IEEE Conference on computer vision and pattern recognition (CVPR), pp 1–9. https://doi.org/10.1109/CVPR.2015.7298594
He K, Zhang X, Ren S, Sun J (2016) Deep residual learning for image recognition. In: 2016 IEEE Conference on computer vision and pattern recognition (CVPR), pp 770–778. https://doi.org/10.1109/CVPR.2016.90
Kuwajima H, Yasuoka H, Nakae T (2020) Engineering problems in machine learning systems. Mach Learn 109:1103–1126. https://doi.org/10.1007/s10994-020-05872-w
Rudin C (2019) Stop explaining black box machine learning models for high stakes decisions and use interpretable models instead. Nat Mach Intell 1(5):206–215. https://doi.org/10.1038/s42256-019-0048-x
Arunkumar C, Ramakrishnan S (2019) Prediction of cancer using customised fuzzy rough machine learning approaches. Healthcare Technol Lett 6(1):13–18. https://doi.org/10.1049/htl.2018.5055
Idris NF, Ismail MA (2021) Breast cancer disease classification using fuzzy-id3 algorithm with fuzzydbd method: automatic fuzzy database definition. PeerJ Comput Sci 7:427. https://doi.org/10.7717/peerj-cs.427
Sharma A, Nilam, Singh HP (2022) Computer-controlled diabetes disease diagnosis technique based on fuzzy inference structure for insulin-dependent patients. Applied Intelligence. https://doi.org/10.1007/s10489-022-03416-4
Son LH, Fujita H (2019) Neural-fuzzy with representative sets for prediction of student performance. Appl Intell 49(1):172–187. https://doi.org/10.1007/s10489-018-1262-7
Saxena U (2012) Integrating neuro-fuzzy systems to develop intelligent planning systems for predicting students’ performance. Int J Eval Res Educ (IJERE) 1(2):61–66. https://doi.org/10.11591/ijere.v1i2.738
Zhang Y, Xu Z, Liao H (2017) A consensus process for group decision making with probabilistic linguistic preference relations. Inform Sci 414:260–275. https://doi.org/10.1016/j.ins.2017.06.006
Gou X, Xu Z, Herrera F (2018) Consensus reaching process for large-scale group decision making with double hierarchy hesitant fuzzy linguistic preference relations. Knowl-Based Syst 157:20–33. https://doi.org/10.1016/j.knosys.2018.05.008
Gou X, Xu Z, Liao H, Herrera F (2021) Consensus model handling minority opinions and noncooperative behaviors in large-scale group decision-making under double hierarchy linguistic preference relations. IEEE Trans Cybern 51(1):283–296. https://doi.org/10.1109/TCYB.2020.2985069
Wang L-X, Mendel JM (1992) Generating fuzzy rules by learning from examples. IEEE Trans Syst Man Cybern 22(6):1414–1427. https://doi.org/10.1109/21.199466
Jang J-SR (1993) Anfis: adaptive-network-based fuzzy inference system. IEEE Trans Syst Man Cybern 23(3):665–685. https://doi.org/10.1109/21.256541
Cuevas E, Díaz P, Avalos O, Zaldivar D, Cisneros M (2018) Nonlinear system identification based on anfis-hammerstein model using gravitational search algorithm. Appl Intell 48(1):182–203. https://doi.org/10.1007/s10489-017-0969-1
Wu D, Mendel JM (2020) Patch learning. IEEE Trans Fuzzy Syst 28(9):1996–2008. https://doi.org/10.1109/TFUZZ.2019.2930022
Zhe I, Keikhosrokiani P (2021) Knowledge workers mental workload prediction using optimised elanfis. Appl Intell 51:1–25. https://doi.org/10.1007/s10489-020-01928-5
Wu D, Yuan Y, Huang J, Tan Y (2020) Optimize tsk fuzzy systems for regression problems: Minibatch gradient descent with regularization, droprule, and adabound (mbgd-rda). IEEE Trans Fuzzy Syst 28(5):1003–1015. https://doi.org/10.1109/TFUZZ.2019.2958559
Cui Y, Wu D, Huang J (2020) Optimize tsk fuzzy systems for classification problems: Minibatch gradient descent with uniform regularization and batch normalization. IEEE Trans Fuzzy Syst 28(12):3065–3075. https://doi.org/10.1109/TFUZZ.2020.2967282
Wang Ls-X (2020) Fast training algorithms for deep convolutional fuzzy systems with application to stock index prediction. IEEE Trans Fuzzy Syst 28(7):1301–1314. https://doi.org/10.1109/TFUZZ.2019.2930488
Huang Y, Chen D, Zhao W, Mo H (2021) Deep fuzzy system algorithms based on deep learning and input sharing for regression application. Int J Fuzzy Syst 23:727–742. https://doi.org/10.1007/s40815-020-00998-4
Zhao W, Chen D, Zhuo Y, Huang Y (2020) Deep neural fuzzy system algorithm and its regression application. Zidonghua Xuebao/Acta Automatica Sinica 46:2350–2358. https://doi.org/10.16383/j.aas.c200100
Huang Y, Chen D, Zhao W, Lv Y (2022) Fuzzy c-means clustering based deep patch learning with improved interpretability for classification problems. IEEE Access 10:49873–49891. https://doi.org/10.1109/ACCESS.2022.3171109
Talpur N, Jadid Abdulkadir S, Alhussian H, Hasan MH, Aziz N, Bamhdi A (2022) Deep neuro-fuzzy system application trends, challenges, and future perspectives: a systematic survey. Artif Intell Rev, 1–49. https://doi.org/10.1007/s10462-022-10188-3
Reshef DN, Reshef YA, Finucane HK, Grossman SR, McVean G, Turnbaugh PJ, Lander ES, Mitzenmacher M, Sabeti PC (2011) Detecting novel associations in large data sets. Science 334(6062):1518–1524. https://doi.org/10.1126/science.1205438
Acknowledgments
The main idea of this paper dates back to 10 years ago, when Dewang Chen, the corresponding author of this paper, was visiting University of California at Berkeley as a visiting scholar of Prof. Lotfi Zadeh, father of fuzzy logic and member of Academy of Engineering of USA. In the discussion during the serial seminars named as “Interpretability vs Accuracy” hosted by Prof. Zadeh, the idea of deep fuzzy modeling occurred to Chen’ mind. After long-time thinking, coding and writing, this article was formed. We would like to express our gratitude to the late Prof. Zadeh, who inspired us to pursue interpretable AI, not just high-accuracy AI.
This work is jointly supported by the National Natural Science Foundation of China under Grant 61976055, the special fund for education and scientific research of Fujian Provincial Department of Finance under Grant GY-Z21001, and open project support of State Key Laboratory of Management and Control for Complex Systems under Grant 20210116.
Author information
Authors and Affiliations
Corresponding author
Additional information
Publisher’s note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
Springer Nature or its licensor holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.
About this article
Cite this article
Zhao, W., Chen, D., Zheng, X. et al. Serial fuzzy system algorithm for predicting biological activity of anti-breast cancer compounds. Appl Intell 53, 13801–13814 (2023). https://doi.org/10.1007/s10489-022-04134-7
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s10489-022-04134-7