A Fuzzy Twin Support Vector Machine Based on Dissimilarity Measure and Its Biomedical Applications

Qiu, Jianxiang; Xie, Jialiang; Zhang, Dongxiao; Zhang, Ruping; Lin, Mingwei

doi:10.1007/s40815-024-01725-z

Published: 17 May 2024

International Journal of Fuzzy Systems

Abstract

Biomedical data exhibit high-dimensional complexity in their internal structure and are susceptible to noise interference, which makes classification tasks on such data highly challenging. The twin support vector machine (TSVM) is a machine learning algorithm that can effectively solve pattern recognition problems. To mitigate the negative impact of noise, researchers have combined fuzzy set theory with TSVM and used fuzzy membership to describe the influence of different samples on the construction of the optimal hyperplane, thus extending TSVM to the fuzzy twin support vector machine (FTSVM). In this paper, a dissimilarity measure based on data distribution is introduced into the fuzzy membership assignment process, and a novel fuzzy membership assignment strategy is designed to reduce the negative impact of noise in biomedical data. Rather than relying on geometric distance, this strategy takes data distribution as the primary factor in measuring dissimilarity between samples and then constructs a heuristic function to assign fuzzy membership to different samples. Combining this strategy with TSVM, this paper proposes a fuzzy twin support vector machine based on dissimilarity measure (DFTSVM), which effectively solves classification problems with noise and shows excellent generalization performance on biomedical data. Moreover, DFTSVM employs a coordinate descent strategy with active-set shrinking to reduce computational complexity, which significantly improves the training speed of the model. Experiments are conducted on 14 biomedical datasets to compare the performance of DFTSVM with 10 heterogeneous machine learning classification algorithms and four homologous algorithms. The results demonstrate that DFTSVM outperforms the other algorithms in classification performance on biomedical data. It exhibits excellent generalization performance in noisy environments, and its advantages in generalization performance and noise robustness become more prominent as the noise rate increases.
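
The abstract does not give the dissimilarity measure or the heuristic membership function in closed form, so the following is only a minimal sketch of the general idea, not the authors' method. It assumes a probability-mass-based dissimilarity (the fraction of samples that fall between two points on each feature, aggregated with a power mean) and a hypothetical membership rule that decreases linearly with the dissimilarity between a sample and the per-feature median of its own class. The names `mass_dissimilarity`, `fuzzy_memberships`, `p`, and `floor` are illustrative assumptions.

```python
import numpy as np


def mass_dissimilarity(x, y, data, p=2.0):
    """Data-dependent dissimilarity between x and y (illustrative assumption).

    For each feature, measure the fraction of samples in `data` whose value
    lies between x and y, then combine the fractions with a power mean.
    Two points separated by a dense region are therefore "further apart"
    than two points separated by a sparse region of the same width.
    """
    lo = np.minimum(x, y)
    hi = np.maximum(x, y)
    in_range = (data >= lo) & (data <= hi)   # shape (n_samples, n_features)
    mass = in_range.mean(axis=0)             # per-feature probability mass
    return float(np.mean(mass ** p) ** (1.0 / p))


def fuzzy_memberships(X, labels, p=2.0, floor=1e-3):
    """Hypothetical heuristic: membership shrinks as a sample becomes more
    dissimilar to the per-feature median of its own class."""
    memberships = np.empty(len(X), dtype=float)
    for cls in np.unique(labels):
        idx = np.flatnonzero(labels == cls)
        center = np.median(X[idx], axis=0)
        d = np.array([mass_dissimilarity(X[i], center, X, p) for i in idx])
        scale = d.max() if d.max() > 0 else 1.0
        # Clip at `floor` so that no sample is discarded entirely.
        memberships[idx] = np.maximum(1.0 - d / scale, floor)
    return memberships


# Toy data: a tight class-0 cluster, a tight class-1 cluster, and one
# mislabelled point (index 4) that carries label 0 but lies far away.
X_demo = np.array([
    [0.00, 0.00], [0.10, 0.10], [-0.10, 0.05], [0.05, -0.10], [5.00, 5.00],
    [3.00, 3.00], [3.10, 2.90], [2.90, 3.10], [3.05, 3.00],
])
y_demo = np.array([0, 0, 0, 0, 0, 1, 1, 1, 1])
mu = fuzzy_memberships(X_demo, y_demo)
```

In this toy setting the mislabelled point receives the smallest membership in its class, so it would contribute least to the corresponding hyperplane if `mu` were used to weight the slack terms of a twin SVM. Note that under this mass-based definition the dissimilarity of a point to itself is generally not zero, which is one way such measures differ from geometric distances.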

Data Availability

The datasets used in this paper come from the UCI Machine Learning Repository, which is publicly available at https://archive.ics.uci.edu/.

Acknowledgements

The authors would like to thank the editors and the anonymous referees for their professional comments, which improved the quality of the manuscript. This work was supported in part by the National Natural Science Foundation of China (Grant Nos. 12271211, 12071179), the National Natural Science Foundation of Fujian Province (Grant Nos. 2021J01861, 2020J01710), the Youth Innovation Fund of Xiamen City (Grant No. 3502Z20206020), the Open Fund of Digital Fujian Big Data Modeling and Intelligent Computing Institute, and the Pre-Research Fund of Jimei University.

Author information

Authors and Affiliations

School of Science, Jimei University, Xiamen, 361021, China
Jianxiang Qiu, Jialiang Xie & Dongxiao Zhang
School of Mathematics and Physics, Xiamen University Malaysia, Selangor, 43900, Malaysia
Ruping Zhang
College of Computer and Cyber Security, Fujian Normal University, Fuzhou, 350117, China
Mingwei Lin

Corresponding authors

Correspondence to Jialiang Xie or Mingwei Lin.

Ethics declarations

Conflict of interest

All authors declare that they have no conflict of interest.

Informed Consent

Informed consent was obtained from all individual participants included in the study.

Rights and permissions

Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.

About this article

Cite this article

Qiu, J., Xie, J., Zhang, D. et al. A Fuzzy Twin Support Vector Machine Based on Dissimilarity Measure and Its Biomedical Applications. Int. J. Fuzzy Syst. (2024). https://doi.org/10.1007/s40815-024-01725-z

Received: 14 November 2023
Revised: 17 February 2024
Accepted: 01 March 2024
Published: 17 May 2024
DOI: https://doi.org/10.1007/s40815-024-01725-z
