In order to improve the performance of semi-supervised learning, a safe semi-supervised classification algorithm using multiple classifiers ensemble (S3C-MC) is proposed. First, unlabeled samples are filtered and unlabeled samples with small ambiguity are selected for semi-supervised learning. Next, the labeled training set is sampled to multiple subsets and they generate multiple classifiers to predict the filtered unlabeled sample respectively. The predicted label is formed by multiple classifiers with weighted voting mechanism, and the weight of classifier is changing constantly according to the correctness of the prediction for unlabeled samples by classifier. Then, security verification is carried out to ensure that the classifier evolves in the direction of error reduction when the new sample is added. Only the label making classifiers error lower and having the same predictive value with the three classifiers in security verification is added into the labeled set to expand the number of labeled sets. Finally, the algorithm iterates until the unlabeled sample set is empty. The experiment is carried out on the UCI data set and the result shows that the proposed S3C-MC has good safety and has a higher classification rate.
This is a preview of subscription content, log in to check access.
This work was supported by Shangluo Universities Key Disciplines Project, Discipline name: Mathematics; Natural Science Basic Research Plan in Shaanxi Province of China (No.2015JM6347); Science Research Plan of Shangluo University (No.14SKY026); Horizontal Project of Shangluo University (No.2018HXKY056, 19HKY082).
Li M, Li H, Zhou ZH (2009) Semi-supervised document retrieval. Inf Process Manage 45(3):341–355CrossRefGoogle Scholar
Silva NFFD, Coletta LFS, Hruschka ER (2016) A Survey and comparative study of tweet sentiment analysis via semi-supervised learning. ACM Comput Surv 49(1):1–26Google Scholar
Camps-Valls G, Munoz-Mari J, Gomez-Chova L et al (2009) Biophysical parameter estimation with a semisupervised support vector machine. IEEE Geosci Remote Sens Lett 6(2):248–252CrossRefGoogle Scholar
Dornaika F, El Traboulsi Y, Dornaika F, El TY (2015) Learning flexible graph-based semi-supervised embedding. IEEE Trans Cybern 46(1):206–218CrossRefGoogle Scholar
Peng Y, Zhai X, Zhao Y et al (2016) Semi-supervised cross-media feature learning with unified patch graph regularization. IEEE Trans Circuits Syst Video Technol 26(3):583–596CrossRefGoogle Scholar
Abdelgayed TS, Morsi WG, Sidhu TS (2018) Fault detection and classification based on co-training of semi-supervised machine learning. IEEE Trans Ind Electron 65(2):1595–1605CrossRefGoogle Scholar
Blum A, Mitchell T (1998) Combining labeled and unlabeled data with co-training. In: Proceedings of the 11th annual conference on computational learning theory (COLT’98), pp 92–100. ACM, WisconsinGoogle Scholar
Yu ZW, Zhang YD, You JN et al (2019) Adaptive semi-supervised classifier ensemble for high dimensional data classification. IEEE Trans Cybern 49(2):366–379CrossRefGoogle Scholar
Keyvanpour MR, Imani MB (2013) Semi-supervised text categorization: exploiting unlabeled data using ensemble learning algorithms. Intell Data Anal 17(3):367–385CrossRefGoogle Scholar
Yu GX, Zhang GJ, Yu ZW et al (2012) Semi-supervised ensemble classification in subspaces. Appl Soft Comput 12(5):1511–1522CrossRefGoogle Scholar
Zhou ZH, Li M (2005) Tri-training: exploiting unlabeled data using three classifiers. IEEE Trans Knowl Data Eng 17(11):1529–1541CrossRefGoogle Scholar
Li Y, Guo M (2012) A new relational Tri-training system with adaptive data editing for inductive logic programming. Knowl-Based Syst 35:73–185Google Scholar
Li YF, Liang DM (2019) Safe semi-supervised learning: a brief introduction. Front Comput Sci 4:669–676CrossRefGoogle Scholar
Li YF, Zhou ZH (2015) Towards making unlabeled data never hurt. IEEE Trans Pattern Anal Mach Intell 37(1):175–188CrossRefGoogle Scholar
Li YF, Zhou ZH (2011) Improving semi-supervised support vector machines through unlabeled instances selection. In: Proceedings of the 25th AAAI conference on artificial intelligence, pp 386–391Google Scholar
Sang N, Gan H, Fan Y et al (2019) Adaptive safety degree-based safe semi-supervised learning. Int J Mach Learn Cybernet 10:1101–1108CrossRefGoogle Scholar
Goldman S, Zhou Y (2000) Enhancing supervised learning with unlabeled data. In: Proceedings of the 17th international conference on machine learning, pp 327–334Google Scholar
Soonthornphisaj N, Kijsirikul B (2004) Interative cross-training: an algorithm for learning from unlabeled Web pages. Int J Intell Syst 19(1–2):131–147CrossRefGoogle Scholar
Mallapragada PK, Jin R, Jain AK et al (2009) SemiBoost: boosting for semi-supervised learning. IEEE Trans Pattern Anal Mach Intell 31(11):2000–2014CrossRefGoogle Scholar
Peng J, Aved AJ, Seetharaman G et al (2018) Multiview boosting with information propagation for classification. IEEE Trans Neural Netw Learn Syst 29(3):657–669MathSciNetCrossRefGoogle Scholar