Abstract
Graph-based semi-supervised learning (GSSL) has received more and more attention due to its efficiency and accuracy. Label propagation is a critical step in GSSL that propagates label information to unlabeled data through the structure of graph. However, the traditional label propagation algorithms treat all unlabeled samples as equivalent and blindly propagate label information to all neighbors without considering their reliabilities. In this case, some unreliable samples may mislead the process of label propagation, thus greatly reducing the accuracy of classification. In order to solve this problem, this paper proposes a novel label propagation algorithm called node influence-based label propagation (NILP). Based on the structure of graph, the NILP algorithm measures the influences of nodes by calculating their degrees and local densities. In the process of label propagation, the label information is preferentially transmitted to the influential neighbors to control the propagation sequence and prevent wrong propagation. Moreover, our algorithm improves the transition matrix by integrating label information and feature information. The experimental results on both synthetic and real-world benchmark datasets show that the proposed method is superior to some existing label propagation algorithms. Especially when the number of labeled samples is very small, the advantage of NILP algorithm is more obvious.
Similar content being viewed by others
References
Appice A, Guccione P, Malerba D (2017) A novel spectral-spatial co-training algorithm for the transductive classification of hyperspectral imagery data. Pattern Recognition 63:229–245
Bahrami S, Bosaghzadeh A, Dornaika F (2018) Graph fusion with correlation graph in semisupervised learning. In: 2018 4th Iranian Conference on Signal Processing and Intelligent Systems (ICSPIS), pp. 125–130. IEEE
Berton L, de Andrade Lopes A (2014) Graph construction based on labeled instances for semi-supervised learning. In: 2014 22nd International Conference on Pattern Recognition, pp. 2477–2482. IEEE
Bhatia V, Rani R (2017) An efficient influence based label propagation algorithm for clustering large graphs. In: 2017 International Conference on Infocom Technologies and Unmanned Systems (Trends and Future Directions)(ICTUS), pp. 1–7. IEEE
Dong W, Moses C, Li K (2011) Efficient k-nearest neighbor graph construction for generic similarity measures. In: Proceedings of the 20th international conference on World wide web, pp. 577–586
Dornaika F, Dahbi R, Bosaghzadeh A, Ruichek Y (2017) Efficient dynamic graph construction for inductive semi-supervised learning. Neural Networks 94:192–203
Dornaika F, El Traboulsi Y (2019) Joint sparse graph and flexible embedding for graph-based semi-supervised learning. Neural Networks 114:91–95
Druck G, McCallum A (2010) High-performance semi-supervised learning using discriminatively constrained generative models. In: ICML
Du B, Xinyao T, Wang Z, Zhang L, Tao D (2018) Robust graph-based semisupervised learning for noisy labeled data via maximum correntropy criterion. IEEE transactions on cybernetics 49(4):1440–1453
El Kouni IB, Karoui W, Romdhane LB (2019) Node importance based label propagation algorithm for overlapping community detection in networks. Expert Systems with Applications p. 113020
Gong C, Tao D, Liu W, Liu L, Yang J (2017) Label propagation via teaching-to-learn and learning-to-teach. IEEE Transactions on Neural Networks and Learning Systems 28(6):1452–1465. https://doi.org/10.1109/TNNLS.2016.2514360
Kingma DP, Mohamed S, Rezende DJ, Welling M (2014) Semi-supervised learning with deep generative models. In: Advances in neural information processing systems, pp. 3581–3589
Li M, Zhou ZH (2005) Setred: Self-training with editing. In: Pacific-Asia Conference on Knowledge Discovery and Data Mining
Li Y, Guo M (2012) A new relational tri-training system with adaptive data editing for inductive logic programming. Knowledge-Based Systems 35:173–185
Liu C, Hsaio W, Lee C, Chang T, Kuo T (2016) Semi-supervised text classification with universum learning. IEEE Transactions on Cybernetics 46(2):462–473. https://doi.org/10.1109/TCYB.2015.2403573
Liu W, Wang J, Chang S (2012) Robust and scalable graph-based semisupervised learning. Proceedings of the IEEE 100(9):2624–2638. https://doi.org/10.1109/JPROC.2012.2197809
Lu Z, Wang L (2015) Noise-robust semi-supervised learning via fast sparse coding. Pattern Recognition 48(2):605–612. 10.1016/j.patcog.2014.08.019. http://www.sciencedirect.com/science/article/pii/S0031320314003331
Ma L, Ma A, Ju C, Li X (2016) Graph-based semi-supervised learning for spectral-spatial hyperspectral image classification. Pattern Recognition Letters 83:133–142. 10.1016/j.patrec.2016.01.022. http://www.sciencedirect.com/science/article/pii/S0167865516000349. Advances in Pattern Recognition in Remote Sensing
Nie F, Xiang S, Liu Y, Zhang C (2010) A general graph-based semi-supervised learning with novel class discovery. Neural Computing and Applications 19(4):549–555
Nie F, Xu D, Tsang IWH, Zhang C (2010) Flexible manifold embedding: A framework for semi-supervised and unsupervised dimension reduction. IEEE Transactions on Image Processing 19(7):1921–1932
Nigam K, McCallum AK, Thrun S, Mitchell T (2000) Text classification from labeled and unlabeled documents using em. Machine learning 39(2–3):103–134
Prakash VJ, Nithya LM (2014) A survey on semi-supervised learning techniques. International Journal of Computer Trends and Technology 8(1):25–29
Saul LK, Roweis ST (2003) Think globally, fit locally: unsupervised learning of low dimensional manifolds. Journal of Machine Learning Research 4:119–155
Seyedi SA, Lotfi A, Moradi P, Qader NN (2019) Dynamic graph-based label propagation for density peaks clustering. Expert Systems with Applications 115:314–328
de Sousa CA (2015) An overview on the gaussian fields and harmonic functions method for semi-supervised learning. In: 2015 International Joint Conference on Neural Networks (IJCNN), pp. 1–8. IEEE
Triguero I, Sáez JA, Luengo J, García S, Herrera F (2014) On the characterization of noise filters for self-training semi-supervised in nearest neighbor classification. Neurocomputing 132:30–41
Wang B, Tsotsos J (2016) Dynamic label propagation for semi-supervised multi-class multi-label classification. Pattern Recognition 52:75–84. 10.1016/j.patcog.2015.10.006. http://www.sciencedirect.com/science/article/pii/S0031320315003738
Wang F, Zhang C (2008) Label propagation through linear neighborhoods. IEEE Transactions on Knowledge and Data Engineering 20(1):55–67. https://doi.org/10.1109/TKDE.2007.190672
Wang M, Fu W, Hao S, Tao D, Wu X (2016) Scalable semi-supervised learning by efficient anchor graph regularization. IEEE Transactions on Knowledge and Data Engineering 28(7):1864–1877. https://doi.org/10.1109/TKDE.2016.2535367
Wang T, Ji Z, Sun Q, Chen Q, Yu S, Fan W, Yuan S, Liu Q (2016) Label propagation and higher-order constraint-based segmentation of fluid-associated regions in retinal sd-oct images. Information Sciences 358–359:92–111. https://doi.org/10.1016/j.ins.2016.04.017. http://www.sciencedirect.com/science/article/pii/S0020025516302523
Wu D, Shang M, Wang G, Li L (2018) A self-training semi-supervised classification algorithm based on density peaks of data and differential evolution. In: 2018 IEEE 15th International Conference on Networking, Sensing and Control (ICNSC), pp. 1–6. IEEE
Yu J, Kim SB (2018) Consensus rate-based label propagation for semi-supervised classification. Information Sciences 465:265–284
Zhang H, Zhang Z, Li S, Ye Q, Zhao M, Wang M (2018) Robust adaptive label propagation by double matrix decomposition. In: 2018 24th International Conference on Pattern Recognition (ICPR), pp. 2160–2165
Zhang Z, Li F, Jia L, Qin J, Zhang L, Yan S (2017) Robust adaptive embedded label propagation with weight learning for inductive classification. IEEE transactions on neural networks and learning systems 29(8):3388–3403
Zhang Z, Zhao M, Chow TW (2014) Graph based constrained semi-supervised learning framework via label propagation over adaptive neighborhood. IEEE Transactions on Knowledge and Data Engineering
Zhou D, Bousquet O, Lal TN, Weston J, Olkopf BS (2004) Learning with local and global consistency. Advances in neural information processing systems 16(3):
Zhou ZH, Li M (2010) Semi-supervised learning by disagreement. Knowledge and Information Systems 24(3):415–439
Zhu X (2005) Semi-supervised learning literature survey
Zhu X, Ghahramani Z (2002) Learning from labeled and unlabeled data with label propagation
Zhu X, Ghahramani Z, Lafferty JD (2003) Semi-supervised learning using gaussian fields and harmonic functions. In: Proceedings of the 20th International conference on Machine learning (ICML-03), pp. 912–919
Zhu X, Lafferty J, Rosenfeld R (2005) Semi-supervised learning with graphs
Zhuang L, Zhou Z, Gao S, Yin J, Lin Z, Ma Y (2017) Label information guided graph construction for semi-supervised learning. IEEE Transactions on Image Processing 26(9):4182–4192
Acknowledgements
This work was supported by National Natural Science Foundation of China grant 61573266 and the University Natural Science Research Key Projects of Anhui Province(KJ2019A0816).
Author information
Authors and Affiliations
Corresponding author
Additional information
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
About this article
Cite this article
Hua, Z., Yang, Y. & Qiu, H. Node influence-based label propagation algorithm for semi-supervised learning. Neural Comput & Applic 33, 2753–2768 (2021). https://doi.org/10.1007/s00521-020-05078-0
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s00521-020-05078-0