Abstract
The rapid growth of HCI applications results in increased data size and complexity. For this, advanced machine learning techniques and data analysis solutions are used to prepare and process data patterns. However, the cost of data pre-processing, labelling, and classification can be significantly increased if the dataset is huge, complex, and unlabelled. This paper aims to propose a data pre-processing approach and semi-supervised learning technique to prepare and classify a big Motion Capture Hand Postures dataset. It builds the solutions via Tri-training and Co-forest techniques and compares them to figure out the best-fitted approach for hand posture classification. According to the results, Co-forest outperforms Tri-training in terms of Accuracy, Precision, recall, and F1-score.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Zhou, Z.H., Li, M.: Tri-training: exploiting unlabeled data using three classifiers. IEEE Trans. Knowl. Data Eng. 17(11), 1529–1541 (2005)
Triguero, I., García, S., Herrera, F.: Self-labeled techniques for semi-supervised learning: taxonomy, software and empirical study. Knowl. Inf. Syst. 42(2), 245–284 (2015)
Zhou, L., Pan, S., Wang, J., Vasilakos, A.V.: Machine learning on big data: opportunities and challenges. Neurocomputing 237, 350–361 (2017)
Gardner, A., Duncan, C. A., Kanno, J., Selmic, R.: 3D hand posture recognition from small unlabeled point sets. In: 2014 IEEE International Conference on Systems, Man and Cybernetics (SMC), pp. 164–169 (2014)
Elgendy, N., Elragal, A.: Big data analytics: a literature review paper. In: Perner, P. (ed.) ICDM 2014. LNCS (LNAI), vol. 8557, pp. 214–227. Springer, Cham (2014). https://doi.org/10.1007/978-3-319-08976-8_16
L’heureux, A., Grolinger, K., Elyamany, H.F., Capretz, M.A.: Machine learning with big data: challenges and approaches. IEEE Access 5, 7776–7797 (2017)
Chawla, N.V., Karakoulas, G.: Learning from labeled and unlabeled data: an empirical study across techniques and domains. J. Artif. Intell. Res. 23, 331–366 (2005)
Zhu, X., Goldberg, A.B.: Introduction to semi-supervised learning. Synth. Lect. Artif. Intell. Mach. Learn. 3(1), 1–130 (2009)
Chen, K., Wang, S.: Semi-supervised learning via regularized boosting working on multiple semi-supervised assumptions. IEEE Trans. Pattern Anal. Mach. Intell. 33(1), 129–143 (2010)
Reddy, Y.C.A.P., Viswanath, P., Reddy, B.E.: Semi-supervised learning: a brief review. Int. J. Eng. Technol. 7(1.8), 81 (2018)
Sawant, S.S., Prabukumar, M.: A review on graph-based semi-supervised learning methods for hyperspectral image classification. Egypt. J. Remote Sens. Space Sci. 23(2), 243–248 (2020)
Kacheria, A.: Semi-Supervised Learning Algorithm for Large Datasets Using Spark Environment (Doctoral dissertation, University of Cincinnati) (2021)
BalaAnand, M., Karthikeyan, N., Karthik, S., Varatharajan, R., Manogaran, G., Sivaparthipan, C.B.: An enhanced graph-based semi-supervised learning algorithm to detect fake users on Twitter. J. Supercomput. 75(9), 6085–6105 (2019). https://doi.org/10.1007/s11227-019-02948-w
Melo-Acosta, G.E., Duitama-Munoz, F., Arias-Londono, J.D.: Fraud detection in big data using supervised and semi-supervised learning techniques. In: 2017 IEEE Colombian Conference on Communications and Computing (COLCOM), pp. 1–6. IEEE (2017)
Rosenberg, C., Hebert, M., Schneiderman, H.: Semi-supervised self-training of object detection models (2005)
Riloff, E., Wiebe, J., Phillips, W.: Exploiting subjectivity classification to improve information extraction. In: AAAI, pp. 1106–1111 (2005)
Xia, Y., et al.: 3D semi-supervised learning with uncertainty-aware multi-view co-training. In: Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision, pp. 3646–3655 (2020)
Maeireizo, B., Litman, D., Hwa, R.: Co-training for predicting emotions with spoken dialogue data. In: Proceedings of the ACL Interactive Poster and Demonstration Sessions, pp. 202–205 (2004)
Li, M., Zhou, Z.H.: Improve computer-aided diagnosis with machine learning techniques using undiagnosed samples. IEEE Trans. Syst. Man Cybernet.-Part A: Syst. Hum. 37(6), 1088–1098 (2007)
Aziz, K., Zaidouni, D., Bellafkih, M.: Leveraging resource management for efficient performance of Apache Spark. J. Big Data 6(1), 1–23 (2019). https://doi.org/10.1186/s40537-019-0240-1
Kostopoulos, G., Kotsiantis, S., Pintelas, P.: Estimating student dropout in distance higher education using semi-supervised techniques. In: Proceedings of the 19th Panhellenic Conference on Informatics, pp. 38–43 (2015)
Hady, F.A.M., Schwenker, F.: Combining committee-based semi-supervised learning and active learning. J. Comput. Sci. Technol. 25(4), 681–698 (2010)
Li, K., Zhang, W., Ma, X., Cao, Z., Zhang, C.: A novel semi-supervised SVM based on Tri-training. In: 2008 Second International Symposium on Intelligent Information Technology Application, vol. 3, pp. 47–51. IEEE (2008)
Penchikala, S.: Big data processing with apache spark (2018). https://www.lulu.com
Meng, X., et al.: MLlib: machine learning in apache spark. J. Mach. Learn. Res. 17(1), 1235–1241 (2016)
Armbrust, M., et al.: Scaling spark in the real world: performance and usability. Proc. VLDB Endow. 8(12), 1840–1843 (2015)
López, V., Fernández, A., García, S., Palade, V., Herrera, F.: An insight into classification with imbalanced data: empirical results and current trends on using data intrinsic characteristics. Inf. Sci. 250, 113–141 (2013)
García, S., Ramírez-Gallego, S., Luengo, J., Benítez, J.M., Herrera, F.: Big data preprocessing: methods and prospects. Big Data Anal. 1(1), 1–22 (2016)
Bennett, D.A.: How can I deal with missing data in my study? Aust. N. Z. J. Public Health 25(5), 464–469 (2001)
Zhang, J., Yang, Z., Benslimane, Y.: Exploring and evaluating the scalability and efficiency of apache spark using educational datasets. In: 2019 International Conference on Machine Learning and Cybernetics (ICMLC), pp. 1–6. IEEE (2019)
Grandini, M., Bagli, E., Visani, G.: Metrics for multi-class classification: an overview. arXiv preprint arXiv:2008.05756 (2020)
Spark. Apache Spark (2022). https://spark.apache.org/ Accessed Aug 2022
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2023 ICST Institute for Computer Sciences, Social Informatics and Telecommunications Engineering
About this paper
Cite this paper
Nan, K., Hu, S., Luo, H., Wong, P., Pourroostaei Ardakani, S. (2023). A Semi-supervised Learning Application for Hand Posture Classification. In: Hou, R., Huang, H., Zeng, D., Xia, G., A. Ghany, K.K., Zawbaa, H.M. (eds) Big Data Technologies and Applications. BDTA BDTA 2022 2021. Lecture Notes of the Institute for Computer Sciences, Social Informatics and Telecommunications Engineering, vol 480. Springer, Cham. https://doi.org/10.1007/978-3-031-33614-0_10
Download citation
DOI: https://doi.org/10.1007/978-3-031-33614-0_10
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-031-33613-3
Online ISBN: 978-3-031-33614-0
eBook Packages: Computer ScienceComputer Science (R0)