Abstract
Machine learning is used to analyze data from divergent perspectives, summarize it into expedient information, and use that information to predict the likelihood of future events. Classification is one of the main problems in the domain of machine learning. It is used to classify the predetermined data for a specific class and to predict the class label for unseen data. The aim here is to study various classification algorithms in machine learning applied to the considered Census income dataset. The algorithms used for this analysis are Logistic Regression, KNN Classifier, Decision Tree, Random Forest, AdaBoost Classifier, Support Vector Machine, Gradient Boosting Classifier, and Xtrim Gradient Boosting classifier. The performance is analyzed using various metrics such as Precision, Recall, F1-score, Accuracy, Macro Average, Weighted average, and ROC Area. It has been observed that the performance of the AdaBoost classifier, Gradient Boosting classifier, and Xtrim Gradient Boosting classifier is better than other algorithms for the considered dataset. The experimental analysis shows that the AdaBoost classifier, Gradient Boosting classifier, and Xtrim Gradient Boosting classifier are giving the best scores in terms of diversified contemplated performance measures.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Portugal I, Alencar P, Cowan D (2018) The use of machine learning algorithms in recommender systems: a systematic review. Expert Syst Appl 97:205–227
Krittanawong C et al (2017)Artificial intelligence in precision cardiovascular medicine. J Am College Cardiol 69(21):2657–2664
Lindsay GW et al (2021) Divergent representations of ethological visual inputs emerge from supervised, unsupervised, and reinforcement learning. arXiv:2112.02027
Sukanya G, Sivanagamani N, Jahnavi Y (2019)Country location classification on tweets. Indian J Publ Health Res Dev 10(5)
Jahnavi Y, Radhika Y (2013) Hot topic extraction based on frequency, position, scattering and topical weight for time sliced news documents. In: 2013 15th international conference on advanced computing technologies (ICACT). IEEE
Lakshmi M, Sukeerthi K, Jahnavi Y (2019)Security health monitoring and attestation of virtual machines in cloud computing. Indian J Publ Health Res Dev 10(5)
Jahnavi Y, Pavan Kumar Reddy Y, Sindhura VSK, Tiwari V, Srivastava S (2023) A novel processing of scalable web log data using map reduce framework. In: Shukla PK, Singh KP, Tripathi AK, Engelbrecht A (eds) Computer vision and robotics. Algorithms for intelligent systems. Springer, Singapore. https://doi.org/10.1007/978-981-19-7892-0_2
Almazroi AA, Khedr AE, Idrees AM (2021) A proposed customer relationship framework based on information retrieval for effective Firms’ competitiveness. Expert Syst Appl 176:114882
Bai J et al (2022)Multinomial random forest. Pattern Recogn 122:108331
Thakkar HK et al (2022)Clairvoyant: AdaBoost with cost-enabled cost-sensitive classifier for customer churn prediction. In: Computational intelligence and neuroscience 2022
Zhou J et al (2022) Predicting tunnel squeezing using support vector machine optimized by whale optimization algorithm. Acta Geotech 1–24
Gumaei A et al (2022)An effective approach for rumor detection of Arabic tweets using extreme gradient boosting method. Trans Asian Low-Resour Lang Inf Process 21(1):1–16
Rauber TW et al (2021)An experimental methodology to evaluate machine learning methods for fault diagnosis based on vibration signals. Expert Syst Appl 167:114022
Sarker IH (2021) Machine learning: Algorithms, real-world applications and research directions. SN Comput Sci 2(3):1–21
Sun D et al (2021)Assessment of landslide susceptibility mapping based on Bayesian hyperparameter optimization: a comparison between logistic regression and random forest. Eng Geol 281:105972
Wang Y, Pan Z, Dong J (2022) A new two-layer nearest neighbor selection method for kNN classifier. Knowl-Based Syst 235:107604
Jahnavi Y (2019) Statistical data mining technique for salient feature extraction. Int J Intell Syst Technol Appl 18(4):353–376
Jahnavi Y, Radhika Y (2012) A cogitate study on text mining. Int J Eng Adv Technol (IJEAT) 2249–8958
Jahnavi Y (2022) A new algorithm for time series prediction using machine learning models. EvolutIntell (Springer)
Yeturu J (2019) Analysis of weather data using various regression algorithms. Int J Data Sci 4(2):117–141
Książek W, Gandor M, Pławiak P (2021) Comparison of various approaches to combine logistic regression with genetic algorithms in survival prediction of hepatocellular carcinoma. Comput Biol Med 134:104431
Lin L et al (2022)Validation and refinement of cropland data layer using a spatial-temporal decision tree algorithm. Sci Data 9(1):1–9
Ma T et al (2022)Multiclassification prediction of clay sensitivity using extreme gradient boosting based on imbalanced dataset. Appl Sci 12(3):1143
Jahnavi Y et al (2021) A novel ensemble stacking classification of genetic variations using machine learning algorithms. Int J Image Graph. ISSN:0219-4678. https://doi.org/10.1142/S0219467823500158
Jahnavi Y, Radhika Y (2015) FPST: a new term weighting algorithm for long running and short-lived events. Int J Data Anal Tech Strateg 7(4):366–383
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2023 The Author(s), under exclusive license to Springer Nature Singapore Pte Ltd.
About this paper
Cite this paper
Jahnavi, Y., Balasaraswathi, V.R., Nagendra Kumar, P. (2023). Model Building and Heuristic Evaluation of Various Machine Learning Classifiers. In: Pandit, M., Gaur, M.K., Kumar, S. (eds) Artificial Intelligence and Sustainable Computing. ICSISCET 2022. Algorithms for Intelligent Systems. Springer, Singapore. https://doi.org/10.1007/978-981-99-1431-9_30
Download citation
DOI: https://doi.org/10.1007/978-981-99-1431-9_30
Published:
Publisher Name: Springer, Singapore
Print ISBN: 978-981-99-1430-2
Online ISBN: 978-981-99-1431-9
eBook Packages: Intelligent Technologies and RoboticsIntelligent Technologies and Robotics (R0)