Skip to main content

Model Building and Heuristic Evaluation of Various Machine Learning Classifiers

  • Conference paper
  • First Online:
Artificial Intelligence and Sustainable Computing (ICSISCET 2022)

Abstract

Machine learning is used to analyze data from divergent perspectives, summarize it into expedient information, and use that information to predict the likelihood of future events. Classification is one of the main problems in the domain of machine learning. It is used to classify the predetermined data for a specific class and to predict the class label for unseen data. The aim here is to study various classification algorithms in machine learning applied to the considered Census income dataset. The algorithms used for this analysis are Logistic Regression, KNN Classifier, Decision Tree, Random Forest, AdaBoost Classifier, Support Vector Machine, Gradient Boosting Classifier, and Xtrim Gradient Boosting classifier. The performance is analyzed using various metrics such as Precision, Recall, F1-score, Accuracy, Macro Average, Weighted average, and ROC Area. It has been observed that the performance of the AdaBoost classifier, Gradient Boosting classifier, and Xtrim Gradient Boosting classifier is better than other algorithms for the considered dataset. The experimental analysis shows that the AdaBoost classifier, Gradient Boosting classifier, and Xtrim Gradient Boosting classifier are giving the best scores in terms of diversified contemplated performance measures.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 229.00
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Hardcover Book
USD 299.99
Price excludes VAT (USA)
  • Durable hardcover edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

  1. Portugal I, Alencar P, Cowan D (2018) The use of machine learning algorithms in recommender systems: a systematic review. Expert Syst Appl 97:205–227

    Article  Google Scholar 

  2. Krittanawong C et al (2017)Artificial intelligence in precision cardiovascular medicine. J Am College Cardiol 69(21):2657–2664

    Google Scholar 

  3. Lindsay GW et al (2021) Divergent representations of ethological visual inputs emerge from supervised, unsupervised, and reinforcement learning. arXiv:2112.02027

  4. Sukanya G, Sivanagamani N, Jahnavi Y (2019)Country location classification on tweets. Indian J Publ Health Res Dev 10(5)

    Google Scholar 

  5. Jahnavi Y, Radhika Y (2013) Hot topic extraction based on frequency, position, scattering and topical weight for time sliced news documents. In: 2013 15th international conference on advanced computing technologies (ICACT). IEEE

    Google Scholar 

  6. Lakshmi M, Sukeerthi K, Jahnavi Y (2019)Security health monitoring and attestation of virtual machines in cloud computing. Indian J Publ Health Res Dev 10(5)

    Google Scholar 

  7. Jahnavi Y, Pavan Kumar Reddy Y, Sindhura VSK, Tiwari V, Srivastava S (2023) A novel processing of scalable web log data using map reduce framework. In: Shukla PK, Singh KP, Tripathi AK, Engelbrecht A (eds) Computer vision and robotics. Algorithms for intelligent systems. Springer, Singapore. https://doi.org/10.1007/978-981-19-7892-0_2

  8. Almazroi AA, Khedr AE, Idrees AM (2021) A proposed customer relationship framework based on information retrieval for effective Firms’ competitiveness. Expert Syst Appl 176:114882

    Google Scholar 

  9. Bai J et al (2022)Multinomial random forest. Pattern Recogn 122:108331

    Google Scholar 

  10. Thakkar HK et al (2022)Clairvoyant: AdaBoost with cost-enabled cost-sensitive classifier for customer churn prediction. In: Computational intelligence and neuroscience 2022

    Google Scholar 

  11. Zhou J et al (2022) Predicting tunnel squeezing using support vector machine optimized by whale optimization algorithm. Acta Geotech 1–24

    Google Scholar 

  12. Gumaei A et al (2022)An effective approach for rumor detection of Arabic tweets using extreme gradient boosting method. Trans Asian Low-Resour Lang Inf Process 21(1):1–16

    Google Scholar 

  13. Rauber TW et al (2021)An experimental methodology to evaluate machine learning methods for fault diagnosis based on vibration signals. Expert Syst Appl 167:114022

    Google Scholar 

  14. Sarker IH (2021) Machine learning: Algorithms, real-world applications and research directions. SN Comput Sci 2(3):1–21

    Article  MathSciNet  Google Scholar 

  15. Sun D et al (2021)Assessment of landslide susceptibility mapping based on Bayesian hyperparameter optimization: a comparison between logistic regression and random forest. Eng Geol 281:105972

    Google Scholar 

  16. Wang Y, Pan Z, Dong J (2022) A new two-layer nearest neighbor selection method for kNN classifier. Knowl-Based Syst 235:107604

    Article  Google Scholar 

  17. Jahnavi Y (2019) Statistical data mining technique for salient feature extraction. Int J Intell Syst Technol Appl 18(4):353–376

    Google Scholar 

  18. Jahnavi Y, Radhika Y (2012) A cogitate study on text mining. Int J Eng Adv Technol (IJEAT) 2249–8958

    Google Scholar 

  19. Jahnavi Y (2022) A new algorithm for time series prediction using machine learning models. EvolutIntell (Springer)

    Google Scholar 

  20. Yeturu J (2019) Analysis of weather data using various regression algorithms. Int J Data Sci 4(2):117–141

    Article  Google Scholar 

  21. Książek W, Gandor M, Pławiak P (2021) Comparison of various approaches to combine logistic regression with genetic algorithms in survival prediction of hepatocellular carcinoma. Comput Biol Med 134:104431

    Article  Google Scholar 

  22. Lin L et al (2022)Validation and refinement of cropland data layer using a spatial-temporal decision tree algorithm. Sci Data 9(1):1–9

    Google Scholar 

  23. Ma T et al (2022)Multiclassification prediction of clay sensitivity using extreme gradient boosting based on imbalanced dataset. Appl Sci 12(3):1143

    Google Scholar 

  24. Jahnavi Y et al (2021) A novel ensemble stacking classification of genetic variations using machine learning algorithms. Int J Image Graph. ISSN:0219-4678. https://doi.org/10.1142/S0219467823500158

  25. Jahnavi Y, Radhika Y (2015) FPST: a new term weighting algorithm for long running and short-lived events. Int J Data Anal Tech Strateg 7(4):366–383

    Article  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Y. Jahnavi .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2023 The Author(s), under exclusive license to Springer Nature Singapore Pte Ltd.

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

Jahnavi, Y., Balasaraswathi, V.R., Nagendra Kumar, P. (2023). Model Building and Heuristic Evaluation of Various Machine Learning Classifiers. In: Pandit, M., Gaur, M.K., Kumar, S. (eds) Artificial Intelligence and Sustainable Computing. ICSISCET 2022. Algorithms for Intelligent Systems. Springer, Singapore. https://doi.org/10.1007/978-981-99-1431-9_30

Download citation

Publish with us

Policies and ethics