Skip to main content

Juxtaposition on Classifiers in Modeling Hepatitis Diagnosis Data

  • Chapter
New Trends in Computational Vision and Bio-inspired Computing

Abstract

Machine Learning and Data Mining have been used extensively in the field of medical science. Approximately 2% of the world population, i.e., 3.9 million people are infected by Hepatitis C. This paper is an investigative study on the comparison of classification models—Support Vector Machine, Random Forest Classifier, Decision Tree Classifier, Logistic Regression, and Naive Bayes Classifier—modeling Hepatitis C Data based on various performance measures—Accuracy, Balanced Accuracy, Precision, Recall, F1-Measure, Matthews Correlation Coefficient and many more using R Programming Language. On normalizing the numerical attributes using Z-score Normalization and using the holdout method for the Train Test data split of 80–20%, the result shows that Random Forest outperforms the other classifiers with an accuracy of 90.7%, followed by Support Vector Machine, Logistic Regression, Decision Tree Classifier, and Naive Bayes Classifier.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 169.00
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 219.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info
Hardcover Book
USD 219.99
Price excludes VAT (USA)
  • Durable hardcover edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Similar content being viewed by others

References

  1. M. Ramasamy, S. Selvaraj, and M. Mayilvaganan. An empirical analysis of decision tree algorithms: Modeling hepatitis data. In 2015 IEEE International Conference on Engineering and Technology (ICETECH), pages 1–4, 2015.

    Google Scholar 

  2. Huda Yasin, Tahseen A. Jilani, and Madiha Danish. Hepatitis-c classification using data mining techniques. International Journal of Computer Applications, 24(3):1–6, 2011.

    Article  Google Scholar 

  3. World Health Organization et al. Global hepatitis report 2017. World Health Organization, 2017.

    Google Scholar 

  4. A.H. Roslina and A. Noraziah. Prediction of hepatitis prognosis using support vector machines and wrapper method. In 2010 Seventh International Conference on Fuzzy Systems and Knowledge Discovery, volume 5, pages 2209–2211, 2010.

    Google Scholar 

  5. S. Ekız and P. Erdogmus. Comparative study of heart disease classification. In 2017 Electric Electronics, Computer Science, Biomedical Engineerings’ Meeting (EBBT), pages 1–4, 2017.

    Google Scholar 

  6. Dheeru Dua and Casey Graff. UCI machine learning repository, 2017.

    Google Scholar 

  7. Strother H. Walker and David B. Duncan. Estimation of the probability of an event as a function of several independent variables. Biometrika, 54(1/2):167–179, 1967.

    Article  MathSciNet  Google Scholar 

  8. David J. Hand and Keming Yu. Idiot’s bayes: Not so stupid after all? International Statistical Review/Revue Internationale de Statistique, 69(3):385–398, 2001.

    Google Scholar 

  9. Corinna Cortes and Vladimir Vapnik. Support-vector networks. Machine Learning, 20(3):273–297, 1995.

    Article  Google Scholar 

  10. J. Ross Quinlan. Induction of decision trees. Machine Learning, 1(1):81–106, 1986.

    Article  Google Scholar 

  11. Leo Breiman. Random forests. Machine Learning, 45(1):5–32, 2001.

    Article  Google Scholar 

  12. Jiawei Han, Jian Pei, and Micheline Kamber. Data mining: concepts and techniques. Elsevier, 2011.

    Google Scholar 

  13. Tom Fawcett. An introduction to roc analysis. Pattern Recognition Letters, 27(8):861–874, 2006.

    Article  MathSciNet  Google Scholar 

  14. B.W. Matthews. Comparison of the predicted and observed secondary structure of t4 phage lysozyme. Biochimica et Biophysica Acta (BBA) - Protein Structure, 405(2):442 – 451, 1975.

    Google Scholar 

  15. David Martin Powers. Evaluation: from precision, recall and f-measure to roc, informedness, markedness and correlation. 2011.

    Google Scholar 

  16. Donald B. Rubin. Statistical matching using file concatenation with adjusted weights and multiple imputations. Journal of Business Economic Statistics, 4(1):87–94, 1986.

    Article  MathSciNet  Google Scholar 

  17. D. Freedman, R. Pisani, and R. Purves. Statistics: Fourth International Student Edition. International student edition. W.W. Norton & Company, 2007.

    Google Scholar 

  18. Vikas K Vijayan, KR Bindu, and Latha Parameswaran. A comprehensive study of text classification algorithms. In 2017 International Conference on Advances in Computing, Communications and Informatics (ICACCI), pages 1109–1113. IEEE, 2017.

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to K. R. Bindu .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2020 Springer Nature Switzerland AG

About this chapter

Cite this chapter

Ganesh, P., Vasu, H.V., Santhakumar, K.G., Rajan, R.A., Bindu, K.R. (2020). Juxtaposition on Classifiers in Modeling Hepatitis Diagnosis Data. In: Smys, S., Iliyasu, A.M., Bestak, R., Shi, F. (eds) New Trends in Computational Vision and Bio-inspired Computing. Springer, Cham. https://doi.org/10.1007/978-3-030-41862-5_48

Download citation

Publish with us

Policies and ethics