Skip to main content

Enhancing Book Recommendations on GoodReads: A Data Mining Approach Based Random Forest Classification

  • Conference paper
  • First Online:
Innovations in Smart Cities Applications Volume 7 (SCA 2023)

Part of the book series: Lecture Notes in Networks and Systems ((LNNS,volume 938))

  • 104 Accesses

Abstract

With the rise of technology, new ways of finding books have emerged beyond traditional bookstores. Websites like www.goodreads.com allow readers to share their book reviews and ratings. This study uses the data from GoodReads to find the best way to suggest books to readers. Employing data mining classification, four methods - Random Forest, Naive Bayes, K-Nearest Neighbor, and Support Vector Classifier - were examined. Performance evaluation was conducted using accuracy, F-measure, recall, and precision metrics derived from the confusion matrix. Interestingly, the Random Forest algorithm stood out with remarkable results. It achieved 99.91% accuracy, 100% precision, 92% recall, a 95% F1-score, and a slight 0.09 average error. These impressive outcomes highlight the algorithm’s effectiveness in predicting user preferences and offering personalized book recommendations. Additionally, the study compared the Random Forest approach with the baseline methods, showing its clear superiority. This research showcases the promising potential of Random Forest in improving the GoodReads book recommendation system. Using the random forest classifier proved effective in predicting user preferences and generating relevant book recommendations, offering a promising approach to enhance personalized reading experiences, fitting well with changing reading habits in the digital era.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 149.00
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 199.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

  1. Mathew, P., Kuriakose, B., Hegde, V.: Book recommendation system through content based and collaborative filtering method. In: 2016 International Conference on Data Mining and Advanced Computing (SAPIENCE), pp 47–52 (2016)

    Google Scholar 

  2. Najafabadi, M.K., Mohamed, A.H.J., Mahrin, M.N.: A survey on data mining techniques in recommender systems. Soft. Comput. 23, 627–654 (2019). https://doi.org/10.1007/s00500-017-2918-7

    Article  Google Scholar 

  3. Mhammedi, S., Gherabi, N., El Massari, H., Amnai, M.: Enhancing recommendation system using ontology-based similarity and incremental SVD prediction. Recent Adv Comput Sci Commun 16, 67–76 (2023)

    Article  Google Scholar 

  4. Mhammedi, S., El Massari, H., Gherabi, N.: Improving scalability of recommendation system using ontology and dimensionality reduction techniques. AIP Conf. Proc. 2814, 020007 (2023). https://doi.org/10.1063/5.0149217

    Article  Google Scholar 

  5. Mhammedi, S., El Massari, H., Gherabi, N., Amnai, M.: CF recommender system based on ontology and nonnegative matrix factorization (NMF). In: Farhaoui, Y., Rocha, A., Brahmia, Z., Bhushab, B. (eds.) Artificial Intelligence and Smart Environment, pp. 313–318. Springer, Cham (2023). https://doi.org/10.1007/978-3-031-26254-8_44

    Chapter  Google Scholar 

  6. Goodreads-books (2022). https://www.kaggle.com/datasets/jealousleopard/goodreadsbooks. Accessed 22 Aug 2022

  7. Thelwall, M., Kousha, K.: Goodreads: a social network site for book readers. J. Assoc. Inf. Sci. Technol. 68, 972–983 (2016). https://doi.org/10.1002/asi.23733

    Article  Google Scholar 

  8. Furtado, F., Singh, A.: Movie recommendation system using machine learning. Int. J. Res. Ind. Eng. 9, 84–98 (2020). https://doi.org/10.22105/riej.2020.226178.1128

  9. Aher, S.B., Lobo, L.: Applicability of data mining algorithms for recommendation system in e-learning. In: Proceedings of the International Conference on Advances in Computing, Communications and Informatics, pp. 1034–1040. Association for Computing Machinery, New York (2012)

    Google Scholar 

  10. Amatriain, X., Pujol, J.M.: Data mining methods for recommender systems. In: Ricci, F., Rokach, L., Shapira, B. (eds.) Recommender Systems Handbook, pp. 227–262. Springer, Boston (2015). https://doi.org/10.1007/978-1-4899-7637-6_7

    Chapter  Google Scholar 

  11. Piletskiy, P., Chumachenko, D., Meniailov, I.: Development and analysis of intelligent recommendation system using machine learning approach. In: Nechyporuk, M., Pavlikov, V., Kritskiy, D. (eds.) Integrated Computer Technologies in Mechanical Engineering, pp. 186–197. Springer, Cham (2020). https://doi.org/10.1007/978-3-030-37618-5_17

    Chapter  Google Scholar 

  12. Nilashi, M.: An overview of data mining techniques in recommender systems. J. Soft. Comput. Decis. Support. Syst. 3, 16–44 (2016)

    Google Scholar 

  13. Roy, D., Dutta, M.: A systematic review and research perspective on recommender systems. J. Big Data 9, 59 (2022). https://doi.org/10.1186/s40537-022-00592-5

    Article  Google Scholar 

  14. Shambour, Q., Abualhaj, M.M., Abu-Shareha, A.: A trust-based recommender system for personalized restaurants recommendation. Int. J. Electr. Comput. Eng. Syst. 13(4), 293–299 (2022). https://doi.org/10.32985/ijeces.13.4.5

    Article  Google Scholar 

  15. Amin, M.S., Chiam, Y.K., Varathan, K.D.: Identification of significant features and data mining techniques in predicting heart disease. Telemat. Inform. 36, 82–93 (2019). https://doi.org/10.1016/j.tele.2018.11.007

    Article  Google Scholar 

  16. El Massari, H., Gherabi, N., Mhammedi, S., Sabouri, Z., Ghandi, H.: Ontology-based decision tree model for prediction of cardiovascular disease. Indian J. Comput. Sci. Eng. 13(3), 851–859 (2022). https://doi.org/10.21817/indjcse/2022/v13i3/221303143

    Article  Google Scholar 

  17. Atef, M., Elzanfaly, D.S., Ouf, S.: Early prediction of employee turnover using machine learning algorithms. Int. J. Electr. Comput. Eng. Syst. 13(2), 135–144 (2022). https://doi.org/10.32985/ijeces.13.2.6

    Article  Google Scholar 

  18. El Massari, H., Mhammedi, S., Gherabi, N.: Bridging the gap between the semantic web and big data: answering SPARQL queries over NoSQL databases. Int. J. Electr. Comput. Eng. 12(6), 6829 (2022). https://doi.org/10.11591/ijece.v12i6.pp6829-6835

    Article  Google Scholar 

  19. Sarker, I.H.: Machine learning: algorithms, real-world applications and research directions. SN Comput. Sci. 2(3), 1–21 (2021). https://doi.org/10.1007/s42979-021-00592-x

    Article  MathSciNet  Google Scholar 

  20. El Massari, H., Sabouri, Z., Mhammedi, S., Gherabi, N.: Diabetes prediction using machine learning algorithms and ontology. J. ICT Stand. 10, 319–337 (2022). https://doi.org/10.13052/jicts2245-800X.10212

    Article  Google Scholar 

  21. Bhavitha, B.K., Rodrigues, A.P., Chiplunkar N.N.: Comparative study of machine learning techniques in sentimental analysis. In: 2017 International Conference on Inventive Communication and Computational Technologies (ICICCT), pp. 216–221 (2017)

    Google Scholar 

  22. El Massari, H., Gherabi, N., Mhammedi, S., Ghandi, H., Qanouni, F., Bahaj, M.: Integration of ontology with machine learning to predict the presence of covid-19 based on symptoms. Bull. Electr. Eng. Inform. 11(5), 2805–2816 (2022). https://doi.org/10.11591/eei.v11i5.4392

    Article  Google Scholar 

  23. Pham, B., Khosravi, K., Prakash, I.: Application and comparison of decision tree-based machine learning methods in landside susceptibility assessment at Pauri Garhwal area, Uttarakhand, India. Environ. Process. 4, 711–730 (2017). https://doi.org/10.1007/s40710-017-0248-5

    Article  Google Scholar 

  24. Panagiotakis, C., Papadakis, H., Fragopoulou, P.: A dual hybrid recommender system based on SCoR and the random forest. Comput. Sci. Inf. Syst. 18, 115–128 (2021). https://doi.org/10.2298/CSIS200515046P

    Article  Google Scholar 

  25. Kuzey, C., Uyar, A., Delen, D.: An investigation of the factors influencing cost system functionality using decision trees, support vector machines and logistic regression. Int. J. Account. Inf. Manag. 27, 27–55 (2019). https://doi.org/10.1108/IJAIM-04-2017-0052

    Article  Google Scholar 

  26. Hegde, Y., Padma, S.K.: Sentiment analysis using random forest ensemble for mobile product reviews in Kannada. In: 2017 IEEE 7th International Advance Computing Conference (IACC), pp. 777–782 (2017)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Sajida Mhammedi .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2024 The Author(s), under exclusive license to Springer Nature Switzerland AG

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

Mhammedi, S., El Massari, H., Gherabi, N., Amnai, M. (2024). Enhancing Book Recommendations on GoodReads: A Data Mining Approach Based Random Forest Classification. In: Ben Ahmed, M., Boudhir, A.A., El Meouche, R., KaraÈ™, Ä°.R. (eds) Innovations in Smart Cities Applications Volume 7. SCA 2023. Lecture Notes in Networks and Systems, vol 938. Springer, Cham. https://doi.org/10.1007/978-3-031-54376-0_36

Download citation

  • DOI: https://doi.org/10.1007/978-3-031-54376-0_36

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-031-54375-3

  • Online ISBN: 978-3-031-54376-0

  • eBook Packages: EngineeringEngineering (R0)

Publish with us

Policies and ethics