Abstract
With the rise of technology, new ways of finding books have emerged beyond traditional bookstores. Websites like www.goodreads.com allow readers to share their book reviews and ratings. This study uses the data from GoodReads to find the best way to suggest books to readers. Employing data mining classification, four methods - Random Forest, Naive Bayes, K-Nearest Neighbor, and Support Vector Classifier - were examined. Performance evaluation was conducted using accuracy, F-measure, recall, and precision metrics derived from the confusion matrix. Interestingly, the Random Forest algorithm stood out with remarkable results. It achieved 99.91% accuracy, 100% precision, 92% recall, a 95% F1-score, and a slight 0.09 average error. These impressive outcomes highlight the algorithm’s effectiveness in predicting user preferences and offering personalized book recommendations. Additionally, the study compared the Random Forest approach with the baseline methods, showing its clear superiority. This research showcases the promising potential of Random Forest in improving the GoodReads book recommendation system. Using the random forest classifier proved effective in predicting user preferences and generating relevant book recommendations, offering a promising approach to enhance personalized reading experiences, fitting well with changing reading habits in the digital era.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Mathew, P., Kuriakose, B., Hegde, V.: Book recommendation system through content based and collaborative filtering method. In: 2016 International Conference on Data Mining and Advanced Computing (SAPIENCE), pp 47–52 (2016)
Najafabadi, M.K., Mohamed, A.H.J., Mahrin, M.N.: A survey on data mining techniques in recommender systems. Soft. Comput. 23, 627–654 (2019). https://doi.org/10.1007/s00500-017-2918-7
Mhammedi, S., Gherabi, N., El Massari, H., Amnai, M.: Enhancing recommendation system using ontology-based similarity and incremental SVD prediction. Recent Adv Comput Sci Commun 16, 67–76 (2023)
Mhammedi, S., El Massari, H., Gherabi, N.: Improving scalability of recommendation system using ontology and dimensionality reduction techniques. AIP Conf. Proc. 2814, 020007 (2023). https://doi.org/10.1063/5.0149217
Mhammedi, S., El Massari, H., Gherabi, N., Amnai, M.: CF recommender system based on ontology and nonnegative matrix factorization (NMF). In: Farhaoui, Y., Rocha, A., Brahmia, Z., Bhushab, B. (eds.) Artificial Intelligence and Smart Environment, pp. 313–318. Springer, Cham (2023). https://doi.org/10.1007/978-3-031-26254-8_44
Goodreads-books (2022). https://www.kaggle.com/datasets/jealousleopard/goodreadsbooks. Accessed 22 Aug 2022
Thelwall, M., Kousha, K.: Goodreads: a social network site for book readers. J. Assoc. Inf. Sci. Technol. 68, 972–983 (2016). https://doi.org/10.1002/asi.23733
Furtado, F., Singh, A.: Movie recommendation system using machine learning. Int. J. Res. Ind. Eng. 9, 84–98 (2020). https://doi.org/10.22105/riej.2020.226178.1128
Aher, S.B., Lobo, L.: Applicability of data mining algorithms for recommendation system in e-learning. In: Proceedings of the International Conference on Advances in Computing, Communications and Informatics, pp. 1034–1040. Association for Computing Machinery, New York (2012)
Amatriain, X., Pujol, J.M.: Data mining methods for recommender systems. In: Ricci, F., Rokach, L., Shapira, B. (eds.) Recommender Systems Handbook, pp. 227–262. Springer, Boston (2015). https://doi.org/10.1007/978-1-4899-7637-6_7
Piletskiy, P., Chumachenko, D., Meniailov, I.: Development and analysis of intelligent recommendation system using machine learning approach. In: Nechyporuk, M., Pavlikov, V., Kritskiy, D. (eds.) Integrated Computer Technologies in Mechanical Engineering, pp. 186–197. Springer, Cham (2020). https://doi.org/10.1007/978-3-030-37618-5_17
Nilashi, M.: An overview of data mining techniques in recommender systems. J. Soft. Comput. Decis. Support. Syst. 3, 16–44 (2016)
Roy, D., Dutta, M.: A systematic review and research perspective on recommender systems. J. Big Data 9, 59 (2022). https://doi.org/10.1186/s40537-022-00592-5
Shambour, Q., Abualhaj, M.M., Abu-Shareha, A.: A trust-based recommender system for personalized restaurants recommendation. Int. J. Electr. Comput. Eng. Syst. 13(4), 293–299 (2022). https://doi.org/10.32985/ijeces.13.4.5
Amin, M.S., Chiam, Y.K., Varathan, K.D.: Identification of significant features and data mining techniques in predicting heart disease. Telemat. Inform. 36, 82–93 (2019). https://doi.org/10.1016/j.tele.2018.11.007
El Massari, H., Gherabi, N., Mhammedi, S., Sabouri, Z., Ghandi, H.: Ontology-based decision tree model for prediction of cardiovascular disease. Indian J. Comput. Sci. Eng. 13(3), 851–859 (2022). https://doi.org/10.21817/indjcse/2022/v13i3/221303143
Atef, M., Elzanfaly, D.S., Ouf, S.: Early prediction of employee turnover using machine learning algorithms. Int. J. Electr. Comput. Eng. Syst. 13(2), 135–144 (2022). https://doi.org/10.32985/ijeces.13.2.6
El Massari, H., Mhammedi, S., Gherabi, N.: Bridging the gap between the semantic web and big data: answering SPARQL queries over NoSQL databases. Int. J. Electr. Comput. Eng. 12(6), 6829 (2022). https://doi.org/10.11591/ijece.v12i6.pp6829-6835
Sarker, I.H.: Machine learning: algorithms, real-world applications and research directions. SN Comput. Sci. 2(3), 1–21 (2021). https://doi.org/10.1007/s42979-021-00592-x
El Massari, H., Sabouri, Z., Mhammedi, S., Gherabi, N.: Diabetes prediction using machine learning algorithms and ontology. J. ICT Stand. 10, 319–337 (2022). https://doi.org/10.13052/jicts2245-800X.10212
Bhavitha, B.K., Rodrigues, A.P., Chiplunkar N.N.: Comparative study of machine learning techniques in sentimental analysis. In: 2017 International Conference on Inventive Communication and Computational Technologies (ICICCT), pp. 216–221 (2017)
El Massari, H., Gherabi, N., Mhammedi, S., Ghandi, H., Qanouni, F., Bahaj, M.: Integration of ontology with machine learning to predict the presence of covid-19 based on symptoms. Bull. Electr. Eng. Inform. 11(5), 2805–2816 (2022). https://doi.org/10.11591/eei.v11i5.4392
Pham, B., Khosravi, K., Prakash, I.: Application and comparison of decision tree-based machine learning methods in landside susceptibility assessment at Pauri Garhwal area, Uttarakhand, India. Environ. Process. 4, 711–730 (2017). https://doi.org/10.1007/s40710-017-0248-5
Panagiotakis, C., Papadakis, H., Fragopoulou, P.: A dual hybrid recommender system based on SCoR and the random forest. Comput. Sci. Inf. Syst. 18, 115–128 (2021). https://doi.org/10.2298/CSIS200515046P
Kuzey, C., Uyar, A., Delen, D.: An investigation of the factors influencing cost system functionality using decision trees, support vector machines and logistic regression. Int. J. Account. Inf. Manag. 27, 27–55 (2019). https://doi.org/10.1108/IJAIM-04-2017-0052
Hegde, Y., Padma, S.K.: Sentiment analysis using random forest ensemble for mobile product reviews in Kannada. In: 2017 IEEE 7th International Advance Computing Conference (IACC), pp. 777–782 (2017)
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2024 The Author(s), under exclusive license to Springer Nature Switzerland AG
About this paper
Cite this paper
Mhammedi, S., El Massari, H., Gherabi, N., Amnai, M. (2024). Enhancing Book Recommendations on GoodReads: A Data Mining Approach Based Random Forest Classification. In: Ben Ahmed, M., Boudhir, A.A., El Meouche, R., KaraÈ™, Ä°.R. (eds) Innovations in Smart Cities Applications Volume 7. SCA 2023. Lecture Notes in Networks and Systems, vol 938. Springer, Cham. https://doi.org/10.1007/978-3-031-54376-0_36
Download citation
DOI: https://doi.org/10.1007/978-3-031-54376-0_36
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-031-54375-3
Online ISBN: 978-3-031-54376-0
eBook Packages: EngineeringEngineering (R0)