Abstract
The analysis focuses on data from an e-commerce retailer to identify the classification techniques for obtaining the best performances of validated classification algorithms. It is essential to highlight that this type of prediction is used in the commerce sector since it is a sector in constant development and allows to identify the consumer’s behaviour to variables such as average time on the website, transactions completed, pages rated, among others. The research focuses on Business to Consumer commerce, collecting data from the purchase sessions for 12 months. This allowed the comparison of the Naive Bayes model of Gaussian type, Random Forest and Extra Trees Classifier, which are finally validated against the results of Dummy classifier, highlighting the efficiency of Extra Trees Classifier. This type of study and the selection of the variables provide a higher correlation with the Revenue attribute, which is the one used as a prediction class. The efficiency of Extra Trees Classifier stands out in this type of study, together with the selection of the variables that provide the highest correlation with the Revenue attribute, which is the one used as the prediction class.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Statista: eCommerce report 2021. https://www.statista.com/study/42335/ecommerce-report/. Last accessed 3 June 2022
Jílková, P., Králová, P.: Digital consumer behaviour and eCommerce trends during the COVID-19 crisis. Int. Adv. Econ. Res. 27(1), 83–85 (2021). https://doi.org/10.1007/s11294-021-09817-4
CEPAL: Recuperación económica tras la pandemia COVID-19: empoderar a América Latina y el Caribe para un mejor aprovechamiento del comercio electrónico y digital (2020)
Lim, Y.J., et al.: Online purchase behavior of generation Y in Malaysia. Procedia Econ. Financ. 37, 292–298 (2016). https://doi.org/10.1016/s2212-5671(16)00050-2
Statista: Ecuador: online shopping devices 2021 | Statista. https://www.statista.com/statistics/921189/ecuador-online-purchases-device/. Last accessed 6 June 2022
Sakar C., Kastro, Y.: Online Shoppers Purchasing Intention Dataset (2018)
Colombo-Mendoza, L.O., Paredes-Valverde, M.A., del Salas-Zárate, M.P., Valencia-García, R.: Internet of things-driven data mining for smart crop production prediction in the peasant farming domain. Appl. Sci. 12, 1940 (2022). https://doi.org/10.3390/app12041940
Hervert-Escobar, L., López-Pérez, J.F., Esquivel-Flores, O.A.: Optimal pricing model: case of study for convenience stores. In: Pichardo-Lagunas, O., Miranda-Jiménez, S. (eds.) Advances in Soft Computing: 15th Mexican International Conference on Artificial Intelligence, MICAI 2016, Cancún, Mexico, October 23–28, 2016, Proceedings, Part II, pp. 353–364. Springer International Publishing, Cham (2017). https://doi.org/10.1007/978-3-319-62428-0_28
Alejandro, R.H., Trejo, L.A., Hervert-Escobar, L., Hernández-Gress, N., Enrique, G.N.: Mexican stock return prediction with differential evolution for hyperparameter tuning. In: Batyrshin, I., Gelbukh, A., Sidorov, G. (eds.) Advances in Computational Intelligence: 20th Mexican International Conference on Artificial Intelligence, MICAI 2021, Mexico City, Mexico, October 25–30, 2021, Proceedings, Part I, pp. 355–368. Springer International Publishing, Cham (2021). https://doi.org/10.1007/978-3-030-89817-5_27
Lim, Y.J., Osman, A., Salahuddin, S.N., Romle, A.R., Abdullah, S.: Factors influencing online shopping behavior: the mediating role of purchase intention. Procedia Econ. Financ. 35, 401–410 (2016). https://doi.org/10.1016/s2212-5671(16)00050-2
Mokryn, O., Bogina, V., Kuflik, T.: Will this session end with a purchase? inferring current purchase intent of anonymous visitors. Electron. Commer. Res. Appl. 34, 100836 (2019). https://doi.org/10.1016/J.ELERAP.2019.100836
Shi, X.: The application of machine learning in online purchasing intention prediction. In: ACM International Conference Proceeding Service, pp. 21–29 (2021). https://doi.org/10.1145/3469968.3469972
Esmeli, R., Bader-El-Den, M., Abdullahi, H.: Towards early purchase intention prediction in online session based retailing systems. Electron. Mark. 31(3), 697–715 (2020). https://doi.org/10.1007/s12525-020-00448-x
Charry, K., Coussement, K., Demoulin, N., Heuvinck, N.: Descriptive analysis. In: Charry, K., Coussement, K., Demoulin, N., Heuvinck, N. (eds.) Marketing Research with IBM® SPSS Statistics: A Practical Guide, pp. 31–47. Routledge (2016). https://doi.org/10.4324/9781315525532-2
Lemaître, G., Nogueira, F., Aridas, C.K.: Imbalanced-learn: a python toolbox to tackle the curse of imbalanced datasets in machine learning. J. Mach. Learn. Res. 18, 1–5 (2017)
Chawla, N.V., Bowyer, K.W., Hall, L.O., Kegelmeyer, W.P.: SMOTE: synthetic minority over-sampling technique. J. Artif. Intell. Res. 16, 321–357 (2011). https://doi.org/10.1613/jair.953
Ampomah, E.K., Qin, Z., Nyame, G.: Evaluation of tree-based ensemble machine learning models in predicting stock price direction of movement. Information 11(6), 332 (2020). https://doi.org/10.3390/info11060332
Silaparasetty, N.: Machine Learning Concepts with Python and the Jupyter Notebook Environment: Using Tensorflow 2.0. Apress, Berkeley, CA (2020). https://doi.org/10.1007/978-1-4842-5967-2
Bisong, E.: Matplotlib and seaborn. In: Building Machine Learning and Deep Learning Models on Google Cloud Platform, pp. 151–165. Apress, Berkeley, CA (2019). https://doi.org/10.1007/978-1-4842-4470-8_12
Muschelli, J.: ROC and AUC with a binary predictor: a potentially misleading metric. J. Classif. 37(3), 696–708 (2019). https://doi.org/10.1007/s00357-019-09345-1
Chaubey, G., Gavhane, P.R., Bisen, D., Arjaria, S.K.: Customer purchasing behavior prediction using machine learning classification techniques. J. Ambient Intell. Humaniz. Comput. (2022). https://doi.org/10.1007/s12652-022-03837-6
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2022 The Author(s), under exclusive license to Springer Nature Switzerland AG
About this paper
Cite this paper
Aguirre-Munizaga, M., Del Cioppo Morstadt, J., Samaniego-Cobo, T. (2022). Analysis of Classification Algorithms for the Prediction of Purchase Intention in Electronic Commerce. In: Valencia-García, R., Bucaram-Leverone, M., Del Cioppo-Morstadt, J., Vera-Lucio, N., Jácome-Murillo, E. (eds) Technologies and Innovation. CITI 2022. Communications in Computer and Information Science, vol 1658. Springer, Cham. https://doi.org/10.1007/978-3-031-19961-5_3
Download citation
DOI: https://doi.org/10.1007/978-3-031-19961-5_3
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-031-19960-8
Online ISBN: 978-3-031-19961-5
eBook Packages: Computer ScienceComputer Science (R0)