Skip to main content

Improving Sales Prediction for Point-of-Sale Retail Using Machine Learning and Clustering

  • Chapter
  • First Online:
Machine Learning and Data Analytics for Solving Business Problems

Part of the book series: Unsupervised and Semi-Supervised Learning ((UNSESUL))

Abstract

Point-of-sale retail represents an important aspect of daily consumer purchases. Even with the increasing growth of online retailing, physical retail stores provide useful services for consumers. Data analytics can be applied to improve the performance of this type of retailing by better predicting product sales and optimizing product availability. Large physical retail chains sell a wide range of products in different store locations which makes high-quality predictions across different products, categories, and store locations complex and often results in low-quality sales forecasts. Developing a data analytics model for every single product and store in a retail chain would be difficult to scale. Against this background, machine learning methods are highly promising and could be used to cluster stores with similar properties to subsequently provide a single model for predicting their specific product sales. Yet, literature that provides a systematic approach for clustering stores based on a standardized list of properties is limited. This paper addresses this gap by identifying the main factors for clustering retail stores and examines model combinations of clustering and prediction algorithms that improve sales forecasts in retail stores. The results of this paper show selected factors for organizing stores and present the best performing algorithms for predicting product sales.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 139.00
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 179.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info
Hardcover Book
USD 179.99
Price excludes VAT (USA)
  • Durable hardcover edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

  1. N.S. Terblanche, Revisiting the supermarket in-store customer shopping experience. J. Retail. Consum. Serv. 40, 48–59 (2018) https://doi.org/10.1016/j.jretconser.2017.09.004

    Article  Google Scholar 

  2. I. Sachdeva, S. Goel, Retail store environment and customer experience: A paradigm. J. Fash. Mark. Manag. (2015). https://doi.org/10.1108/JFMM-03-2015-0021

  3. A. Likas, N. Vlassis, J.J. Verbeek, The global k-means clustering algorithm. Pattern Recogn. 36(2), 451–461 (2003). https://doi.org/10.1016/S0031-3203(02)00060-2

    Article  Google Scholar 

  4. T. Kohonen, The self-organizing map. Proc. IEEE 78(9), 1464–1480 (1990). https://doi.org/10.1109/5.58325

    Article  Google Scholar 

  5. R.J. Kuo, S.-C. Chi, S.-S. Kao, A decision support system for selecting convenience store location through integration of fuzzy AHP and artificial neural network. Comput. Ind. 47(2), 199–214 (2002). https://doi.org/10.1016/S0166-3615(01)00147-6

    Article  Google Scholar 

  6. İ. İşlek, Ş.G. Öğüdücü, A retail demand forecasting model based on data mining techniques, in 2015 IEEE 24th International Symposium on Industrial Electronics (ISIE), (IEEE, 2015). https://doi.org/10.1109/ISIE.2015.7281443

    Chapter  Google Scholar 

  7. Odegua, R, Applied machine learning for supermarket sales prediction. Research gate (2020). https://www.researchgate.net/publication/338681895_Applied_Machine_Learning_for_Supermarket_Sales_Prediction

  8. İ. İşlek, Ş.G. Öğüdücü, A retail demand forecasting model based on data mining techniques, in 2015 IEEE 24th International Symposium on Industrial Electronics (ISIE), (IEEE, 2015, June), pp. 55–60. https://doi.org/10.1109/ISIE.2015.7281443

    Chapter  Google Scholar 

  9. G. Turhan, M. Akalın, C. Zehir, Literature review on selection criteria of store location based on performance measures. Proc. Soc. Behav. Sci 99, 391–402 (2013). https://doi.org/10.1016/j.sbspro.2013.10.507

    Article  Google Scholar 

  10. K. Kusrini, Grouping of retail items by using K-means clustering. Proc. Comput. Sci 72, 495–502 (2015). https://doi.org/10.1016/j.procs.2015.12.131

    Article  Google Scholar 

  11. I.-F. Chen, L. Chi-Jie, Sales forecasting by combining clustering and machine-learning techniques for computer retailing. Neural Comput. & Applic. 28(9), 2633–2647 (2017) https://doi.org/10.1007/s00521-016-2215-x

    Article  Google Scholar 

  12. S.S. Kolhatkar, S.V. Joshi, Transforming the point of sale to point of service-applying SOA in the Indian retail scenario. J. Business Retail Manag. Res 4(2) (2010). https://jbrmr.com/details&cid=33

  13. D. Pederzoli, ICT and retail: State of the art and prospects, in Information and Communication Technologies in Organizations and Society, (2016), pp. 329–336. https://doi.org/10.1007/978-3-319-28907-6_22

    Chapter  Google Scholar 

  14. S. Chopra, How omni-channel can be the future of retailing. Decision 43(2), 135–144 (2016). https://doi.org/10.1007/s40622-015-0118-9

    Article  Google Scholar 

  15. D. Vale, I.C.-L. Guillaume, X. Lecocq, The new retail model: Global reach demands omni-channels. J. Bus. Strateg. (2021). https://doi.org/10.1108/JBS-02-2021-0026

  16. H. Liu et al., Optimal purchase and inventory retrieval policies for perishable seasonal agricultural products. Omega 79, 133–145 (2018) https://doi.org/10.1016/j.omega.2017.08.006

    Article  Google Scholar 

  17. C. Joseph Udokwu, F. Darbanian, T.N. Falatouri, P. Brandtner, Evaluating technique for capturing customer satisfaction data in retail supply chain, in 2020 the 4th International Conference on E-Commerce, E-Business and E-Government, (2020), pp. 89–95. https://doi.org/10.1145/3409929.3414743

    Chapter  Google Scholar 

  18. P. Brandtner, F. Darbanian, T. Falatouri, C. Udokwu, Impact of COVID-19 on the customer end of retail supply chains: A big data analysis of consumer satisfaction. Sustainability 13(3), 1464 (2021). https://doi.org/10.3390/su13031464

    Article  Google Scholar 

  19. T. Thron, G. Nagy, N. Wassan, Evaluating alternative supply chain structures for perishable products. Int. J. Logist. Manag (2007). https://doi.org/10.1108/09574090710835110

  20. P. Anitha, M.M. Patil, RFM model for customer purchase behaviour using K-Means algorithm. J. King Saud Univ. Comput. Inf. Sci (2019). https://doi.org/10.1016/j.jksuci.2019.12.011

  21. R. Diestel, Graph Theory, Graduate Texts in Mathematics (Springer, 2005) ISBN 978-3-642-14278-9

    Google Scholar 

  22. N. Friedman, D. Geiger, M. Goldszmidt, Bayesian network classifiers. Mach. Learn. 29(2), 131–163 (1997). https://doi.org/10.1023/A:1007465528199

    Article  MATH  Google Scholar 

  23. C. Catal et al., Benchmarking of regression algorithms and time series analysis techniques for sales forecasting. Balkan J. Electr. Comput. Eng 7(1), 20–26 (2019). https://doi.org/10.17694/bajece.494920

    Article  Google Scholar 

  24. R. Shumway, D. Stoffer, Time Series Analysis and its Applications: With R Examples, 3rd edn. (Springer, 2010) ISBN 144197864X)

    MATH  Google Scholar 

  25. S. Wang, C. Li, A. Lim, Why are the ARIMA and SARIMA not sufficient. arXiv preprint arXiv:1904.07632 (2019)

    Google Scholar 

  26. J. Schmidhuber, Deep Learning in Neural Networks: An Overview. Neural Netw. 61, 85–117 (2015) arXiv:1404.7828

    Article  Google Scholar 

  27. T. Kansal et al., Customer segmentation using K-means clustering, in 2018 International Conference on Computational Techniques, Electronics and Mechanical Systems (CTEMS), (IEEE, 2018). https://doi.org/10.1109/CTEMS.2018.8769171

    Chapter  Google Scholar 

  28. V. Holý, O. Sokol, M. Černý, Clustering retail products based on customer behaviour. Appl. Soft Comput. 60, 752–762 (2017). https://doi.org/10.1016/j.asoc.2017.02.004

    Article  Google Scholar 

  29. P.J. Rousseeuw, Silhouettes: A graphical aid to the interpretation and validation of cluster analysis. J. Comput. Appl. Math. 20, 53–65 (1987). https://doi.org/10.1016/0377-0427(87)90125-7

    Article  MATH  Google Scholar 

  30. A. Salamzadeh, What constitutes a theoretical contribution? J. Org. Cult. Commun. Confl. 24(1), 1–2 (2020) https://ssrn.com/abstract=3599931

    Google Scholar 

  31. G. Fillion, V. Koffi, J.P.B. Ekionea, Peter Senge’s learning organization: A critical view and the addition of some new concepts to actualize theory and practice. J. Org. Cult. Commun. Confl. 19(3), 73–102 (2015) https://www.researchgate.net/publication/304824741_Peter_Senge’s_learning_organization_A_critical_view_and_the_addition_of_some _new_concepts_to_actualize_theory_and_practice

    Google Scholar 

  32. S. Arbour, C.T. Kwantes, J.M. Kraft, C.A. Boglarsky, Person-organization fit: Using normative behaviours to predict workplace satisfaction, stress and intentions to stay. J. Org. Cult. Commun. Confl. 18(1), 41–64 (2014) https://www.researchgate.net/publication/287549304_Person-organization_fit_Using_normative_behaviors_to_predict_workplace_satisfaction_stress_ and_intentions_to_stay

    Google Scholar 

  33. H. Wu, J.L. Shapiro, Does overfitting affect performance in estimation of distribution algorithms, in Proceedings of the 8th Annual Conference on Genetic and Evolutionary Computation, (2006, July), pp. 433–434. https://doi.org/10.1145/1143997.1144078

    Chapter  Google Scholar 

  34. C.C. Tu, P.Y. Chen, N. Wang, Improving prediction efficacy through abnormality detection and data preprocessing. IEEE Access 7, 103794–103805 (2019). https://doi.org/10.1109/ACCESS.2019.2930257

    Article  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Patrick Brandtner .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2022 The Author(s), under exclusive license to Springer Nature Switzerland AG

About this chapter

Check for updates. Verify currency and authenticity via CrossMark

Cite this chapter

Udokwu, C., Brandtner, P., Darbanian, F., Falatouri, T. (2022). Improving Sales Prediction for Point-of-Sale Retail Using Machine Learning and Clustering. In: Alyoubi, B., Ben Ncir, CE., Alharbi, I., Jarboui, A. (eds) Machine Learning and Data Analytics for Solving Business Problems. Unsupervised and Semi-Supervised Learning. Springer, Cham. https://doi.org/10.1007/978-3-031-18483-3_4

Download citation

Publish with us

Policies and ethics