Skip to main content

Comparative Analysis of Statistical and Machine Learning Methods for Classification of Match Outcomes in Association Football

  • Conference paper
  • First Online:
Proceedings of the 7th International Conference on the Applications of Science and Mathematics 2021

Part of the book series: Springer Proceedings in Physics ((SPPHY,volume 273))

  • 329 Accesses

Abstract

Association football is one of the most popular sports in the world, having a large fan base that draws media and entertainment platforms to it. The three major difficulties to developing highly accurate match result prediction models, especially in prediction match outcomes, are data availability and quality [1], model assumptions [2], and testing various models and parameters [3]. The primary goal of the study is to identify the best model in predicting football match outcomes. The football dataset was obtained from the top 5 leagues in Euro. Exploratory data analysis had been conducted to better understand the dataset. Models used in predicting the football match outcomes include Logistic Regression, Artificial Neural Networks, and XGBoost. The predictive performance of three classification models was compared in terms of accuracy, precision and recall. The results showed that the Artificial Neural Networks achieved highest accuracy of 0.6788, followed by Logistic Regression (0.668) and XGBoost (0.654). These results are hoped to be used as benchmark results for future experiments in the area of football match classification.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 169.00
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Hardcover Book
USD 219.99
Price excludes VAT (USA)
  • Durable hardcover edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Similar content being viewed by others

References

  1. E. Morgulev, O.H. Azar, R. Lidor, Sports analytics and the big-data era. Int. J. Data Sci. Anal. 5(4), 213–222 (2018). https://doi.org/10.1007/s41060-017-0093-7

    Article  Google Scholar 

  2. A.C. Constantinou, Dolores: a model that predicts football match outcomes from all over the world. Mach. Learn. 108(1), 49–75 (2019)

    Article  MathSciNet  Google Scholar 

  3. H. Arntzen, L.M. Hvattum, Predicting match outcomes in association football using team ratings and player ratings. Stat. Modell. 2020, 1–22 (2020). https://doi.org/10.1177/1471082X20929881

    Article  MATH  Google Scholar 

  4. M. Ahin, R. Erol, Prediction of attendance demand in European football games: comparison of ANFIS, Fuzzy Logic, and ANN. Comput. Intell. Neurosci. 2018 (2018). https://doi.org/10.1155/2018/5714872

  5. L. Bransen, J. Van Haaren, Measuring football players’ on-the-ball contributions from passes during games, vol. 11330 LNAI. Springer International Publishing (2019). https://doi.org/10.1007/978-3-030-17274-9_1

  6. K. Kapadia, H. Abdel-Jaber, F. Thabtah, W. Hadi, Sport analytics for cricket game results using machine learning: an experimental study. Appl. Comput. Inform. (xxxx) (2019). https://doi.org/10.1016/j.aci.2019.11.006

    Article  Google Scholar 

  7. F. Thabtah, L. Zhang, N. Abdelhamid, NBA game result prediction using feature analysis and machine learning. Ann. Data Sci. 6(1), 103–116 (2019). https://doi.org/10.1007/s40745-018-00189-x

    Article  Google Scholar 

  8. N. Razali, A. Mustapha, F.A. Yatim, R. Ab Aziz, Predicting football matches results using bayesian networks for english premier league (EPL). IOP Conf. Ser.: Mater. Sci. Eng. 226(1) (2017). https://doi.org/10.1088/1757-899X/226/1/012099

  9. G. Angelini, L. De Angelis, PARX model for football match predictions. J. Forecast. 36(7), 795–807 (2017). https://doi.org/10.1002/for.2471

    Article  MathSciNet  MATH  Google Scholar 

  10. C. Herbinet, Predicting football results using machine learning techniques, in 2011 Proceedings of the 34th International Convention MIPRO, vol. 48, pp. 1623–1627 (2018)

    Google Scholar 

  11. T. Blobel, M. Lames, A concept for club information systems (CIS) an example for applied sports informatics. Int. J. Comput. Sci. Sport 19(1), 102–122 (2020). https://doi.org/10.2478/ijcss-2020-0006

    Article  Google Scholar 

  12. G. Baio, M. Blangiardo, Bayesian hierarchical model for the prediction of football results. J. Appl. Stat. 37(2), 253–264 (2010). https://doi.org/10.1080/02664760802684177

    Article  MathSciNet  MATH  Google Scholar 

  13. J. Hucaljuk, A. Rakipović, Predicting football scores using machine learning techniques, in MIPRO 2011—34th International Convention on Information and Communication Technology, Electronics and Microelectronics—Proceedings, vol. 48, pp. 1623–1627 (2011)

    Google Scholar 

  14. N. Tax, Y. Joustra, Predicting the Dutch football competition using public data: a machine learning approach. Trans. Knowl. Data Eng. 10(10), 1–13 (2015)

    Google Scholar 

  15. R. Baboota, H. Kaur, Predictive analysis and modelling football results using machine learning approach for English Premier League. Int. J. Forecast. 35(2), 741–755 (2019). https://doi.org/10.1016/j.ijforecast.2018.01.003

    Article  Google Scholar 

  16. D. Prasetio, Harlili: predicting football match results with logistic regression, in 4th IGNITE Conference and 2016 International Conference on Advanced Informatics: concepts, Theory and Application, ICAICTA 2016, pp. 2–6 (2016). https://doi.org/10.1109/ICAICTA.2016.7803111

  17. R. Gasparyan, A Novel Way to Soccer Match Prediction. Stanford University, Department of Computer Science (2014)

    Google Scholar 

  18. M.A. Rahman, A deep learning framework for football match prediction. SN Appl. Sci. 2(2) (2020). https://doi.org/10.1007/s42452-019-1821-5

  19. N. Danisik, P. Lacko, M. Farkas, Football match prediction using players attributes, in DISA 2018—IEEE World Symposium on Digital Intelligence for Systems and Machines, Proceedings, pp. 201–206 (2018). https://doi.org/10.1109/DISA.2018.8490613

  20. A.D. Blaikie, G.J. Abud, J.A. David, R.D. Pasteur, NFL & NCAA football prediction using artificial neural networks, in Proceedings of the Midstates Conference for Undergraduate Research in Computer Science and Mathematics. Denison University, Granville, OH (2011). http://ohio5.openrepository.com/ohio5/handle/2374.DEN/3930

  21. N. Zaveri, U. Shah, S. Tiwari, P. Shinde, L.K. Teli, Prediction of football match score and decision making process. Int. J. Recent Innov. Trends Comput. Commun. 6(2), 162–165 (2018)

    Google Scholar 

  22. A.E. Tümer, S. Koçer, Prediction of team league’s rankings in volleyball by artificial neural network method. Int. J. Perform. Anal. Sport 17(3), 202–211 (2017). https://doi.org/10.1080/24748668.2017.1331570

    Article  Google Scholar 

  23. T. Chen, C. Guestrin, XGBoost: a scalable tree boosting system, in Proceedings of the 22nd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (KDD 2016) (2016)

    Google Scholar 

  24. J. Brownlee, A gentle introduction to xgboost for applied machine learning. Mach. Learn. Mastery (2016). http://machinelearningmastery.com/gentle-introduction-xgboost-appliedmachine-learning/. Accessed 2 Mar. 2018

  25. W. Gourh, K. Poojary, M. Vengarai, N. Parkar, Football prediction using XGBoost algorithm: a literature review. J. Phys. Sci. Eng. Technol. 12(1), 109–112 (2020)

    Google Scholar 

Download references

Acknowledgements

This research is supported by the Ministry of Higher Education (MOHE) through Fundamental Research Grant Scheme (FRGS/1/2020/ICT02/UTHM/02/4).

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Aida Binti Mustapha .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2022 The Author(s), under exclusive license to Springer Nature Singapore Pte Ltd.

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

Zulkifli, S., Mustapha, A.B., Ismail, S., Razali, N. (2022). Comparative Analysis of Statistical and Machine Learning Methods for Classification of Match Outcomes in Association Football. In: Mustapha, A.B., Shamsuddin, S., Zuhaib Haider Rizvi, S., Asman, S.B., Jamaian, S.S. (eds) Proceedings of the 7th International Conference on the Applications of Science and Mathematics 2021. Springer Proceedings in Physics, vol 273. Springer, Singapore. https://doi.org/10.1007/978-981-16-8903-1_31

Download citation

Publish with us

Policies and ethics