Abstract
Association football is one of the most popular sports in the world, having a large fan base that draws media and entertainment platforms to it. The three major difficulties to developing highly accurate match result prediction models, especially in prediction match outcomes, are data availability and quality [1], model assumptions [2], and testing various models and parameters [3]. The primary goal of the study is to identify the best model in predicting football match outcomes. The football dataset was obtained from the top 5 leagues in Euro. Exploratory data analysis had been conducted to better understand the dataset. Models used in predicting the football match outcomes include Logistic Regression, Artificial Neural Networks, and XGBoost. The predictive performance of three classification models was compared in terms of accuracy, precision and recall. The results showed that the Artificial Neural Networks achieved highest accuracy of 0.6788, followed by Logistic Regression (0.668) and XGBoost (0.654). These results are hoped to be used as benchmark results for future experiments in the area of football match classification.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Similar content being viewed by others
References
E. Morgulev, O.H. Azar, R. Lidor, Sports analytics and the big-data era. Int. J. Data Sci. Anal. 5(4), 213–222 (2018). https://doi.org/10.1007/s41060-017-0093-7
A.C. Constantinou, Dolores: a model that predicts football match outcomes from all over the world. Mach. Learn. 108(1), 49–75 (2019)
H. Arntzen, L.M. Hvattum, Predicting match outcomes in association football using team ratings and player ratings. Stat. Modell. 2020, 1–22 (2020). https://doi.org/10.1177/1471082X20929881
M. Ahin, R. Erol, Prediction of attendance demand in European football games: comparison of ANFIS, Fuzzy Logic, and ANN. Comput. Intell. Neurosci. 2018 (2018). https://doi.org/10.1155/2018/5714872
L. Bransen, J. Van Haaren, Measuring football players’ on-the-ball contributions from passes during games, vol. 11330 LNAI. Springer International Publishing (2019). https://doi.org/10.1007/978-3-030-17274-9_1
K. Kapadia, H. Abdel-Jaber, F. Thabtah, W. Hadi, Sport analytics for cricket game results using machine learning: an experimental study. Appl. Comput. Inform. (xxxx) (2019). https://doi.org/10.1016/j.aci.2019.11.006
F. Thabtah, L. Zhang, N. Abdelhamid, NBA game result prediction using feature analysis and machine learning. Ann. Data Sci. 6(1), 103–116 (2019). https://doi.org/10.1007/s40745-018-00189-x
N. Razali, A. Mustapha, F.A. Yatim, R. Ab Aziz, Predicting football matches results using bayesian networks for english premier league (EPL). IOP Conf. Ser.: Mater. Sci. Eng. 226(1) (2017). https://doi.org/10.1088/1757-899X/226/1/012099
G. Angelini, L. De Angelis, PARX model for football match predictions. J. Forecast. 36(7), 795–807 (2017). https://doi.org/10.1002/for.2471
C. Herbinet, Predicting football results using machine learning techniques, in 2011 Proceedings of the 34th International Convention MIPRO, vol. 48, pp. 1623–1627 (2018)
T. Blobel, M. Lames, A concept for club information systems (CIS) an example for applied sports informatics. Int. J. Comput. Sci. Sport 19(1), 102–122 (2020). https://doi.org/10.2478/ijcss-2020-0006
G. Baio, M. Blangiardo, Bayesian hierarchical model for the prediction of football results. J. Appl. Stat. 37(2), 253–264 (2010). https://doi.org/10.1080/02664760802684177
J. Hucaljuk, A. Rakipović, Predicting football scores using machine learning techniques, in MIPRO 2011—34th International Convention on Information and Communication Technology, Electronics and Microelectronics—Proceedings, vol. 48, pp. 1623–1627 (2011)
N. Tax, Y. Joustra, Predicting the Dutch football competition using public data: a machine learning approach. Trans. Knowl. Data Eng. 10(10), 1–13 (2015)
R. Baboota, H. Kaur, Predictive analysis and modelling football results using machine learning approach for English Premier League. Int. J. Forecast. 35(2), 741–755 (2019). https://doi.org/10.1016/j.ijforecast.2018.01.003
D. Prasetio, Harlili: predicting football match results with logistic regression, in 4th IGNITE Conference and 2016 International Conference on Advanced Informatics: concepts, Theory and Application, ICAICTA 2016, pp. 2–6 (2016). https://doi.org/10.1109/ICAICTA.2016.7803111
R. Gasparyan, A Novel Way to Soccer Match Prediction. Stanford University, Department of Computer Science (2014)
M.A. Rahman, A deep learning framework for football match prediction. SN Appl. Sci. 2(2) (2020). https://doi.org/10.1007/s42452-019-1821-5
N. Danisik, P. Lacko, M. Farkas, Football match prediction using players attributes, in DISA 2018—IEEE World Symposium on Digital Intelligence for Systems and Machines, Proceedings, pp. 201–206 (2018). https://doi.org/10.1109/DISA.2018.8490613
A.D. Blaikie, G.J. Abud, J.A. David, R.D. Pasteur, NFL & NCAA football prediction using artificial neural networks, in Proceedings of the Midstates Conference for Undergraduate Research in Computer Science and Mathematics. Denison University, Granville, OH (2011). http://ohio5.openrepository.com/ohio5/handle/2374.DEN/3930
N. Zaveri, U. Shah, S. Tiwari, P. Shinde, L.K. Teli, Prediction of football match score and decision making process. Int. J. Recent Innov. Trends Comput. Commun. 6(2), 162–165 (2018)
A.E. Tümer, S. Koçer, Prediction of team league’s rankings in volleyball by artificial neural network method. Int. J. Perform. Anal. Sport 17(3), 202–211 (2017). https://doi.org/10.1080/24748668.2017.1331570
T. Chen, C. Guestrin, XGBoost: a scalable tree boosting system, in Proceedings of the 22nd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (KDD 2016) (2016)
J. Brownlee, A gentle introduction to xgboost for applied machine learning. Mach. Learn. Mastery (2016). http://machinelearningmastery.com/gentle-introduction-xgboost-appliedmachine-learning/. Accessed 2 Mar. 2018
W. Gourh, K. Poojary, M. Vengarai, N. Parkar, Football prediction using XGBoost algorithm: a literature review. J. Phys. Sci. Eng. Technol. 12(1), 109–112 (2020)
Acknowledgements
This research is supported by the Ministry of Higher Education (MOHE) through Fundamental Research Grant Scheme (FRGS/1/2020/ICT02/UTHM/02/4).
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2022 The Author(s), under exclusive license to Springer Nature Singapore Pte Ltd.
About this paper
Cite this paper
Zulkifli, S., Mustapha, A.B., Ismail, S., Razali, N. (2022). Comparative Analysis of Statistical and Machine Learning Methods for Classification of Match Outcomes in Association Football. In: Mustapha, A.B., Shamsuddin, S., Zuhaib Haider Rizvi, S., Asman, S.B., Jamaian, S.S. (eds) Proceedings of the 7th International Conference on the Applications of Science and Mathematics 2021. Springer Proceedings in Physics, vol 273. Springer, Singapore. https://doi.org/10.1007/978-981-16-8903-1_31
Download citation
DOI: https://doi.org/10.1007/978-981-16-8903-1_31
Published:
Publisher Name: Springer, Singapore
Print ISBN: 978-981-16-8902-4
Online ISBN: 978-981-16-8903-1
eBook Packages: Physics and AstronomyPhysics and Astronomy (R0)