Abstract
The significance of adopting cloud technology in enterprises is accelerating and becoming ubiquitous in business and industry. Due to migrating the on-premises servers and services into cloud, companies can leverage several advantages such as cost optimization, high performance, and flexible system maintenance, to name a few. As the data volume, variety, veracity, and velocity are rising tremendously, adopting machine learning (ML) solutions in the cloud platform bring benefits from ML model building through model evaluation more efficiently and accurately. This study will provide a comparative performance analysis of the three big cloud vendors: Amazon Web Service (AWS), Microsoft Azure and Google Cloud Platform (GCP) by building regression models in each of the platforms. For validation purposes, i.e., training and testing the models, five different standard datasets from the UCI machine learning repository have been employed. This work utilizes the ML services of AWS Sage maker, Azure ML Studio and Google Big Query for conducting the experiments. Model evaluation criteria here include measuring R-squared values for each platform, calculating the error metrics (Mean Squared Error, Mean Absolute Error, Root Mean Squared Error etc.) and comparing the results to determine the best performing cloud provider in terms of ML service. The study concludes with presenting a comparative taxonomy of regression models across the three platforms.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Townsend, M., Wimmer, H., Du, J.: Barriers and drivers to adoption of cloud infrastructure services: a security perspective. In: 2020 IEEE International IOT, Electronics and Mechatronics Conference (IEMTRONICS), pp. 1–7. IEEE (2020)
Parida, B.R., Rath, A.K., Mohapatra, H.: Binary self-adaptive salp swarm optimization-based dynamic load balancing in cloud computing. Int. J. Inf. Technol. Web Eng. (IJITWE) 17, 1–25 (2022)
Khan, T., Tian, W., Zhou, G., Ilager, S., Gong, M., Buyya, R.: Machine learning (ML)–centric resource management in cloud computing: a review and future directions. J. Netw. Comput. Appl. 103405 (2022)
Grigoriou, N.N., Fink, A.: Cloud computing: Key to enabling smart production and industry 4.0. In: Madsen, O., Berger, U., Møller, C., Heidemann Lassen, A., Vejrum Waehrens, B., Schou, C. (eds.) The Future of Smart Production for SMEs, pp. 315–322. Springer, Cham (2023). https://doi.org/10.1007/978-3-031-15428-7_26
Abdulqader, D.M., Abdulazeez, A.M., Zeebaree, D.Q.: Machine learning supervised algorithms of gene selection: a review. Mach. Learn. 62, 233–244 (2020)
Meyer, H., Kühnlein, M., Appelhans, T., Nauss, T.: Comparison of four machine learning algorithms for their applicability in satellite-based optical rainfall retrievals. Atmos. Res. 169, 424–433 (2016)
Osisanwo, F., Akinsola, J., Awodele, O., Hinmikaiye, J., Olakanmi, O., Akinjobi, J.: Supervised machine learning algorithms: classification and comparison. Int. J. Comput. Trends Technol. (IJCTT) 48, 128–138 (2017)
Maulud, D., Abdulazeez, A.M.: A review on linear regression comprehensive in machine learning. J. Appl. Sci. Technol. Trends 1, 140–147 (2020)
Wang, H., Lei, Z., Zhang, X., Zhou, B., Peng, J.: Machine learning basics. Deep Learn. 98–164 (2016)
Kolisetty, V.V., Rajput, D.S.: A review on the significance of machine learning for data analysis in big data. Jordan. J. Comput. Inf. Technol. (JJCIT) 6, 155–171 (2020)
Asim, Y., Shahid, A.R., Malik, A.K., Raza, B.: Significance of machine learning algorithms in professional blogger’s classification. Comput. Electr. Eng. 65, 461–473 (2018)
Love, B.C.: Comparing supervised and unsupervised category learning. Psychon. Bull. Rev. 9, 829–835 (2002)
Ratra, R., Gulia, P.: Experimental evaluation of open source data mining tools (WEKA and Orange). Int. J. Eng. Trends Technol. 68, 30–35 (2020)
Kodati, S.: Analysis of heart disease using in data mining tools Orange and Weka. Global J. Comp. Sci. Technol. 18, 17–21 (2018)
Kavitha, S., Varuna, S., Ramya, R.: A comparative analysis on linear regression and support vector regression. In: 2016 Online International Conference on Green Engineering and Technologies (IC-GET), pp. 1–5. IEEE (2016)
Rajagopal, S., Hareesha, K.S., Kundapur, P.P.: Performance analysis of binary and multiclass models using azure machine learning. Int. J. Electr. Comput. Eng. (2088–8708) 10 (2020)
Kaushik, P., Rao, A.M., Singh, D.P., Vashisht, S., Gupta, S.: Cloud computing and comparison based on service and performance between Amazon AWS, Microsoft Azure, and Google Cloud. In: 2021 International Conference on Technological Advancements and Innovations (ICTAI), pp. 268–273. IEEE (2021)
Alkhatib, A., Al Sabbagh, A., Maraqa, R.: Public cloud computing: big three vendors. In: 2021 International Conference on Information Technology (ICIT), pp. 230–237. IEEE (2021)
Sikeridis, D., Papapanagiotou, I., Rimal, B.P., Devetsikiotis, M.: A comparative taxonomy and survey of public cloud infrastructure vendors. arXiv preprint arXiv:1710.01476 (2017)
Kaggle Dataset. https://www.kaggle.com/datasets/teertha/ushealthinsurancedataset. Accessed 23 Dec 2022
Kaggle Dataset. https://www.kaggle.com/datasets/arshid/iris-flower-dataset. Accessed 24 Dec 2022
Kaggle Dataset. https://www.kaggle.com/datasets/quantbruce/real-estate-price-prediction. Accessed 24 Dec 2022
Kaggle Dataset. https://www.kaggle.com/datasets/uciml/red-wine-quality-cortez-et-al-2009. Accessed 26 Dec 2022
Kaggle Dataset. https://www.kaggle.com/datasets/piyushagni5/white-wine-quality. Accessed 26 Dec 2022
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2023 The Author(s), under exclusive license to Springer Nature Switzerland AG
About this paper
Cite this paper
Jamal, S., Wimmer, H. (2023). Performance Analysis of Machine Learning Algorithm on Cloud Platforms: AWS vs Azure vs GCP. In: Gibadullin, A. (eds) Information Technologies and Intelligent Decision Making Systems. ITIDMS 2022. Communications in Computer and Information Science, vol 1821. Springer, Cham. https://doi.org/10.1007/978-3-031-31353-0_5
Download citation
DOI: https://doi.org/10.1007/978-3-031-31353-0_5
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-031-31352-3
Online ISBN: 978-3-031-31353-0
eBook Packages: Computer ScienceComputer Science (R0)