Abstract
The precise prediction of construction costs during the initial phase of a construction project is crucial for ensuring the project’s success. Identifying the parameters that influence project cost contributes to achieving accurate results and improves the overall accuracy of cost estimation. This study applied three machine learning methods such as artificial neural network (ANN), natural gradient boosting (NGBoost), and linear regression (LR) models, to predict the total cost of construction. The NGBoost model was employed for construction cost estimation and was compared with two machine learning algorithms: artificial neural network and linear regression. Evaluation metrics, including Mean Absolute Error (MAE), Coefficient of efficiency (CE), Mean Absolute Percentage Error (MAPE), index of agreement (d), and coefficient of determination (R2), are employed to assess and compare the accuracy of the developed algorithms. Statistical indicators revealed that the NGBoost algorithm outperformed others, displaying the highest coefficient of determination (R2 = 0.992 for training and R2 = 0.985 for testing) and the lowest root mean square error (RMSE = 0.5136 for training and RMSE = 0.3702). Moreover, sensitivity analysis revealed that the input parameter with the highest contribution was formwork, accounting for nearly 41%. On the other hand, the superimposed load had the lowest contribution, totaling 5%. The Shapley Additive Explanation (SHAP) method was employed to elucidate the importance and contribution of input variables influencing construction costs. The findings of this study offer valuable insights for project stakeholders, enabling them to minimize errors in estimated costs and make informed decisions early in the construction process.
Similar content being viewed by others
Data availability
Data will be made available on request.
References
Abdulalim Alabdullah, A., Iqbal, M., Zahid, M., Khan, K., Nasir Amin, M., & Jalal, F. E. (2022). Prediction of rapid chloride penetration resistance of metakaolin based high strength concrete using light GBM and XGBoost models by incorporating SHAP analysis. Construction and Building Materials. https://doi.org/10.1016/j.conbuildmat.2022.128296
Ali, Z. H., & Burhan, A. M. (2023). Hybrid machine learning approach for construction cost estimation: An evaluation of extreme gradient boosting model. Asian Journal of Civil Engineering, 24(7), 2427–2442. https://doi.org/10.1007/s42107-023-00651-z
Amin, M. N., Ahmad, W., Khan, K., Nazar, S., Arab, A. M. A., & Deifalla, A. F. (2023). Evaluating the relevance of eggshell and glass powder for cement-based materials using machine learning and SHapley Additive exPlanations (SHAP) analysis. Case Studies in Construction Materials. https://doi.org/10.1016/j.cscm.2023.e02278
Beycioʇlu, A., Emiroʇlu, M., Kocak, Y., & Subaşi, S. (2015). Analyzing the compressive strength of clinker mortars using approximate reasoning approaches—ANN vs MLR. Computers and Concrete, 15(1), 89–101. https://doi.org/10.12989/cac.2015.15.1.089
Bilal, M., & Oyedele, L. O. (2020). Guidelines for applied machine learning in construction industry—A case of profit margins estimation. Advanced Engineering Informatics. https://doi.org/10.1016/j.aei.2019.101013
Cakiroglu, C., Demir, S., Hakan Ozdemir, M., Latif Aylak, B., Sariisik, G., & Abualigah, L. (2024). Data-driven interpretable ensemble learning methods for the prediction of wind turbine power incorporating SHAP analysis. Expert Systems with Applications. https://doi.org/10.1016/j.eswa.2023.121464
Chakraborty, D., Elhegazy, H., Elzarka, H., & Gutierrez, L. (2020). A novel construction cost prediction model using hybrid natural and light gradient boosting. Advanced Engineering Informatics. https://doi.org/10.1016/j.aei.2020.101201
Dao, D. V., Ly, H. B., Vu, H. L. T., Le, T. T., & Pham, B. T. (2020). Investigation and optimization of the C-ANN structure in predicting the compressive strength of foamed concrete. Materials. https://doi.org/10.3390/ma13051072
Das, P., & Kashem, A. (2024). Hybrid machine learning approach to prediction of the compressive and flexural strengths of UHPC and parametric analysis with shapley additive explanations. Case Studies in Construction Materials, 20, e02723. https://doi.org/10.1016/j.cscm.2023.e02723
Ekanayake, I. U., Meddage, D. P. P., & Rathnayake, U. (2022). A novel approach to explain the black-box nature of machine learning in compressive strength predictions of concrete using Shapley additive explanations (SHAP). Case Studies in Construction Materials. https://doi.org/10.1016/j.cscm.2022.e01059
Golafshani, E. M., Behnood, A., & Arashpour, M. (2020). Predicting the compressive strength of normal and High-Performance Concretes using ANN and ANFIS hybridized with Grey Wolf Optimizer. Construction and Building Materials. https://doi.org/10.1016/j.conbuildmat.2019.117266
Haque, M. A., Chen, B., Kashem, A., Qureshi, T., & Ahmed, A. A. M. (2023). Hybrid intelligence models for compressive strength prediction of MPC composites and parametric analysis with SHAP algorithm. Materials Today Communications. https://doi.org/10.1016/j.mtcomm.2023.105547
Hashemi, S. T., Ebadati, O. M., & Kaur, H. (2019). A hybrid conceptual cost estimating model using ANN and GA for power plant projects. Neural Computing and Applications, 31(7), 2143–2154. https://doi.org/10.1007/s00521-017-3175-5
Islam, N., Kashem, A., Das, P., Ali, M. N., & Paul, S. (2023). Prediction of high-performance concrete compressive strength using deep learning techniques. Asian Journal of Civil Engineering. https://doi.org/10.1007/s42107-023-00778-z
Jalal, F. E., Xu, Y., Iqbal, M., Javed, M. F., & Jamhiri, B. (2021). Predictive modeling of swell-strength of expansive soils using artificial intelligence approaches: ANN, ANFIS and GEP. Journal of Environmental Management. https://doi.org/10.1016/j.jenvman.2021.112420
Jiang, Y., Li, H., & Zhou, Y. (2022). Compressive strength prediction of fly ash concrete using machine learning techniques. Buildings. https://doi.org/10.3390/buildings12050690
Jin, R., Cho, K., Hyun, C., & Son, M. (2012). MRA-based revised CBR model for cost prediction in the early stage of construction projects. Expert Systems with Applications, 39(5), 5214–5222. https://doi.org/10.1016/j.eswa.2011.11.018
Kashem, A., & Das, P. (2023). Compressive strength prediction of high-strength concrete using hybrid machine learning approaches by incorporating SHAP analysis. Asian Journal of Civil Engineering. https://doi.org/10.1007/s42107-023-00707-0
Kaveh, A., Dadras Eslamlou, A., Javadi, S. M., & Geran Malek, N. (2021). Machine learning regression approaches for predicting the ultimate buckling load of variable-stiffness composite cylinders. Acta Mechanica, 232(3), 921–931. https://doi.org/10.1007/s00707-020-02878-2
Kaveh, A., Eskandari, A., & Movasat, M. (2023). Buckling resistance prediction of high-strength steel columns using Metaheuristic-trained Artificial Neural Networks. Structures. https://doi.org/10.1016/j.istruc.2023.07.043
Kaveh, A., & Khavaninzadeh, N. (2023). Efficient training of two ANNs using four meta-heuristic algorithms for predicting the FRP strength. Structures, 52, 256–272. https://doi.org/10.1016/j.istruc.2023.03.178
Keshavarz, Z., & Torkian, H. (2018). Application of ANN and ANFIS models in determining compressive strength of concrete. Journal of Soft Computing in Civil Engineering, 2(1), 62–70. https://doi.org/10.22115/SCCE.2018.51114
Liu, G., & Sun, B. (2023). Concrete compressive strength prediction using an explainable boosting machine model. Case Studies in Construction Materials. https://doi.org/10.1016/j.cscm.2023.e01845
Lu, Y., Luo, X. Y., & Zhang, H. (2011). A gene expression programming algorithm for highway construction cost prediction problems. Jiaotong Yunshu Xitong Gongcheng Yu Xinxi/journal of Transportation Systems Engineering and Information Technology, 11(6), 85–92. https://doi.org/10.1016/s1570-6672(10)60150-x
Mohammed, S. J., Zubaidi, S. L., Al-Ansari, N., Mohammed Ridha, H., Dulaimi, A., & Al-Khafaji, R. (2023). Application of metaheuristic algorithms and ANN model for univariate water level forecasting. Advances in Civil Engineering, 20, 23. https://doi.org/10.1155/2023/9947603
Paul, S., Das, P., Kashem, A., & Islam, N. (2023). Sustainable of rice husk ash concrete compressive strength prediction utilizing artificial intelligence techniques. Asian Journal of Civil Engineering. https://doi.org/10.1007/s42107-023-00847-3
Rofooei, F. R., Kaveh, A., & Farahani, F. M. (2011). Estimating the vulnerability of the concrete moment resisting frame structures using artificial neural networks. International Journal of Optimization in Civil Engineering, 3, 433–448.
Siwiński, J., Szcześniak, A., & Stolarski, A. (2020). Modified formula for designing ultra-high-performance concrete with experimental verification. Materials, 13(20), 1–14. https://doi.org/10.3390/ma13204518
Tayefeh Hashemi, S., Ebadati, O. M., & Kaur, H. (2020). Cost estimation and prediction in construction projects: a systematic review on machine learning techniques. SN Applied Sciences. https://doi.org/10.1007/s42452-020-03497-1
Wang, J., & Ashuri, B. (2017). Predicting ENR Construction Cost Index Using Machine-Learning Algorithms. International Journal of Construction Education and Research, 13(1), 47–63. https://doi.org/10.1080/15578771.2016.1235063
Xi, B., Li, E., Fissha, Y., Zhou, J., & Segarra, P. (2023). LGBM-based modeling scenarios to compressive strength of recycled aggregate concrete with SHAP analysis. Mechanics of Advanced Materials and Structures. https://doi.org/10.1080/15376494.2023.2224782
Zhao, L., Zhang, W., & Wang, W. (2020). Construction cost prediction based on genetic algorithm and BIM. International Journal of Pattern Recognition and Artificial Intelligence. https://doi.org/10.1142/S0218001420590260
Zheng, X., Xie, Y., Yang, X., Amin, M. N., Nazar, S., Khan, S. A., Althoey, F., & Deifalla, A. F. (2023). A data-driven approach to predict the compressive strength of alkali-activated materials and correlation of influencing parameters using SHapley Additive exPlanations (SHAP) analysis. Journal of Materials Research and Technology, 25, 4074–4093. https://doi.org/10.1016/j.jmrt.2023.06.207
Zhou, C., Ding, L. Y., Skibniewski, M. J., Luo, H., & Zhang, H. T. (2018). Data based complex network modeling and analysis of shield tunneling performance in metro construction. Advanced Engineering Informatics, 38, 168–186. https://doi.org/10.1016/j.aei.2018.06.011
Funding
The authors have not disclosed any funding.
Author information
Authors and Affiliations
Contributions
PD: Supervision, Conceptualization, Writing—original draft, Data curation, Data analysis, Methodology, Software, Formal analysis, Writing—review & editing. AK: Conceptualization, Writing—original draft, Data curation, Data Analysis, formal analysis, Software, Methodology. Writing—review & editing. IH: Conceptualization, Writing—original draft, Data curation. Writing– review & editing. MI: Data curation, Writing—review & editing.
Corresponding author
Ethics declarations
Conflict of interest
The authors declare that they have no known competing financial interests or personal relationships that could have appeared to influence the work reported in this paper.
Additional information
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.
About this article
Cite this article
Das, P., Kashem, A., Hasan, I. et al. A comparative study of machine learning models for construction costs prediction with natural gradient boosting algorithm and SHAP analysis. Asian J Civ Eng 25, 3301–3316 (2024). https://doi.org/10.1007/s42107-023-00980-z
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s42107-023-00980-z