Skip to main content
Log in

A comparative study of machine learning models for construction costs prediction with natural gradient boosting algorithm and SHAP analysis

  • Research
  • Published:
Asian Journal of Civil Engineering Aims and scope Submit manuscript

Abstract

The precise prediction of construction costs during the initial phase of a construction project is crucial for ensuring the project’s success. Identifying the parameters that influence project cost contributes to achieving accurate results and improves the overall accuracy of cost estimation. This study applied three machine learning methods such as artificial neural network (ANN), natural gradient boosting (NGBoost), and linear regression (LR) models, to predict the total cost of construction. The NGBoost model was employed for construction cost estimation and was compared with two machine learning algorithms: artificial neural network and linear regression. Evaluation metrics, including Mean Absolute Error (MAE), Coefficient of efficiency (CE), Mean Absolute Percentage Error (MAPE), index of agreement (d), and coefficient of determination (R2), are employed to assess and compare the accuracy of the developed algorithms. Statistical indicators revealed that the NGBoost algorithm outperformed others, displaying the highest coefficient of determination (R2 = 0.992 for training and R2 = 0.985 for testing) and the lowest root mean square error (RMSE = 0.5136 for training and RMSE = 0.3702). Moreover, sensitivity analysis revealed that the input parameter with the highest contribution was formwork, accounting for nearly 41%. On the other hand, the superimposed load had the lowest contribution, totaling 5%. The Shapley Additive Explanation (SHAP) method was employed to elucidate the importance and contribution of input variables influencing construction costs. The findings of this study offer valuable insights for project stakeholders, enabling them to minimize errors in estimated costs and make informed decisions early in the construction process.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig. 3
Fig. 4
Fig. 5
Fig. 6
Fig.7
Fig. 8
Fig. 9
Fig. 10
Fig. 11
Fig.12

Similar content being viewed by others

Data availability

Data will be made available on request.

References

Download references

Funding

The authors have not disclosed any funding.

Author information

Authors and Affiliations

Authors

Contributions

PD: Supervision, Conceptualization, Writing—original draft, Data curation, Data analysis, Methodology, Software, Formal analysis, Writing—review & editing. AK: Conceptualization, Writing—original draft, Data curation, Data Analysis, formal analysis, Software, Methodology. Writing—review & editing. IH: Conceptualization, Writing—original draft, Data curation. Writing– review & editing. MI: Data curation, Writing—review & editing.

Corresponding author

Correspondence to Pobithra Das.

Ethics declarations

Conflict of interest

The authors declare that they have no known competing financial interests or personal relationships that could have appeared to influence the work reported in this paper.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Das, P., Kashem, A., Hasan, I. et al. A comparative study of machine learning models for construction costs prediction with natural gradient boosting algorithm and SHAP analysis. Asian J Civ Eng 25, 3301–3316 (2024). https://doi.org/10.1007/s42107-023-00980-z

Download citation

  • Received:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s42107-023-00980-z

Keywords

Navigation