Skip to main content

Towards the Development of a Budget Categorisation Machine Learning Tool: A Review

  • Conference paper
  • First Online:
Trends on Construction in the Digital Era (ISIC 2022)


Engineering, procurement, and construction (EPC) contracts include time, budget, quality, and safety, among other issues. In budgeting, construction companies must assess each task's scope and map the client's expectations (expressed in the bill of quantities) to an internal database of tasks, resources, and costs. The results from this classification will determine the quality of the tenders issued by the company and are thus contractually binding. Construction companies must achieve their contractual targets in order to make a profit.

In this paper, we review the literature and explore the latest advancements regarding the automatisation of these processes to find the methods that yield the best results in the classification of bills of quantities and works in the construction industry.

Although full automation is not within our reach in the short term, especially due to the lack of standard construction specifications, machine learning can provide useful support tools. This communication is part of the authors’ study aiming to develop a framework and tool to automate the process of task classification in a construction contract.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Subscribe and save

Springer+ Basic
EUR 32.99 /Month
  • Get 10 units per month
  • Download Article/Chapter or Ebook
  • 1 Unit = 1 Article or 1 Chapter
  • Cancel anytime
Subscribe now

Buy Now

USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
USD 229.00
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 299.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info
Hardcover Book
USD 299.99
Price excludes VAT (USA)
  • Durable hardcover edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Similar content being viewed by others


  1. Elhegazy, H., et al.: Artificial Intelligence for developing accurate preliminary cost estimates for composite flooring systems of multi-storey buildings. J. Asian Archite. Build. Eng. (2021).

  2. Pessoa, A., Sousa, G., Maues, L.M.F., Alvarenga, F.C., Santos, D.D.: Cost forecasting of public construction projects using multilayer perceptron artificial neural networks: a case study. Ingenieria E Investigacion 41(3) (2021 Dec). Art no. e87737,

  3. Jafari, P., Al Hattab, M., Mohamed, E., Abourizk, S.: Automated extraction and time-cost prediction of contractual reporting requirements in construction using natural language processing and simulation. Applied Sciences (Switzerland), Article 11(13) (2021). Art no. 6188,

  4. Sharma, S., Ahmed, S., Naseem, M., Alnumay, W.S., Singh, S., Cho, G.H.: A survey on applications of artificial intelligence for pre-parametric project cost and soil shear-strength estimation in construction and geotechnical engineering. Sensors 21(2), 463 (2021).

    Article  Google Scholar 

  5. Juszczyk, M., Leśniak, A., Zima, K.: ANN based approach for estimation of construction costs of sports fields. Complexity 2018, 1–11 (2018).

    Article  Google Scholar 

  6. Jeon, J.H., Xu, X., Zhang, Y.X., Yang, L., Cai, H.B.: Extraction of construction quality requirements from textual specifications via natural language processing. Transportation Research Record 2675(9), 222–237 (Sep 2021). Art no. 03611981211001385,

  7. Ul Hassan, F., Le, T., Tran, D.H.: Multi-class categorisation of design-build contract requirements using text mining and natural language processing techniques. In: 2020: American Society of Civil Engineers (ASCE), pp. 1266–1274. [Online]. Available: [Online]. Available:

  8. Baker, H., Smith, S., Masterton, G., Hewlett, B.: Data-led learning: using natural language processing (NLP) and machine learning to learn from construction site safety failures. In: 2020: Association of Researchers in Construction Management, pp. 356–365. [Online]. Available: [Online]. Available:

  9. Akanbi, T., Zhang, J.S.: Design information extraction from construction specifications to support cost estimation. Autom. Constr. 131 (Nov 2021). Art no. 103835,

  10. Li, R.Y.M., Li, H.C.Y., Tang, B., Au, W.C.: Fast AI classification for analysing construction accidents claims. ICST, pp. 1–4 (2020). [Online]. Available:

  11. Dimitriou, L., Marinelli, M., Fragkakis, N.: Early bill-of-quantities estimation of concrete road bridges: an artificial intelligence-based application. Public Works Manag. Policy 23(2), 127–149 (2018). Apr

    Article  Google Scholar 

  12. Moon, S., Lee, G., Chi, S., Oh, H.: Automated construction specification review with named entity recognition using natural language processing. J. Constr. Eng. Manage. 147(1), 04020147 (2021).

    Article  Google Scholar 

  13. Cao, Y., Ashuri, B.: Predicting the volatility of highway construction cost index using long short-term memory. J. Manage. Eng. 36(4), 04020020 (2020).

    Article  Google Scholar 

  14. Xue, X., Jia, Y., Tang, Y.: Expressway project cost estimation with a convolutional neural network model. IEEE Access 8, 217848–217866 (2020).

    Article  Google Scholar 

  15. Alaka, H., Oyedele, L., Owolabi, H., Akinade, O., Bilal, M., Ajayi, S.: A big data analytics approach for construction firms failure prediction models. IEEE Trans. Eng. Manage. 66(4), 689–698 (2019).

    Article  Google Scholar 

  16. Tajziyehchi, N., Moshirpour, M., Jergeas, G., Sadeghpour, F.: A predictive model of cost growth in construction projects using feature selection. In: 2020 IEEE Third International Conference on Artificial Intelligence and Knowledge Engineering (AIKE), 9–13 Dec. 2020, pp. 142–147 (2020).

  17. Bloch, T., Sacks, R.: Clustering information types for semantic enrichment of building information models to support automated code compliance checking. J. Comput. Civil. Eng. Article 34(6) (2020). Art no. 04020040,

  18. Jallan, Y., Brogan, E., Ashuri, B., Clevenger, C.M.: Application of natural language processing and text mining to identify patterns in construction-defect litigation cases. J. Leg. Aff. Disput. Resolut. Eng. Constr. 11(4), 04519024 (2019).

    Article  Google Scholar 

  19. Hong, Y., Xie, H.Y., Bhumbra, G., Brilakis, I.: Comparing natural language processing methods to cluster construction schedules. J. Constr. Eng. Manage. 147(10) (Oct 2021). Art no. 04021136,

  20. Suneja, N., Shah, J.P., Shah, Z.H., Holia, M.S.: A neural network approach to design reality oriented cost estimate model for infrastructure projects. Reliability: Theory and Applications Article 16, 254–263 (2021). [Online]. Available:

  21. Moon, S., Lee, G., Chi, S.: Semantic text-pairing for relevant provision identification in construction specification reviews. Autom. Constr. Article 128 (2021). Art no. 103780,

  22. Gaussmann, R., Coelho, D., Fernandes, A.M.R., Crocker, P., Leithardt, V.R.Q.: Using machine learning for road maintenance cost estimates in brazil: a case study in the federal district. In: 2020 15th Iberian Conference on Information Systems and Technologies (CISTI), 24–27 June 2020, pp. 1–7 (2020).

  23. Juszczyk, M.: Implementation of the ANNs ensembles in macro-BIM cost estimates of buildings’ floor structural frames, p. 020014 (2018). [Online]. Available:

  24. Zhang, J., et al.: A RMM based word segmentation method for chinese design specifications of building stairs. In: 14th International Conference on Computational Intelligence and Security (CIS), Hangzhou, PEOPLES R CHINA, Nov 16–19 2018, pp. 277–280 (2018). [Online]. Available: <Go to ISI>://WOS:000456370300060

  25. Cho, K., Kim, J., Kim, T.: Decision support method for estimating monetary value of post-renovation office buildings. Can. J. Civ. Eng. 46(12), 1103–1113 (2019). Dec

    Article  Google Scholar 

  26. Juszczyk, M., Zima, K., Lelek, W.: Forecasting of sports fields construction costs aided by ensembles of neural networks. J. Civ. Eng. Manag. Article 25(7), 715–729 (2019).

  27. Ronghui, S., Liangrong, N.: An intelligent fuzzy-based hybrid metaheuristic algorithm for analysis the strength, energy and cost optimisation of building material in construction management. Engineering with Computers, Article (2021).

  28. Wang, J., Gao, X.A., Zhou, X.P., Xie, Q.S.: Multi-scale information retrieval for bim using hierarchical structure modelling and natural language processing. J. Info. Technol. Constr. 26, 409–426 (2021).

    Article  Google Scholar 

  29. Elmousalami, H.H.: Data on field canals improvement projects for cost prediction using artificial intelligence. Data Brief 31, 105688 (2020).

    Article  Google Scholar 

  30. Yaqubi, M.K., Salhotra, S.: The automated cost estimation in construction. Int. J. Innov. Technol. Explor. Eng. Article 8(7), 845–849 (2019). [Online]. Available:

  31. Jeon, K., Lee, G., Jeong, H.D.: Classification of the Requirement Sentences of the US DOT Standard Specification Using Deep Learning Algorithms. In: Toledo Santos, E., Scheer, S. (eds.) ICCCBE 2020. LNCE, vol. 98, pp. 89–97. Springer, Cham (2021).

    Chapter  Google Scholar 

Download references


This work was financially supported by: Core Funding-UIDB/04708/2020 of CONSTRUCT-Institute of R&D in Structures and Constructions-funded by national funds through FCT/MCTES (PIDDAC). This work is also co-funded by the European Social Fund (ESF), through the Northern Regional Operational Programme (Norte 2020) [Funding Reference: NORTE-06–3559-FSE-000176].”

Author information

Authors and Affiliations


Corresponding author

Correspondence to Luís Jacques de Sousa .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2023 The Author(s), under exclusive license to Springer Nature Switzerland AG

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

Jacques de Sousa, L., Poças Martins, J., Santos Baptista, J., Sanhudo, L. (2023). Towards the Development of a Budget Categorisation Machine Learning Tool: A Review. In: Gomes Correia, A., Azenha, M., Cruz, P.J.S., Novais, P., Pereira, P. (eds) Trends on Construction in the Digital Era. ISIC 2022. Lecture Notes in Civil Engineering, vol 306. Springer, Cham.

Download citation

  • DOI:

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-031-20240-7

  • Online ISBN: 978-3-031-20241-4

  • eBook Packages: EngineeringEngineering (R0)

Publish with us

Policies and ethics