Requirement Text Detection from Contract Packages to Support Project Definition Determination
Project requirements are the client's wishes and expectations regarding the design, construction, and other project management processes. The project definition is typically specified in a contract package that includes the contract document and many related documents, such as drawings, specifications, and government codes. Determining the project definition is critical to the success of a project. Because efficient tools for requirement processing are lacking, current project scoping practice still relies heavily on manual review, which is tedious, time-consuming, and error-prone. This study aims to fill that gap by developing an automated method for identifying requirement texts in contractual documents. The study used Naïve Bayes to train a classification model that separates requirement statements from non-requirement statements. An experiment was conducted on a manually labeled dataset of 1191 statements. The results show that the requirement detection model achieves a promising accuracy of over 90%.
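To make the classification step concrete, the following is a minimal sketch of a Naïve Bayes classifier that separates requirement statements from other statements. The abstract does not state which Naïve Bayes variant or which text features the study used, so this sketch assumes a multinomial model over bag-of-words counts with add-one (Laplace) smoothing. The class name `NaiveBayesTextClassifier`, the `tokenize` helper, and the example sentences are hypothetical and are not taken from the study's dataset.

```python
import math
import re
from collections import Counter, defaultdict


def tokenize(text):
    """Lowercase the text and return its alphabetic word tokens."""
    return re.findall(r"[a-z]+", text.lower())


class NaiveBayesTextClassifier:
    """Multinomial Naive Bayes with add-one smoothing (illustrative only)."""

    def fit(self, texts, labels):
        self.class_counts = Counter(labels)      # documents per class
        self.word_counts = defaultdict(Counter)  # word frequencies per class
        self.vocab = set()
        for text, label in zip(texts, labels):
            tokens = tokenize(text)
            self.word_counts[label].update(tokens)
            self.vocab.update(tokens)
        self.total_docs = len(labels)
        return self

    def log_scores(self, text):
        """Return the unnormalized log posterior of each class."""
        # Words never seen in training are ignored.
        tokens = [t for t in tokenize(text) if t in self.vocab]
        vocab_size = len(self.vocab)
        scores = {}
        for label, n_docs in self.class_counts.items():
            total_words = sum(self.word_counts[label].values())
            score = math.log(n_docs / self.total_docs)  # log prior
            for token in tokens:
                count = self.word_counts[label][token]
                score += math.log((count + 1) / (total_words + vocab_size))
            scores[label] = score
        return scores

    def predict(self, text):
        scores = self.log_scores(text)
        return max(scores, key=scores.get)


# Hypothetical toy statements; the study used 1191 manually labeled statements.
train_texts = [
    "The contractor shall provide all materials and labor.",
    "The design must comply with the local building code.",
    "All concrete shall be cured for at least seven days.",
    "The engineer shall approve the drawings before construction.",
    "This section describes the background of the project.",
    "The site is located near the river.",
    "The agreement was signed in the presence of both parties.",
    "Figure 2 shows the layout of the existing bridge.",
]
train_labels = ["requirement"] * 4 + ["other"] * 4

model = NaiveBayesTextClassifier().fit(train_texts, train_labels)
print(model.predict("The contractor shall submit the schedule."))
print(model.predict("The site is near the existing bridge."))
```

In this toy setting, modal verbs such as "shall" carry most of the signal for the requirement class. A realistic evaluation would hold out part of the labeled statements and report accuracy on that held-out portion.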
Keywords: Project definition; Requirement management; Requirement extraction; Machine learning; Natural language processing; Text classification; Naïve Bayes