1 Introduction

Requirements Engineering (RE) is concerned with the elicitation, analysis, specification, validation and management of requirements (Kotonya and Sommerville 1998). It is conceived as a sub-discipline of software engineering that contributes to understanding customers' needs, improving software quality, and decreasing the risk of software failure (Selby 2007). The requirements elicitation phase is concerned with collecting the organizational needs, constraints, and facilities required by the system stakeholders. Traditional requirements elicitation takes place through questionnaires, templates, checklists, guidelines and inquiries into stakeholders' quality concerns. The requirements are mainly expressed in natural language and intertwined with each other, so requirements analysts must identify, understand, prioritize and classify them. Software requirements analysis is considered a critical phase in which the requirements perceived by users or stakeholders are analyzed. In this phase, the high-level requirements statements collected through elicitation methods are analyzed. Requirements analysts then establish a complete and consistent Software Requirements Specification (SRS) document, and the collected requirements are detected, traced and classified into various categories (Dermeval et al. 2016; Wiegers and Beatty 2013). Software requirements are classified into two types: Functional Requirements (FRs), which describe the services, functions or behavior provided by the software, and Non-Functional Requirements (NFRs), which are the quality attributes (such as usability, security and privacy) or constraints in software development.

Requirements classification is considered advantageous before the architecture and design phases, because architects need a preferential classification of software requirements so that they can map different types of requirements onto different architectural components (Nuseibeh 2001). Early identification of software requirements also helps in the selection of software, hardware and their configuration. Moreover, later phases of software development depend upon the RE phase in order to carry out their tasks. However, manual extraction and classification of requirements is a time-consuming and labor-intensive task (Cleland-Huang et al. 2007). To overcome these issues, automatic classification methods have been exploited by researchers over the last decade. In the context of software development, there is growing interest in AI techniques to minimize or resolve different types of problems. Some studies have investigated the role of machine learning (ML) in software engineering, and especially in requirements engineering. However, these studies did not capture all the aspects and evidence that we are interested in. To the best of the authors' knowledge, there is no existing systematic investigation of the literature covering AI in RE. Hence, the objective of this work is to conduct a systematic review of the literature to find out what AI techniques have been used to extract software requirements from project documents and app stores. It is also important to investigate whether there is real evidence of improvement from using AI in the extraction of software requirements from documents and app reviews. The purpose of this systematic literature review is to better understand how AI supports requirements classification. This paper presents results from 2012 to 2022 and was conducted following a predefined review protocol (Sect. 3).

The paper is organized as follows: Sect. 2 discusses the related work. Section 3 presents the research method. Section 4 includes the research questions. Section 5 presents the results analysis. Active datasets utilized in requirements classification are presented in Sect. 6. Key findings, limitations and open challenges are discussed in Sect. 7. Threats to the validity of this SLR are discussed in Sect. 8. Finally, we conclude this review in Sect. 9 with future research directions.

2 Related work

Existing research has demonstrated the automation of software engineering activities. Most studies on automation rely on machine and deep learning methods, and these automated techniques have also been applied in RE (e.g., in elicitation, analysis, and specification). For example, Meth et al. (2013) conducted an SLR of existing automated tools for requirements elicitation. They covered the period from January 1992 to March 2012 and identified requirements from domain documents; however, identification of requirements is only a small part of their review. By contrast, our review focuses on the identification of requirements from documents and app reviews using AI techniques. Mohammad et al. (2019) discussed various approaches for security requirements engineering (SRE).

In addition, Binkhonain and Zhao (2019) reviewed 24 studies that adopted machine learning techniques for the identification and classification of non-functional requirements (NFRs). They mainly focused on NFR classification and covered the sub-categories of NFRs, and also discussed the role of natural language processing and data mining techniques in requirements classification. Their review includes 24 published studies from 2007 to 2017. In contrast, our review includes 61 studies from 2012 to early 2023. Moreover, their review explicitly addresses NFRs via ML algorithms, whereas ours covers functional as well as non-functional requirements via AI techniques.

Besides, Perez-Verdejo et al. (2020) found only 13 studies from 2010 to 2019 on requirements classification with machine and deep learning techniques. However, identifying software requirements from app reviews is only a small part of that review, whereas our review includes 19 studies on the extraction of software requirements from app reviews. Moreover, those authors restricted themselves to studies employing machine and deep learning, whereas our research also covers recent studies on transfer learning. Dabrowski et al. (2020) reviewed studies on the extraction of NFRs from app reviews using ML approaches; the aim of their SLR was to investigate the role of app reviews in software engineering. They included only 10 studies related to the identification and classification of software requirements from app reviews, while our work includes 19 such studies. The comparison with related work is shown in Table 1.

Table 1 Key points that differentiate this study from existing studies

3 Research method

This section consists of six phases: research questions; search strategy; inclusion and exclusion criteria; snowballing (backward and forward); scrutinizing, extracting and synthesizing the data; and quality assessment. In the first phase, research questions are designed to identify to what extent machine and deep learning techniques have been applied to this field. In the second phase, the PICOC (population, intervention, comparison, outcome and context) search strategy is followed to identify the initial set of related papers. The third phase is concerned with applying the inclusion and exclusion criteria to the start set. In the fourth phase, the snowballing approach proposed by Wohlin (2014) is adopted to identify further relevant studies. Unlike the traditional systematic literature review approach discussed by Kitchenham and Charters (2007), snowballing searches for papers in the reference list of a paper (backward snowballing) or among the citations to a paper (forward snowballing). The extracted papers are then scrutinized in the fifth phase. Finally, the sixth phase is concerned with the quality assessment of the selected studies.

4 Research questions

The purpose of this SLR is to investigate the role of AI techniques in the identification and classification of software requirements. The research questions, along with their description and motivation, are presented in Table 2.

Table 2 The research questions and their description

4.1 Search strategy

For the identification of the start set, the traditional SLR method is used, which includes various electronic literature databases (such as Google Scholar, ACM Digital Library, IEEE Xplore, Web of Science, Scopus, and ScienceDirect). In order to avoid bias toward particular publishers, Google Scholar is used to create the start set (Wohlin 2014). However, Google Scholar returned 13,700 results for ("Software requirements" OR "functional requirements" OR "Non-functional requirements") AND "Classification" AND "Artificial Intelligence", which is too general to be useful. Therefore, the search string was broken down into alternative spellings and synonyms according to the PICOC criteria (population, intervention, comparison, outcome, and context) (Kitchenham and Charters 2007). On the basis of the PICOC structure, a generic string was constructed to maintain the consistency of the searches across multiple databases: alternative and synonym terms are connected with the Boolean "OR" operator, and the Boolean "AND" operator links the major search-term groups; a small sketch of this construction follows Table 3. The detailed description of the search terms is shown in Table 3.

Table 3 PICOC criteria for the identification of start set
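For illustration, a Boolean query of this shape can be assembled programmatically. The following minimal Python sketch uses abbreviated, hypothetical term lists rather than the full set of terms in Table 3.

```python
# Illustrative sketch: assembling a Boolean search string from PICOC term
# groups. The term lists below are abbreviated examples, not the full set
# used in this review (see Table 3 for the actual terms).
population = ['"software requirements"', '"functional requirements"',
              '"non-functional requirements"']
intervention = ['"artificial intelligence"', '"machine learning"',
                '"deep learning"', '"transfer learning"']
outcome = ['"classification"', '"identification"']

def or_group(terms):
    """Join synonyms and alternative spellings with the OR operator."""
    return "(" + " OR ".join(terms) + ")"

# Major search-term groups are linked with the AND operator.
query = " AND ".join(or_group(g) for g in (population, intervention, outcome))
print(query)
```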

4.2 Inclusion and exclusion criteria

The aim of defining such criteria is to identify those primary studies which provide direct evidence about the research questions and to reduce the likelihood of bias (Kitchenham and Charters 2007). It is worth noting that we consider papers presenting empirical validation of their contribution as primary studies, whereas secondary papers are those presenting surveys and systematic literature reviews.

Research papers are included in the review if they present a primary study, were published from January 2012 to June 2023, and contribute to the use of AI to identify software requirements. If the same study is reported by more than one paper, the paper providing the most detailed description of the study is included.

Studies were excluded if they were secondary studies, short texts, non-peer-reviewed, not in the English language, papers without any link to the research questions, gray literature (i.e., without bibliographic information such as publication date/type, volume, or issue number), duplicate papers, or papers whose focus was not the use of machine learning and deep learning techniques for RE. The inclusion and exclusion criteria are summarized in Table 4.

Table 4 Inclusion/exclusion criteria

At the end of the selection process, 13 studies were identified as relevant and included in our start set (shown in Table 5).

Table 5 The initial set of relevant studies for snowballing

4.3 Backward snowballing

For each paper in the start set, backward snowballing was conducted by searching for new studies in the paper's reference list. In the first step, the title and reference context of each referenced paper were reviewed; some referenced papers were additionally evaluated based on abstract and keywords. The reference context is the text surrounding the reference in the citing paper. In the second step, relevant papers were extracted by applying the inclusion and exclusion criteria after full-text reading. New studies were added to the start set, and this process was repeated until no new study was identified. During the iterations, 765 referenced papers were examined; 35 of them were identified as relevant, but only 20 were added to the start set after applying the inclusion/exclusion criteria. The start set then consisted of 36 papers.

4.4 Forward snowballing

For each paper in the start set, forward snowballing was conducted to obtain new studies from the citations to the paper. As discussed by Wohlin (2014), the citations to each paper in the start set can be identified on Google Scholar. Relevant studies were extracted by evaluating each paper on the basis of title, abstract, keywords and reference context. The selected papers were added to the start set, and this procedure was repeated until no new study was identified. During five iterations, 954 studies were examined, 47 studies were identified as relevant, and 24 studies were added to the start set based on the inclusion and exclusion criteria. At the end of forward snowballing, 60 relevant papers were selected as the final set of our review. The graphical representation of both forward and backward snowballing is shown in Fig. 1.

Fig. 1
figure 1

The selection of relevant studies with Wohlin’s snowballing method (Wohlin 2014)

4.5 Extracting and synthesizing the data

After the selection process, data extraction was performed by reading the introduction, conclusion and full text of each paper in the start set. During this stage, data were extracted from the sixty-one relevant studies included in the systematic literature review according to a predefined extraction form. The extracted data were recorded in an Excel sheet in two ways: data required to answer the research questions, and data required to show the bibliographic information of each study. The main aim of data extraction is to enhance the quality of the analysis of the selected studies. The description of the extracted data is shown in Table 6.

Table 6 Data extraction criteria

The 60 studies are distributed according to their year of publication (shown in Fig. 2). Table 7 presents the number of articles published in each electronic database. Most of the papers were published at conferences, and fewer in journals such as Requirements Engineering, Empirical Software Engineering and Automated Software Quality. Table 19 (in the appendix) presents the distribution of selected studies over publication sources, including the publication name, type and count. The 60 studies appeared in 46 publication sources related to the RE community. The leading venues in the RE domain are IREC, IRECW, IEEE Access, and ICASE; these venues reflect the overlap of the RE, software engineering and AI areas. In addition, this SLR includes studies from journals, conferences, workshops and book chapters. From Table 19, the majority of the studies were presented at conferences (23 studies), followed by journals (17 studies), book chapters (8 studies), workshops (8 studies), theses (2 studies), a symposium (1 study) and a seminar (1 study). Figure 5 (in the appendix) indicates the countries contributing research on requirements classification.

Fig. 2
figure 2

The distribution of 61 studies published from 2012 to 2022

Table 7 Distribution of studies in electronic database

4.6 Quality assessment questions and criteria

The quality assessment criteria assess the credibility, completeness, and relevance of the selected studies. All selected studies were assessed against 12 formulated quality assessment questions, presented in Table 8; these questions are adopted from the literature (Dermeval et al. 2016). Questions Q1, Q3, Q4, Q6, Q9, and Q10 have a three-grade score ('YES' = 1, 'NO' = 0, and 'PARTLY' = 0.5): if the answer is yes, the corresponding study receives 1, otherwise 0, and if the contribution of the study is not strong, it receives 0.5. Q2, Q7, Q8, Q11 and Q12 have a two-grade score; Q11 receives 1 point if the study adds value to software companies and 0.5 points if it is only useful to academia. Table 9 presents the quality assessment scores of the studies. The selected studies were reviewed by two independent authors, who meticulously executed the quality assessment. All discrepancies in the quality assessment results were discussed among the authors with the aim of reaching consensus. To measure inter-rater reliability, we calculated the Kappa coefficient, which turned out to be 0.92, indicating strong agreement; a sketch of this calculation is given below. The reliability of the findings of this review was ensured by considering only studies with an acceptable quality rating, i.e., a quality score greater than 2.5 (50% of the maximum score). As a result, 50 studies were excluded from the initially collated studies, leaving 61 finally selected studies. It is observed that all selected studies scored more than 5.0. Q1, Q2, Q3, Q4 and Q12 received the highest scores and Q7 received the lowest. It is worth noting that the lowest-scoring quality criterion indicates that studies are concerned with developing tools rather than techniques. From the quality assessment, it is evident that the findings make a valuable contribution to this review (Fig. 3).
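The inter-rater reliability check can be reproduced with standard tooling. Below is a minimal sketch using scikit-learn's cohen_kappa_score on hypothetical inclusion decisions from two raters; the data shown are invented, not the review's actual ratings.

```python
# Minimal sketch of the inter-rater reliability check described above,
# assuming two raters' decisions are encoded as equal-length label lists
# (hypothetical data, not the review's actual assessments).
from sklearn.metrics import cohen_kappa_score

rater_a = ["include", "include", "exclude", "include", "exclude"]
rater_b = ["include", "include", "exclude", "exclude", "exclude"]

kappa = cohen_kappa_score(rater_a, rater_b)
print(f"Cohen's kappa: {kappa:.2f}")  # values above 0.8 indicate strong agreement
```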

Table 8 Quality criteria related questions
Table 9 Quality assessment of selected studies
Fig. 3
figure 3

The general process of AI techniques in requirements classification

5 Result analysis

5.1 Text pre-processing

  • RQ1 What natural language processing techniques have been used in requirements classification?

5.1.1 Text pre-processing in machine and deep learning

Text pre-processing is an essential task of natural language processing that applies a set of cleaning operations to convert raw, ambiguous free text into "well defined sequences of linguistically meaningful units". Real-world natural language sentences are often written in short forms and contain noise, punctuation symbols and typos. This review found 11 different NLP techniques in the selected studies; Table 10 shows the techniques used in the respective studies, and a combined sketch of several of these steps follows the list below.

  • Tokenization The first step of an NLP pipeline is tokenization, also known as text segmentation. It is the process of splitting a textual sentence into smaller units called tokens. In this granular operation, tokens are separated by delimiters such as whitespace and punctuation, and contractions are split (Khan et al. 2010).

  • Stopword removal In this step, stop words (non-informative, frequent words such as "a", "an", and "the") that have little impact on classification are removed. Removal of stop words reduces noise and improves classification accuracy (Khan et al. 2010).

  • Removal of punctuation/symbols/accents In this process, extra punctuation marks ($, !, ~, @, #) and stray symbols are removed to reduce ambiguity in the sentence (Younas et al. 2020).

  • Text normalization Text normalization is the process of mapping different word variants to standardized linguistic forms. In the selected studies, two text normalization operations, stemming and lemmatization, are widely used in pre-processing. These reduction techniques reduce the dimensionality of morphological variants (such as plural nouns, verb tenses, and pronouns) to their common root or stem. For example, 'avail' can appear as 'available', 'availability' and many more variations (Jivani 2011).

  • Part-of-Speech (POS) tagging POS tagging is the process of tagging verbs, pronouns, adjectives and other word classes in the corpus based on their context (Casamayor et al. 2010).

  • N-grams N-grams are sequences of words. In this process, words are grouped into pairs (bi-grams), triples (tri-grams), or sequences of n words (n-grams) to capture the semantic meaning of the sequence (Cavnar and Trenkle 1994).

  • Case folding This process unifies letter case (i.e., converting upper-case letters into lower case) (Araujo et al. 2022).

  • Regular expressions Regular expressions consist of specialized notation; for example, the caret character (^) inside a character class means NOT (Gu and Kim 2016).

  • Dependency parsing In this process, grammatical relationships between words are identified to extract the relevant parts of the sentence while ignoring the irrelevant parts (Marneffe and Manning 2008a, 2008b).
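To make the pipeline concrete, the following sketch chains several of the steps above (case folding, punctuation removal, tokenization, stopword removal, lemmatization, and bigram extraction) using NLTK; the example requirement sentence is invented, and any reviewed study may of course use a different toolkit or ordering.

```python
# A combined sketch of several pre-processing steps listed above, using NLTK.
import re
import nltk
from nltk.corpus import stopwords
from nltk.stem import WordNetLemmatizer
from nltk.util import ngrams

nltk.download("punkt", quiet=True)
nltk.download("stopwords", quiet=True)
nltk.download("wordnet", quiet=True)

text = "The system SHALL encrypt all user data, ensuring availability!"

text = text.lower()                           # case folding
text = re.sub(r"[^a-z\s]", " ", text)         # remove punctuation/symbols
tokens = nltk.word_tokenize(text)             # tokenization
stops = set(stopwords.words("english"))
tokens = [t for t in tokens if t not in stops]          # stopword removal
lemmatizer = WordNetLemmatizer()
tokens = [lemmatizer.lemmatize(t) for t in tokens]      # text normalization
bigrams = list(ngrams(tokens, 2))             # n-gram extraction

print(tokens)
print(bigrams)
```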

Table 10 Word representation techniques adopted by selected studies

5.1.2 Text preprocessing of BERT

BERT text pre-processing consists of two steps: Tokenization and word embedding.

5.1.2.1 Tokenization

In BERT text representation, special tokens are used to encapsulate the raw sequence (Devlin et al. 2018). The details of these tokens are as follows (a tokenizer sketch follows this list):

  (a) Classification token ([CLS]) The classification token is the first token of each sequence and indicates the beginning of the sentence.

  (b) Separation token ([SEP]) The separation token acts as a delimiter used to separate two sentences.

  (c) Mask token ([MASK]) This token is used to mask words in the sequence during pre-training.
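As an illustration, the Hugging Face transformers tokenizer inserts these special tokens automatically; the following minimal sketch (with an invented sentence pair) shows the resulting token sequence.

```python
# Minimal sketch showing how a BERT tokenizer wraps raw text in the special
# tokens described above; requires the Hugging Face `transformers` package.
from transformers import BertTokenizer

tokenizer = BertTokenizer.from_pretrained("bert-base-uncased")

encoding = tokenizer("The app shall encrypt data.", "Users must log in.")
print(tokenizer.convert_ids_to_tokens(encoding["input_ids"]))
# ['[CLS]', 'the', 'app', ..., '[SEP]', 'users', ..., '[SEP]']
```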

5.1.2.2 Word embedding

Word embedding itself consists of three parts: token embedding, segment embedding and position embedding. A detailed description of each embedding follows.

  (a) Token embedding Token embedding assigns a vector (vocabulary ID) to each token with the help of the WordPiece method.

  (b) Segment embedding Segment embedding is used to differentiate between the two input sequences.

  (c) Position embedding Position embedding stores the position of each token in the sequence.

The graphical representation of the BERT embeddings is shown in Fig. 4; a minimal PyTorch sketch of combining them follows the figure.

Fig. 4
figure 4

The visual representation of the embedding layers of BERT. Source: adapted from Devlin et al. (2018)
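For intuition, the following PyTorch sketch combines three embedding tables by element-wise addition, mirroring Fig. 4; the vocabulary size, sequence length and hidden size are toy values, not BERT's actual configuration.

```python
# Illustrative PyTorch sketch of summing token, segment and position
# embeddings, as in Fig. 4; all sizes here are toy values.
import torch
import torch.nn as nn

vocab_size, max_len, hidden = 100, 8, 16
token_emb = nn.Embedding(vocab_size, hidden)    # WordPiece token IDs
segment_emb = nn.Embedding(2, hidden)           # sentence A = 0, sentence B = 1
position_emb = nn.Embedding(max_len, hidden)    # token positions 0..max_len-1

token_ids = torch.tensor([[5, 23, 7, 42, 3, 0, 0, 0]])
segment_ids = torch.tensor([[0, 0, 0, 1, 1, 0, 0, 0]])
positions = torch.arange(max_len).unsqueeze(0)

embeddings = token_emb(token_ids) + segment_emb(segment_ids) + position_emb(positions)
print(embeddings.shape)  # torch.Size([1, 8, 16])
```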

5.2 Feature extraction techniques

  • RQ2 Which word embedding techniques are used for converting text to vectors?

The main aim of feature extraction techniques is to convert textual sequences into a format understandable by AI techniques. In this process, a document can be represented using two methods: binary and non-binary. In the binary weighting method, the value for a word is 1 if it is present in the document and 0 otherwise; only S18 adopted this method for vector representation. In non-binary methods, the vector space is generated on the basis of the number of times a word appears in the document. The various word embedding techniques adopted by different studies are shown in Table 11. A detailed description of the non-binary methods follows.

Table 11 Various feature extraction techniques used in selected studies

5.2.1 Traditional feature selection methods

  • Bag-of-words (BoW) BoW is the simplest vector representation technique in NLP. In this model, features are represented as an unordered collection of vectors without retaining the grammatical and syntactic structure of the text. BoW is a frequency-based model, where each word in the document is assigned a weight according to its frequency within and across the documents (Khan et al. 2010). Seven studies (S03, S08, S10, S22, S37, S39, and S48) adopted this method for vector representation.

  • Term Frequency-Inverse Document Frequency (TF-IDF) TF-IDF is a more sophisticated variant of BoW. In this method, TF computes how frequently a word appears in a document, and IDF computes how informative the word is. A word may appear many more times in longer documents than in shorter ones; to compensate, the TF-IDF method assigns lower scores to highly frequent words and higher scores to rarely occurring words (Khan et al. 2010). Fifteen studies (S02, S03, S04, S10, S12, S15, S17, S22, S25, S29, S34, S39, S45, S50, and S57) adopted this method; a scikit-learn sketch follows this list.
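The following scikit-learn sketch contrasts the binary weighting mentioned above with TF-IDF weighting; the two "documents" are invented requirement statements.

```python
# Hedged sketch of binary and TF-IDF weighting with scikit-learn.
from sklearn.feature_extraction.text import CountVectorizer, TfidfVectorizer

docs = ["The system shall encrypt all stored data",
        "The system shall display data within two seconds"]

binary_vectors = CountVectorizer(binary=True).fit_transform(docs)  # binary weighting
tfidf = TfidfVectorizer()
tfidf_vectors = tfidf.fit_transform(docs)                          # TF-IDF weighting

print(tfidf.get_feature_names_out())
print(tfidf_vectors.toarray().round(2))  # rare words (e.g. "encrypt") score higher
```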

5.2.2 Pre-trained word representation

Although traditional word representation methods such as one-hot encoding, BoW and TF-IDF identify important words in a text corpus, these techniques lose word order and syntactic structure. To overcome these issues, pre-trained word embedding methods have been explored in recent research. This sub-section discusses the pre-trained embedding methods used in requirements engineering.

  • Word2Vec Word2Vec is a popular distributed word representation technique proposed by Mikolov et al. (2013). Word2Vec is a two-layer neural network model that converts text into a vector space; the commonly used pre-trained word vectors are trained on English Wikipedia with 300 dimensions. The model creates a low-dimensional vector space in which semantically similar words lie close together, so that the similarity of words can be easily calculated. Two training methods, Continuous Bag-of-Words (CBOW) and Skip-gram, are used: CBOW, a probabilistic model, predicts the current target word on the basis of its context, whereas the Skip-gram model predicts the surrounding words from the target word. The Word2Vec method is adopted by S07, S11, S16, S21, S24, S27, S28, S43, S46, S47, S56, S57 and S60; CBOW and Skip-gram are used only in S57. A gensim sketch of Word2Vec training follows this list.

  • Doc2Vec Doc2Vec, namely paragraph embedding, is an extension of the Word2Vec model. Unlike Word2Vec, Doc2Vec converts sentences, phrases, paragraphs and whole documents into vectors (Le and Mikolov 2014). Three studies used the Doc2Vec model: S14, S30, and S58.

  • Global Vectors for Word Representation (GloVe) GloVe is an unsupervised learning method that combines the advantages of two model families: count-based methods and local context windows (as in Word2Vec). GloVe is a log-bilinear regression method that efficiently captures the co-occurrence of words in a given corpus (Pennington et al. 2014); the commonly used pre-trained model is trained on six billion tokens. The GloVe method is adopted in five studies: S35, S51, S53, S44, and S57.

  • FastText FastText is an open-source model developed by Facebook's AI research team in 2016. It uses the same architecture as Word2Vec, but captures subword representations, whereas Word2Vec treats each word as a whole unit. The model is unsupervised and converts a large corpus into a high-dimensional space in which words with similar contextual meaning are placed close together (Joulin et al. 2016). This method is adopted in two studies: S17 and S46.

  • Embeddings from Language Models (ELMo) ELMo is a neural language model trained using bidirectional LSTM layers. It is a context-based word representation method that converts each input word into a feature vector, and the same word can have different feature vectors depending on the context. After vectorization, each vector is passed through the LSTM layers to obtain the final word embedding (Clark et al. 2018). Only S58 adopted ELMo as a word representation technique.

  • Bidirectional Encoder Representations from Transformers (BERT) BERT is a transfer learning model designed by Google (Devlin et al. 2018). BERT is a neural language model that generates output based on the context of the input and can be used as a word embedding layer to convert text into vectors. BERT learns text bidirectionally, jointly conditioning on the left and right context. The model is pre-trained with two tasks: Masked Language Modeling (MLM) and Next Sentence Prediction (NSP). In MLM, some input tokens are randomly replaced by the [MASK] token, and the model is trained to predict the masked tokens from their context. NSP models the relationship between two sentences (useful, for example, in question answering): the model predicts whether sentence B is likely to follow sentence A (Devlin et al. 2018). The goal of both tasks is to predict words based on their contextual meaning. RoBERTa, DistilBERT and Multilingual DistilBERT are variants of the BERT model intended to improve classification accuracy and decrease computational cost. S31, S32, S33, S41, S42, S43, S46, S48, S49, S52, S59 and S61 utilized BERT for classification.
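As a brief illustration of Word2Vec training, the following gensim sketch builds Skip-gram vectors from a toy requirements corpus; the corpus and hyper-parameters are illustrative only and do not correspond to any reviewed study.

```python
# Minimal gensim sketch of training Word2Vec on a toy requirements corpus;
# sg=0 selects CBOW, sg=1 selects skip-gram. Data and parameters are invented.
from gensim.models import Word2Vec

corpus = [["system", "shall", "encrypt", "user", "data"],
          ["system", "shall", "ensure", "data", "availability"],
          ["user", "shall", "log", "in", "securely"]]

model = Word2Vec(corpus, vector_size=50, window=2, min_count=1, sg=1)  # skip-gram
vector = model.wv["encrypt"]                     # 50-dimensional embedding
similar = model.wv.most_similar("data", topn=2)  # nearest neighbours in vector space
print(vector.shape, similar)
```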

5.3 Automated techniques

  • RQ3 What are various AI techniques used in identification and classification of requirements?

This section details the AI techniques utilized in requirements classification. Table 12 presents the AI techniques used in requirements and app review classification.

Table 12 AI techniques used in requirements classification

5.3.1 Machine learning techniques

  • Naïve Bayes (NB) The NB classifier is a probabilistic classifier based on the well-known Bayes' theorem. It is the simplest form of conditional-probability classifier, independently evaluating the probability of each input feature given the class label; these probabilities are combined to generate the model's output. Multinomial Naïve Bayes (MNB) is a specialized version of NB for problems with multiple possible class labels; the key difference is that MNB considers not only the presence of a feature but also its frequency (Lewis 1998). A pipeline sketch combining TF-IDF features with NB and SVM classifiers follows this list.

  • Support Vector Machine (SVM) SVM is a popular supervised machine learning technique for classification and regression tasks. It is a powerful algorithm capable of performing classification by creating hyperplanes with maximum margins to separate two different classes; the classifier decides on which side of the hyperplane a new data instance lies. The Support Vector Classifier (SVC) classifies input data after fitting the data instances around the hyperplanes. Stochastic Gradient Descent (SGD) is an optimization method used to improve the training of linear classifiers; an SVM optimized by SGD is known as SGD-SVM (Burges 1998; Mining et al. 1998).

  • K-Nearest Neighbour (K-NN) K-NN is a commonly used supervised machine learning algorithm. It is a simple algorithm that finds the similarity of a new text to the data instances already available in the training set and assigns the new instance to the closest similar category (Alpaydin 2020; Khan et al. 2010).

  • Logistic Regression (LR) LR is a probability-based supervised machine learning algorithm. It is a generalization of linear regression to classification problems: the classifier multiplies the input values by learned weights and learns the features that best differentiate between the classes (Hilbe 2016).

  • Decision Trees (DTs) A DT is a decision-making method that classifies text through a hierarchical structure by applying decision rules. It is a supervised machine learning algorithm consisting of a series of successive nodes starting from a root node: the root node is the first node, internal nodes represent tests on features of the input text, and the leaf (decision) nodes represent the class label of the input. The C4.5 (J48) algorithm is a type of decision tree that supports post-pruning, which improves the overall accuracy of the classifier (Alpaydin 2020; Khan et al. 2010).

  • Bagging Bagging (bootstrap aggregating) is an ensemble-based supervised machine learning method. It combines weak classifiers into a strong classifier in order to reduce misclassification errors, aggregating the individual predictions by majority vote. Random Forest (RF) is a variant of the bagging algorithm that combines successive trees and adds a layer of randomness to the bagging mechanism: unlike a single decision tree, RF splits each node by randomly selecting features from a subset of the best features. Widely used boosting ensembles, by contrast, include Adaptive Boosting (AdaBoost) and Gradient Boosting (GBoost) (Quinlan 1996; Llaura and Santi 2017).

  • Interpretable Machine Learning (IML) IML refers to supervised machine learning techniques whose predictions can be explained and justified (Ribeiro and Guestrin 2016).
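To ground these descriptions, the following scikit-learn sketch trains two of the classifiers above (Multinomial NB and a linear SVM) on TF-IDF features; the tiny labeled set of requirements is invented for demonstration.

```python
# Illustrative scikit-learn pipeline: TF-IDF features fed into Naive Bayes
# and a linear SVM; the labeled examples below are invented.
from sklearn.pipeline import make_pipeline
from sklearn.feature_extraction.text import TfidfVectorizer
from sklearn.naive_bayes import MultinomialNB
from sklearn.svm import LinearSVC

requirements = [
    "The system shall encrypt all stored user data",         # security (NFR)
    "The interface shall be usable by first-time visitors",  # usability (NFR)
    "The system shall generate a monthly sales report",      # functional
    "The system shall export invoices as PDF files",         # functional
]
labels = ["NFR", "NFR", "FR", "FR"]

for clf in (MultinomialNB(), LinearSVC()):
    pipeline = make_pipeline(TfidfVectorizer(), clf)
    pipeline.fit(requirements, labels)
    print(type(clf).__name__, pipeline.predict(["The system shall resist attacks"]))
```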

5.3.2 Deep learning techniques

  • Artificial Neural Network (ANN) An ANN, such as the Multi-Layer Perceptron (MLP), is a supervised learning algorithm used for classification tasks. It is a feed-forward neural network consisting of an input layer, hidden layers and an output layer, with the layers connected through weighted edges. The output layer, also called the classification layer, predicts the class of the input sequence (Gershenson 2003).

  • Convolutional Neural Network (CNN) A CNN is a deep neural network consisting of a series of convolutional layers that extract local features for NLP applications. A convolution filter slides over the vector space to calculate the dot product between the filter weights and the word vectors. A max-pooling layer then captures the most important feature (the one with the maximum value), and these features are fed to a fully connected layer for classification (Chen 2015).

  • Long Short-Term Memory (LSTM) LSTM is a deep neural network considered a variant of the Recurrent Neural Network (RNN). The LSTM memory unit consists of input, forget and output gates that control which information is passed on and which is stored in the memory cell for later use (Hochreiter 1997). Bidirectional LSTM (BiLSTM) is a variant of LSTM that, unlike plain LSTM, captures both the preceding and succeeding context of each input feature (Schuster and Paliwal 1997).

  • Gated Recurrent Unit (GRU) The GRU is another deep neural network variant of the RNN. It consists of two gates: an update gate (a combination of the input and forget gates of the LSTM) and a reset gate. The update gate decides how much information should be passed to the current state, and the reset gate decides which information should be discarded. The Bidirectional GRU (BiGRU) likewise captures the preceding and succeeding contextual information of the input sequence (Nowak et al. 2017). A minimal Keras sketch of a CNN text classifier follows this list.
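The following minimal Keras sketch assembles the CNN text classifier described above (embedding, convolution, max-pooling, fully connected output); all hyper-parameters are toy values and do not reflect any reviewed study's configuration.

```python
# Minimal Keras sketch of a CNN text classifier; sizes and data are toy values.
import numpy as np
from tensorflow.keras import layers, models

vocab_size, max_len = 1000, 20

model = models.Sequential([
    layers.Embedding(vocab_size, 64),                     # word vectors
    layers.Conv1D(32, kernel_size=3, activation="relu"),  # local n-gram features
    layers.GlobalMaxPooling1D(),                          # keep strongest feature
    layers.Dense(1, activation="sigmoid"),                # FR vs NFR output
])
model.compile(optimizer="adam", loss="binary_crossentropy", metrics=["accuracy"])

X = np.random.randint(1, vocab_size, size=(8, max_len))  # dummy token IDs
y = np.array([0, 1, 0, 1, 0, 1, 0, 1])                   # dummy labels
model.fit(X, y, epochs=1, verbose=0)
```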

5.3.3 Transfer learning based techniques

  • BERT Bidirectional Encoder Representations from Transformers is a context-dependent representation technique for encoding raw sentences. Since the BERT model operates on raw input, each word must first be converted into tokens; the input embedding of the BERT model itself consists of three embeddings: token embedding, position embedding and segment embedding. The raw input sequence is enclosed with the special tokens [CLS] and [SEP], which indicate the beginning of the sequence and the boundaries between sentences. The BERT model has two variants used for training: Base (L = 12, H = 768, A = 12, 110 M total parameters) and Large (L = 24, H = 1024, A = 16, 340 M total parameters), where L is the number of transformer blocks, H the hidden layer size and A the number of self-attention heads. The output of the BERT model is a sequence of vectors containing contextual information (Devlin et al. 2018). Robustly Optimized BERT (RoBERTa), Distilled BERT (DistilBERT), and Multilingual Distilled BERT (M.DistilBERT) are standardized derivatives of the BERT model: RoBERTa modifies the pre-training procedure, for instance by masking dynamically rather than in a fixed order; DistilBERT uses a smaller number of parameters than BERT to decrease computational cost and save energy; and, unlike DistilBERT, M.DistilBERT also understands other languages such as Chinese and Italian (Araujo et al. 2022). Table 13 gives a brief summary of the AI techniques utilized by the selected studies; a hedged fine-tuning sketch follows.
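As an illustration of fine-tuning BERT for requirements classification, the following sketch uses Hugging Face transformers with PyTorch; the two training sentences and the label encoding are invented.

```python
# Hedged sketch of fine-tuning BERT-base for binary requirements
# classification; sentences and labels below are invented examples.
import torch
from transformers import BertTokenizer, BertForSequenceClassification

tokenizer = BertTokenizer.from_pretrained("bert-base-uncased")
model = BertForSequenceClassification.from_pretrained("bert-base-uncased",
                                                      num_labels=2)

texts = ["The system shall encrypt all user data",
         "The system shall generate monthly reports"]
labels = torch.tensor([1, 0])  # 1 = NFR, 0 = FR (illustrative encoding)

batch = tokenizer(texts, padding=True, truncation=True, return_tensors="pt")
outputs = model(**batch, labels=labels)  # forward pass returns loss and logits
outputs.loss.backward()                  # one gradient step of fine-tuning
print(outputs.logits.shape)              # torch.Size([2, 2])
```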

Table 13 Brief summary of ML, DL, TL, Ensemble models adopted by selected studies

6 Datasets

RQ4 What are the active datasets utilized in requirements classification?

This section includes the public datasets used to evaluate the effectiveness of AI techniques such as ML, DL and TL in requirements classification.

  • Public datasets The majority of the studies utilized public datasets to evaluate their proposed models, while some studies created their own truth sets. As Table 14 shows, out of 61 studies, 48 utilized the PROMISE dataset, and the Electronic Health Record (EHR) dataset, which contains natural language requirements from 12 projects, was used by three studies. A few studies used the DOORS, PURE and RE challenge datasets, and some created their own datasets from SRS documents. PROMISE is the most frequently adopted dataset, largely for the sake of comparison.

  • App review datasets A few papers included in the SLR used natural language app reviews to evaluate the performance of their proposed models. For this purpose, the authors created truth datasets containing reviews of iOS and Android apps. These reviews were manually analyzed and labeled by human annotators, who categorized them into bug reports, feature requests, ratings, aspect evaluations, dependability, usability, and so on. Table 14 presents the number of studies that evaluated the effectiveness of a proposed model on the identification and classification of software requirements from app reviews, and Table 15 presents the type of requirements discussed by each study.

Table 14 Different dataset used by the studies
Table 15 Type of requirements extracted from documents and app reviews

6.1 Evaluation measures

  • RQ5 What measures are used for evaluating these AI techniques? (Table 16).

  • As Table 17 shows, 41 of the 57 studies evaluated their classifier with the help of three standard metrics: precision, recall and F-measure. Precision and recall are often measured together, but there is a trade-off between them, and it is debatable which metric matters more for evaluating a classifier and why. For example, Cleland-Huang et al. (2007) and Stanik et al. (2019) achieved higher recall than precision, which means that their approaches yield more false positives; the authors argued that it is easier for users to discard false positives than to manually detect false negatives in a large dataset of requirements and reviews. However, Younas et al. (2020) and Henao et al. (2021) demonstrated that high recall can overwhelm the developer team with false positives, which can be frustrating, while low recall indicates a risk of missing important information. Consequently, in RE-related tasks, researchers seek acceptable values of both precision and recall. It can be observed from Table 17 that only study S25 achieved higher recall than precision, whereas the other studies obtained high precision with acceptable recall. In comparison to machine learning, deep learning models achieved higher precision and recall, and pre-trained language models also obtained satisfactory performance with acceptable values of both. A sketch of computing these measures follows the list below.

  • F-measure is the harmonic mean of precision and recall. Of the 58 studies, 51 used the F-measure to evaluate classification performance. None of the studies discussed why the authors considered this metric; the majority simply used it as a trade-off value between precision and recall.

  • Accuracy measures the overall proportion of correct predictions. Eighteen of the primary studies (S22, S25, S29, S31, S34, S35, S36, S40, S41, S43, S44, S46, S49, S52, S53, S54, S55 and S57) utilized this metric. Only S55 explained why accuracy was not considered as an evaluation criterion; apart from that, none of the studies discussed why they did or did not use accuracy as an evaluation measure.

  • Confusion matrix The confusion matrix is used to analyze the numbers of true positives, true negatives, false positives and false negatives predicted by the classifier. Only S55 provides a confusion matrix for its classifier.

  • Area Under the Curve (AUC) The AUC graphically summarizes the true and false positive rates; the closer the curve hugs the upper-left corner, the better the classifier's performance. Only three studies (S17, S19, and S45) adopted this metric for performance evaluation.

  • Hamming Loss (HL) The HL measure is specifically used in multi-label classification; the lower the HL value, the better the classifier's performance. Only the multi-label classification studies (S23, S25 and S55) evaluated this metric.
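The metrics above can all be computed with scikit-learn, as in the following sketch on hypothetical predictions (precision, recall, F-measure, accuracy, the confusion matrix, and Hamming loss for a toy multi-label case).

```python
# Combined sketch of the evaluation measures discussed above, computed
# with scikit-learn on invented predictions.
from sklearn.metrics import (precision_recall_fscore_support, accuracy_score,
                             confusion_matrix, hamming_loss)

y_true = [1, 0, 1, 1, 0, 1]   # 1 = NFR, 0 = FR (invented ground truth)
y_pred = [1, 0, 0, 1, 0, 1]

p, r, f1, _ = precision_recall_fscore_support(y_true, y_pred, average="binary")
print(f"precision={p:.2f} recall={r:.2f} f1={f1:.2f}")
print("accuracy:", accuracy_score(y_true, y_pred))
print("confusion matrix:\n", confusion_matrix(y_true, y_pred))

# Hamming loss on a toy multi-label task (lower is better).
y_true_ml = [[1, 0, 1], [0, 1, 0]]
y_pred_ml = [[1, 0, 0], [0, 1, 0]]
print("Hamming loss:", hamming_loss(y_true_ml, y_pred_ml))
```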

Table 16 Different evaluation measures discussed by selected studies
Table 17 Results reported by the selected studies

The studies included in the SLR through the inclusion and exclusion criteria are shown in Table 18, together with the AI techniques reviewed for requirements and app review classification. Different methods, such as SVM, NB, CNN and BERT, yield satisfactory performance in this domain. It is observed that authors mostly evaluated their approaches in terms of precision and recall; the majority of studies did not consider accuracy because of the imbalanced nature of the datasets. A total of 59 studies directly discussed the identification and classification of NFRs along with the AI techniques, pre-processing steps, word representation techniques and evaluation criteria. It is worth mentioning that, although relatively few papers use BERT, the analysis of the results shows that it outperforms both machine and deep learning algorithms in requirements classification studies.

Table 18 Requirements classification papers with AI techniques

It is also observed that SVM and CNN are the most frequent algorithms. The datasets and word embedding techniques are also summarized in the table.

7 Key findings, limitations and open challenges

7.1 Key finding

The key findings identified in our review are:

  • In general, pre-trained language models such as BERT, RoBERTa, RE-BERT, DistilBERT and M.DistilBERT perform well and obtained more than 80% accuracy in the classification of software requirements.

  • Traditional feature extraction techniques (BoW, TF-IDF, Word2Vec and GloVe) are widely exploited in the existing literature. These techniques suffer from the polysemy problem and ignore the syntactic structure of words. Pre-trained language models overcome this problem by retaining the syntactic structure and contextual relationships of words in a sequence.

  • It is observed that SVM and CNN are the two most frequently adopted algorithms for the identification and classification of NFRs from project documents and app reviews, but the best results in requirements classification are achieved by the BERT model.

  • Although ML and DL achieve satisfactory performance in the classification of NFRs, these methods perform poorly on unseen data, i.e., they suffer from poor generalization. BERT-based models perform well on unseen data.

  • A few related works, such as Binkhonain and Zhao (2019), discussed 24 studies that adopted machine learning techniques for the identification and classification of NFRs, and Dabrowski et al. (2020) reviewed studies on the extraction of NFRs from app reviews using ML approaches. However, the utilization of deep learning and pre-trained models for classifying software requirements from formal documents and user reviews is only a small part of those reviews. In contrast, this SLR found 61 studies that extracted software requirements from formal and informal artifacts using AI techniques.

  • Binkhonain and Zhao (2019) concluded that SVM is the best algorithm for requirements classification, and Perez-Verdejo et al. (2020) found that Naïve Bayes and Decision Tree perform better than other ML techniques. However, our review observed that BERT-based algorithms perform better than SVM.

From the studies mentioned above, transfer learning based approaches produce promising results in the classification of software requirements. It is observed that transfer learning models perform well on small as well as large datasets.

7.2 Limitations

While conducting the literature review, a number of limitations were observed in the reporting and evaluation of ML approaches. These limitations are discussed as follows:

7.2.1 Studies are still over old datasets

As Table 14 shows, the most commonly used dataset in these studies is the PROMISE dataset, created by Cleland-Huang in 2006. The dataset contains 625 functional and non-functional requirements, labeled by 15 graduate students. These requirements were manually categorized by the annotators and made available in the PROMISE repository. This is a well-known benchmark dataset widely used in the requirements classification domain.

It is observed that the majority of the studies still use this dataset to facilitate comparison of their results with previous studies; thus, use of the same datasets tends to be preserved. One way to solve this problem is to expand the dataset by adding new requirements; in this way, it would be possible both to compare results with previous work and to evaluate methods on larger benchmarks. Only one study (S48) added new requirements to the PROMISE dataset, so that its dataset contains 6000 requirements.

7.2.2 Scarcity of evaluation and reported results

  • From the selected studies, we noted that supervised learning techniques require labeled datasets to yield satisfactory performance, and that input requirements must be converted into a format understandable by these techniques through pre-processing steps. However, most of the studies presented supervised learning techniques as 'black boxes' and provided no explicit explanation of their actual workings, for example how they extract features or how they retain contextual relationships. As a result, reviewing them is fairly difficult. To ensure the consistency of the SLR, the general process followed by the selected studies is shown in Fig. 3. Moreover, we also assessed the reported results to infer how these approaches work.

  • Out of 61 studies, 48 reported evaluation results. Only S41 and S55 explained why the authors chose precision, recall and F1-score to evaluate classifier performance; apart from these, none of the studies explained why they evaluated precision and recall or which metric value is more important and why. From this it can be concluded that the majority of the studies adopted these metrics for the sake of comparison, without justifying the specific choice. Only S31 publicly provided its code for reproducibility of results, and S25 shared partial code for its machine learning techniques; apart from these, none of the studies provides code for the reproducibility of results.

7.3 Open challenges

Through the overall assessment, this SLR identified the following open challenges faced by practitioners, elaborated as follows:

7.3.1 Need of new benchmark dataset

From the SLR, it is observed that the majority of the studies still use the same datasets (PROMISE, DOORS and EHR) for the sake of comparison with previous studies. These datasets are small and contain a limited number of requirements; thus, there is a need for new benchmark datasets that expand the existing ones so that methods can be evaluated on large datasets. In addition, the existing benchmark datasets are imbalanced in nature, so balancing techniques need more exploration in the context of requirements classification: only S60 employed the Synthetic Minority Over-sampling Technique (SMOTE), and S36 adopted undersampling and oversampling techniques. A minimal SMOTE sketch follows.
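As a pointer for practitioners, the following minimal sketch applies imbalanced-learn's SMOTE to a synthetically generated imbalanced dataset; the data are generated on the fly and do not come from any reviewed benchmark.

```python
# Minimal imbalanced-learn sketch of SMOTE oversampling on synthetic data.
from collections import Counter
from imblearn.over_sampling import SMOTE
from sklearn.datasets import make_classification

X, y = make_classification(n_samples=200, weights=[0.9, 0.1], random_state=0)
print("before:", Counter(y))     # heavily imbalanced classes

X_res, y_res = SMOTE(random_state=0).fit_resample(X, y)
print("after:", Counter(y_res))  # classes balanced by synthetic minority samples
```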

7.3.2 Computational cost

It is worth mentioning that this SLR focused on AI techniques in requirements classification. A problem with these techniques is model complexity, e.g., the output layer requires as many units as there are labels. As a result, models become more complex, increasing computational cost and time. To mitigate this threat, optimized neural network models are needed that remove the trade-off between an algorithm's speed and its computational cost.

8 Threats to validity

In this review paper, the two most important threats to the validity of this study are discussed below.

8.1 Construct validity

Construct validity is one of the crucial threats carrying high risk, because search strings are used to extract the relevant studies. To mitigate this threat, we did not restrict our search to "Artificial Intelligence techniques" but also included the most closely related terms ("Machine Learning", "Deep Learning", "Transfer Learning", and "neural language model") in the PICOC criteria. This enabled us to capture more specific techniques in requirements classification.

The other threat to construct validity is the inclusion/exclusion criteria. Relevant studies are captured on the basis of the review protocol discussed in Sect. 3.3. During this process, some relevant studies may be missed, which leads to bias and ultimately affects the conclusions. To mitigate this threat, the authors of this paper jointly discussed which studies should be included in this review.

8.2 Conclusion validity

It is possible that some studies included in the review do not contain clear information for answering the research questions. To mitigate this threat, the details of such studies were analyzed using other resources written by the same authors or on similar topics, and these analyses were harmonized by all the authors to mitigate personal bias. It is worth noting that the period covered by this SLR is 2012 to 2022. As shown in Table 1, there is currently no other SLR on the use of AI techniques for the identification and classification of software requirements from SRS documents and app reviews: Binkhonain and Zhao (2019) exploited only machine learning techniques in requirements classification, Jivani (2011) discussed only deep learning techniques in requirements classification, and Dąbrowski et al. (2022) identified software requirements from app reviews. This review, in contrast, covers studies that utilized AI techniques for the identification and classification of software requirements from both SRS documents and app reviews.

9 Conclusion

In this research work, an SLR was conducted to investigate the use of AI techniques in the identification and classification of software requirements. The main goal was to understand how these approaches perform in RE. In total, sixty-one studies that adopted automated techniques for requirements classification are reported in this systematic review. Out of the 61 studies, 25 used machine learning techniques, 21 adopted deep learning, and 9 utilized transfer learning based models; six studies proposed ensemble models integrating two techniques. The review results indicate that transformer-based deep learning models outperform other AI techniques.

The main aim of this SLR was to answer the designed research questions and thereby distill possible improvements over the relevant current studies of AI techniques. In a thorough systematic search, 61 studies were examined to answer the research questions by determining how the AI techniques work and which evaluation measures are adopted to evaluate these approaches. For this SLR, a review protocol was designed which describes the aim of this research and how it was achieved. The study selection, inclusion/exclusion, study quality assessment, and data extraction processes were executed in line with the protocol. During study selection, a subset of primary studies was obtained, and with the help of forward and backward snowballing, other relevant studies were added to this subset. Despite their computational complexity, it is concluded that BERT models outperform ML and DL approaches.

A few limitations stem from this review. The first is the old datasets adopted by the majority of the studies: the existing public datasets consist of a limited number of requirements that sometimes fail to match extensive systems and software. The second concerns evaluation measures: standard evaluation measures are required to help developers report and evaluate their studies. The third is that the methodology of a proposed model should be clearly explained, along with its code, so that readers can easily understand and reproduce the studies.

Open challenges are also discussed in this SLR. The first challenge is the creation of new benchmark datasets: the quality and quantity of public datasets need to be improved to encourage research on AI techniques. The findings reveal that shared datasets, tools and code are fundamental needs in RE.

In conclusion, this study reports recent AI advances in RE. More specifically, in combination with AI techniques, new decision-support and intelligent systems can be developed to support RE tasks and software engineering processes. In future work, we intend to further investigate some of these research directions, with an emphasis on the integration of app reviews and software architecture through the use of AI techniques. Moreover, we intend to extend this SLR to explain how AI techniques can support the entire software development process.

10 Appendix 1 Distribution of studies across different countries

See Fig. 5

Fig. 5
figure 5

Distribution of selected studies across different countries

11 Appendix 2

See Table 19.

Table 19 Distribution of studies over publication sources

12 Appendix 3 Distribution of selected studies over different venue

See Fig. 6

Fig. 6
figure 6

Distribution of selected studies over different venue

13 Appendix 4 Reference of selected studies

Paper ID

References

S01

Rashwan, A., 2012, May. Semantic analysis of functional and non-functional requirements in software requirements specifications. In Canadian Conference on Artificial Intelligence (pp. 388–391). Springer, Berlin, Heidelberg

S02

Slankas, J. and Williams, L., 2013, May. Automated extraction of non-functional requirements in available documentation. In 2013 1st International workshop on natural language analysis in software engineering (NaturaLiSE) (pp. 9–16). IEEE

S03

Mahalakshmi, K. and Prabhakar, R., 2015. Hybrid Optimization of SVM for Improved Non-Functional Requirements Classification. International Journal of Applied Engineering Research, 10(20)

S04

Guzman, E., El-Haliby, M. and Bruegge, B., 2015, November. Ensemble methods for app review classification: An approach for software evolution (n). In 2015 30th IEEE/ACM International Conference on Automated Software Engineering (ASE) (pp. 771–776). IEEE

S05

Panichella, S., Di Sorbo, A., Guzman, E., Visaggio, C.A., Canfora, G. and Gall, H.C., 2015, September. How can i improve my app? classifying user reviews for software maintenance and evolution. In 2015 IEEE international conference on software maintenance and evolution (ICSME) (pp. 281–290). IEEE

S06

Gu, X. and Kim, S., 2015, November. "What parts of your apps are loved by users?" (T). In 2015 30th IEEE/ACM International Conference on Automated Software Engineering (ASE) (pp. 760–770). IEEE

S07

Winkler, J. and Vogelsang, A., 2016, September. Automatic classification of requirements based on convolutional neural networks. In 2016 IEEE 24th International Requirements Engineering Conference Workshops (REW) (pp. 39–45). IEEE

S08

Maalej, W., Kurtanović, Z., Nabil, H. and Stanik, C., 2016. On the automatic classification of app reviews. Requirements Engineering, 21(3), pp.311–331. Springer

S09

Kurtanović, Z. and Maalej, W., 2017, September. Automatically classifying functional and non-functional requirements using supervised machine learning. In 2017 IEEE 25th International Requirements Engineering Conference (RE) (pp. 490–495). IEEE

S10

Lu, M. and Liang, P., 2017, June. Automatic classification of non-functional requirements from augmented app user reviews. In Proceedings of the 21st International Conference on Evaluation and Assessment in Software Engineering (pp. 344–353). ACM

S11

Navarro-Almanza, R., Juarez-Ramirez, R. and Licea, G., 2017, October. Towards supporting software engineering using deep learning: A case of software requirements classification. In 2017 5th International Conference in Software Engineering Research and Innovation (CONISOFT) (pp. 116–120). IEEE

S12

Deocadez, R., Harrison, R. and Rodriguez, D., 2017, September. Automatically classifying requirements from app stores: A preliminary study. In 2017 IEEE 25th international requirements engineering conference workshops (REW) (pp. 367–371). IEEE

S13

Johann, T., Stanik, C. and Maalej, W., 2017, September. Safe: A simple approach for feature extraction from app descriptions and app reviews. In 2017 IEEE 25th international requirements engineering conference (RE) (pp. 21–30). IEEE

S14

Ezami, S., 2018. Extracting non-functional requirements from unstructured text (Master’s thesis, University of Waterloo)

S15

Tóth, L. and Vidács, L., 2018, May. Study of various classifiers for identification and classification of non-functional requirements. In International Conference on Computational Science and Its Applications (pp. 492–503). Springer, Cham

S16

Fong, V.L., 2018. Software requirements classification using word embeddings and convolutional neural networks

S17

Stanik, C., Haering, M. and Maalej, W., 2019, September. Classifying multilingual user feedback using traditional machine learning and deep learning. In 2019 IEEE 27th international requirements engineering conference workshops (REW) (pp. 220–226). IEEE

S18

Baker, C., Deng, L., Chakraborty, S. and Dehlinger, J., 2019, July. Automatic multi-class non-functional software requirements classification using neural networks. In 2019 IEEE 43rd Annual Computer Software and Applications Conference (COMPSAC) (Vol. 2, pp. 610–615). IEEE

S19

Dalpiaz, F., Dell’Anna, D., Aydemir, F.B. and Çevikol, S., 2019, September. Requirements classification with interpretable machine learning and dependency parsing. In 2019 IEEE 27th International Requirements Engineering Conference (RE) (pp. 142–152). IEEE

S20

Li, L.F., Jin-An, N.C., Kasirun, Z.M. and Chua, Y.P., 2019. An empirical comparison of machine learning algorithms for classification of software requirements. International Journal of Advanced Computer Science and Applications, 10(11)

S21

Rahman, M.A., Haque, M.A., Tawhid, M.N.A. and Siddik, M.S., 2019, August. Classifying non-functional requirements using RNN variants for quality software development. In Proceedings of the 3rd ACM SIGSOFT International Workshop on Machine Learning Techniques for Software Quality Evaluation (pp. 25–30)

S22

Haque, M.A., Rahman, M.A. and Siddik, M.S., 2019, May. Non-functional requirements classification with feature extraction and machine learning: An empirical study. In 2019 1st International Conference on Advances in Science, Engineering and Robotics Technology (ICASERT) (pp. 1–5). IEEE

S23

Messaoud, M.B., Jenhani, I., Jemaa, N.B. and Mkaouer, M.W., 2019, August. A multi-label active learning approach for mobile app user review classification. In International Conference on Knowledge Science, Engineering and Management (pp. 805–816). Springer, Cham

S24

Younas, M., Wakil, K., Jawawi, D.N., Shah, M.A. and Ahmad, M., 2019. An automated approach for identification of non-functional requirements using word2vec model. International Journal of Advanced Computer Science and Applications, 10(8)

S25

Jha, N. and Mahmoud, A., 2019. Mining non-functional requirements from app store reviews. Empirical Software Engineering, 24(6), pp.3659–3695

S26

Dias Canedo, E. and Cordeiro Mendes, B., 2020. Software requirements classification using machine learning algorithms. Entropy, 22(9), p.1057

S27

Sabir, M., Chrysoulas, C. and Banissi, E., 2020, April. Multi-label classifier to deal with misclassification in non-functional requirements. In World Conference on Information Systems and Technologies (pp. 486–493). Springer, Cham

S28

Aslam, N., Ramay, W.Y., Xia, K. and Sarwar, N., 2020. Convolutional neural network based classification of app reviews. IEEE Access, 8, pp.185619–185628

S29

Rahimi, N., Eassa, F. and Elrefaei, L., 2020. An ensemble machine learning technique for functional requirement classification. Symmetry, 12(10), p.1601

S30

Tiun, S., Mokhtar, U.A., Bakar, S.H. and Saad, S., 2020, April. Classification of functional and non-functional requirement in software requirement using Word2vec and fast Text. In Journal of Physics: Conference Series (Vol. 1529, No. 4, p. 042077). IOP Publishing

S31

Hey, T., Keim, J., Koziolek, A. and Tichy, W.F., 2020, August. NoRBERT: Transfer learning for requirements classification. In 2020 IEEE 28th International Requirements Engineering Conference (RE) (pp. 169–179). IEEE

S32

de Araújo, A.F. and Marcacini, R.M., 2021, March. RE-BERT: Automatic extraction of software requirements from app reviews using BERT language model. In Proceedings of the 36th Annual ACM Symposium on Applied Computing (pp. 1321–1327)

S33

Li, B., Li, Z. and Yang, Y., 2021, September. NFRNet: A Deep Neural Network for Automatic Classification of Non-Functional Requirements. In 2021 IEEE 29th International Requirements Engineering Conference (RE) (pp. 434–435). IEEE

S34

López-Hernández, D.A., Mezura-Montes, E., Ocharán-Hernández, J.O. and Sánchez-García, A.J., 2021, November. Non-functional Requirements Classification using Artificial Neural Networks. In 2021 IEEE International Autumn Meeting on Power, Electronics and Computing (ROPEC) (Vol. 5, pp. 1–6). IEEE

S35

Kici, D., Malik, G., Cevik, M., Parikh, D. and Basar, A., 2021. A BERT-based transfer learning approach to text classification on software requirements specifications. In Canadian Conference on AI

S36

Kici, D., Bozanta, A., Cevik, M., Parikh, D. and Başar, A., 2021, November. Text classification on software requirements specifications using transformer models. In Proceedings of the 31st Annual International Conference on Computer Science and Software Engineering (pp. 163–172)

S37

Quba, G.Y., Al Qaisi, H., Althunibat, A. and AlZu'bi, S., 2021, July. Software requirements classification using machine learning algorithms. In 2021 International Conference on Information Technology (ICIT) (pp. 685–690). IEEE

S38

Shariff, H., 2021. Non-Functional Requirement Detection Using Machine Learning and Natural Language Processing. Turkish Journal of Computer and Mathematics Education (TURCOMAT), 12(3), pp.2224–2229

S39

EzzatiKarami, M. and Madhavji, N.H., 2021, April. Automatically classifying non-functional requirements with feature extraction and supervised machine learning techniques: A research preview. In International Working Conference on Requirements Engineering: Foundation for Software Quality (pp. 71–78). Springer, Cham

S40

Sabir, M., Banissi, E. and Child, M., 2021, March. A deep learning-based framework for the classification of non-functional requirements. In World Conference on Information Systems and Technologies (pp. 591–601). Springer, Cham

S41

Henao, P.R., Fischbach, J., Spies, D., Frattini, J. and Vogelsang, A., 2021, September. Transfer learning for mining feature requests and bug reports from tweets and app store reviews. In 2021 IEEE 29th International Requirements Engineering Conference Workshops (REW) (pp. 80–86). IEEE

S42

Talele, P. and Phalnikar, R., 2021. Software requirements classification and prioritisation using machine learning. In Machine learning for predictive analysis (pp. 257–267). Springer, Singapore

S43

Shreda, Q.A. and Hanani, A.A., 2021. Identifying non-functional requirements from unconstrained documents using natural language processing and machine learning approaches. IEEE Access

S44

Gnanasekaran, R.K., Chakraborty, S., Dehlinger, J. and Deng, L., 2021. Using Recurrent Neural Networks for Classification of Natural Language-based Non-functional Requirements. In REFSQ Workshops

S45

Jindal, R., Malhotra, R., Jain, A. and Bansal, A., 2021. Mining Non-Functional Requirements using Machine Learning Techniques. e-Informatica Software Engineering Journal, 15(1)

S46

Mekala, R.R., Irfan, A., Groen, E.C., Porter, A. and Lindvall, M., 2021, September. Classifying user requirements from online feedback in small dataset environments using deep learning. In 2021 IEEE 29th International Requirements Engineering Conference (RE) (pp. 139–149). IEEE

S47

Hidayat, T. and Rochimah, S., 2021, December. NFR Classification using Keyword Extraction and CNN on App Reviews. In 2021 4th International Seminar on Research of Information Technology and Intelligent Systems (ISRITI) (pp. 211–216). IEEE

S48

de Araújo, A.F., Gôlo, M.P. and Marcacini, R.M., 2022. Opinion mining for app reviews: an analysis of textual representation and predictive models. Automated Software Engineering, 29(1), pp.1–30

S49

Li, B. and Nong, X., 2022. Automatically classifying non-functional requirements using deep neural network. Pattern Recognition, 132, p.108948

S50

Dave, D. and Anu, V., 2022, June. Identifying Functional and Non-functional Software Requirements From User App Reviews. In 2022 IEEE International IOT, Electronics and Mechatronics Conference (IEMTRONICS) (pp. 1–6). IEEE

S51

Khayashi, F., Jamasb, B., Akbari, R. and Shamsinejadbabaki, P., 2022. Deep Learning Methods for Software Requirement Classification: A Performance Study on the PURE dataset. arXiv preprint arXiv:2211.05286

S52

Li, G., Zheng, C., Li, M. and Wang, H., 2022. Automatic Requirements Classification Based on Graph Attention Network. IEEE Access, 10, pp.30080–30090

S53

Kaur, K. and Kaur, P., 2022. SABDM: A self-attention based bidirectional-RNN deep model for requirements classification. Journal of Software: Evolution and Process, p.e2430

S54

Handa, N., Sharma, A. and Gupta, A., 2022. Framework for prediction and classification of non functional requirements: a novel vision. Cluster Computing, 25(2), pp.1155–1173

S55

AlDhafer, O., Ahmad, I. and Mahmood, S., 2022. An end-to-end deep learning system for requirements classification using recurrent neural networks. Information and Software Technology, 147, p.106877

S56

Vijayvargiya, S., Kumar, L., Malapati, A., Murthy, L.B. and Misra, S., 2022. Software Functional Requirements Classification Using Ensemble Learning. In International Conference on Computational Science and Its Applications (pp. 678–691). Springer, Cham

S57

Vijayvargiya, S., Kumar, L., Murthy, L.B. and Misra, S., 2022, September. Software Requirements Classification using Deep-learning Approach with Various Hidden Layers. In 2022 17th Conference on Computer Science and Intelligence Systems (FedCSIS) (pp. 895–904). IEEE

S58

Jp, S., Menon, V.K., Soman, K.P. and Ojha, A.K., 2022. A Non-Exclusive Multi-Class Convolutional Neural Network for the Classification of Functional Requirements in AUTOSAR Software Requirement Specification Text. IEEE Access, 10, pp.117707–117714

S59

Duan, S., Liu, J. and Peng, Z., 2022. RCBERT: An approach with transfer learning for app reviews classification. In CCF Conference on Computer Supported Cooperative Work and Social Computing (pp. 444–457). Springer, Singapore

S60

Kumar, L., Baldwa, S., Jambavalikar, S.M., Murthy, L.B. and Krishna, A., 2022. Software Functional and Non-function Requirement Classification Using Word-Embedding. In International Conference on Advanced Information Networking and Applications (pp. 167–179). Springer, Cham

S61

Luo, X., Xue, Y., Xing, Z. and Sun, J., 2023. PRCBERT: Prompt learning for requirement classification using BERT-based pretrained language models. In Proceedings of the 37th IEEE/ACM International Conference on Automated Software Engineering (pp. 1–13)