1 Introduction

Software Development is a complex process with ever-changing technologies and requirements, primarily due to business process changes. A critical activity in software development is Requirements Engineering (RE), which focuses on software systems' real-world objectives, functions, and constraints [1]. Requirements engineering has high significance for the quality of the software product. Owing to its inherently complex and interdisciplinary nature, RE is a challenging field in software and systems development; it is crucial for development success [2]. The RE process emphasises the systematic and recursive techniques that ensure system requirements' completeness, consistency, and relevance [3]. This implies that the RE process aims to establish a robust foundation for system development, and the process strives to minimise errors, enhance stakeholder communication, and increase the chances of developing a successful system. Requirements engineering is a crucial process that involves capturing and analysing stakeholder needs, and natural language documents serve as one of the compelling mediums for expressing and documenting those requirements over the years.

Natural language (NL) plays a significant role in requirements engineering as it is the primary means of communication between stakeholders and the software development team. This accounts for 79 per cent of all documents [4]. Software requirements are primarily written in natural language [5,6,7]. Therefore, the role of NL in requirements analysis and documentation cannot be overemphasised, as already highlighted in a study by Ryan in 1993 [8]. Although natural language is inherently object-oriented and descriptive, possessing strong representation power, its syntax and semantics are not formal enough to serve as a programming language; thus, requirements documentation written in NL needs to be reinterpreted into a formal specification language by software engineers [9]. While natural language requirements can present challenges, such as ambiguity or subjectivity, employing systematic approaches and leveraging natural language processing (NLP), techniques can improve the effectiveness and efficiency of the RE process in dealing with natural language artifacts. Natural language processing provides essential techniques for extracting information from descriptions of textual requirements, such as use cases, scenarios, user stories, and transcripts of conversations for requirements elicitation [10]. The applications of NLP for RE are studied in [11], which identifies six categories that encompass the possible activities of NLP in requirements engineering: classification, prioritisation, ambiguity removal, requirements elicitation, requirements assessment, and requirements analysis. To achieve these activities, various automated frameworks and tools have been proposed and developed. Automated requirements engineering aims to reduce the amount of time and labour expenses of RE while maintaining precise and thorough requirements.

Automated requirements engineering refers to using software tools and techniques to support and automate eliciting, analysing, specifying, validating, and managing software requirements. These tools can help streamline and optimise the requirements engineering process, which can be complex and time-consuming. The concept of computer-aided requirements engineering can be traced back to the 1980 s, as early works by Teichroew and Sayani [12] predicted that requirements engineers would eventually need to transition from manual methods to computer-aided approaches, much like how programmers have replaced manual programming with online programming. The goal of automation in requirements engineering is to minimise the time, effort, and cost involved in the RE process, as well as software development in general, while simultaneously enhancing the quality and accuracy of the requirements. This has been demonstrated in several research works in the literature, encompassing various aspects of requirements engineering activities. Automated tools such as DoMoBoT [13], TRAM [14], TestMEReq [15], ScenarioAmigo [16], SUGAR [17], CM-Builder [18], aTouCan [19], Requirements-Collector [20] among others, have been developed and empirically evaluated. However, it is essential to note that these tools have yet to be widely adopted on an industrial scale. Despite this, industry experts and the research community recognise the need for tool support to automate requirements engineering activities, particularly in automatically generating Unified Modelling Language (UML) diagrams [21]. These tools leverage Artificial Intelligence (AI) capabilities to automate tasks such as requirements elicitation, analysis, validation, and management. Integrating automated tools and AI allows for more efficient and effective handling of complex requirements, improved accuracy, and the potential for innovative approaches in software development.

Considering the vast potential of automated requirements engineering, our objective is to investigate the advancements achieved thus far in various aspects of these tools, including the generated output, RE phases, extent of automation, and tool evaluation methodologies. The motivation for this study arises from the substantial impact of requirements engineering on software quality. Specifically, we aim to examine the influence of automated RE on the requirements engineering process and determine the added value compared to traditional RE approaches. Notably, there has been a significant increase in research dedicated to automated RE recently. Consequently, this paper presents a systematic literature review to identify, evaluate, and analyse existing research on automated requirements engineering. Thus, the contribution of this study is summarised as follows: (i) Comprehensive Investigation: the systematic review explores various dimensions of automated RE tools, including their output, the phases of RE they target, the degree of automation they offer, and the methodologies used to evaluate these tools. (ii) Motivated by Software Quality: the study is motivated by the substantial impact that RE has on software quality. It seeks to understand how automation influences the RE process and assesses its added value compared to traditional RE methods. (iii) Timely and Relevant: given the recent surge in research focused on automated RE, this paper addresses the current state of the field. It systematically analyses existing research, offering valuable insights into the advancements and trends in automated RE. Consequently, it provides updated literature in the field.

The remainder of this paper is organised as follows: Sect. 2 discusses the theoretical background, focusing on traditional versus automated RE and the review of related works. Section 3 describes the methodology employed for the literature analysis. Section 4 presents the results of the literature analysis. Section 5 provides a discussion of the results. Finally, Sect. 6 concludes by explaining the limitations of the study and highlighting prospects for future research and directions.

2 Background

Software requirements describe the structure of a software application's development process and the software's primary goal and objectives for the development team. Thus, providing information that specifies the outlook of a software application. The process of creating, documenting, and maintaining requirements statements is referred to as requirements engineering. It is based on a framework that outlines the key structural components and process elements. It also offers a solid foundation for the process fundamentals, guiding concepts and methods, not software project- or development methodology-specific. The framework proposed by [22], which is shown in Fig. 1, is an example. To build a vision within an existing context, the framework identifies the key structural components of a requirements engineering process. The building block that makes up the framework are System context; The three fundamental requirements engineering activities (Elicitation, documentation, and negotiation); Two cross-sectional activities (Validation and management); and Requirements artefacts (Goals, scenarios, and solution-oriented requirements). Over the years, there has been significant improvement in the methodologies of RE processes. These processes range from sequential (traditional RE) to more iterative and incremental (agile RE). The traditional requirements engineering process's primary objective is creating a system requirements document for knowledge transfer. In contrast, agile techniques emphasise direct interaction between customers and agile teams to achieve a similar goal. Depending on the application area, the individuals involved, and the organisation providing the requirements, several requirements engineering procedures are employed [23]. According to Sommerville, all processes have the following generic activities (phases) in common:

  • Requirements elicitation. The process of determining and gathering requirements from sources such as stakeholders. Both functional and non-functional requirements are included here.

  • Requirements analysis and specification. This is the logical decomposition and structuring of the elicitation's process. It involves having a thorough understanding of the requirements and organising both the textual information and the derived requirements into model diagrams and written documentation.

  • Requirements validation. Ensures that information is gathered accurately and is organised well to fulfil system business goals. This is accomplished by verifying the documents' accuracy, completeness, and correctness and/or models that describe requirements.

  • Requirements management. This keeps track of changes in requirements and ensures that those changes are made to meet stakeholder’s requirements.

Fig. 1
figure 1

RE framework [22]

Traditional requirements engineering has mostly been based on the idea that requirements exist implicitly in stakeholders' minds and have focused on models and procedures to help identify and document such requirements [24]. However, with the rapid growth and dynamics in the software market, rapid changes in the business processes and increases in customer demands, there is a need to adopt methodologies that will accelerate software development.

2.1 The traditional requirements engineering process

Requirements engineering in traditional software development methodologies, such as the waterfall model, plays a crucial role in the early stages of the development lifecycle. It involves systematically gathering, analysing, validating, and managing software requirements. The waterfall model process activities are performed in a sequence of separate steps where preferably each step is finished before the next one begins [25]. A primary characteristic of this approach is that the software is detailed up-front. Since each project stage must be finished before moving on to the next, this leads to much documentation. The traditional approach starts with gathering and documenting a "complete" set of requirements, then moves on to architectural and high-level design, development, testing and maintenance [23]. Figure 2 depicts the development phases in the waterfall model.

Fig. 2
figure 2

Waterfall model [23]

The RE activities typically constitute an earlier part of the waterfall model before development activities begin, and it is a reference point for subsequent development stages. During RE in the waterfall model, requirements are often gathered through interviews, meetings, and stakeholder discussions [25]. These requirements are then documented in a requirements specification document as a baseline for the entire development process. One of the critical characteristics of requirements engineering in the waterfall model is its linear and sequential nature. Once requirements are finalized and documented, they are expected to stay the same throughout development. Any modifications or additions to requirements may require formal change requests and impact the project timeline and budget.

Requirements engineering is referred to as the first stage of the development process in early waterfall models of software development. However, more recent approaches to software development (such as the Rational Unified Process and Agile, among others) assume that requirements engineering continues across the system's whole life cycle [26]. The plan-driven process nature of the waterfall model made it compulsory to schedule and plan all the process activities before starting work on them. This usually resulted in colossal documentation. The model, however, is appropriate when the requirements are precise and unlikely to change dramatically during system development. The increasing need to address the limitations of the traditional requirements engineering process and software development has given birth to methodologies like agile software development (ASD). Additionally, customers need help explaining their requirements up front clearly. More so, the industry and technology evolve too quickly, and requirements change at rates that overwhelm established traditional methodologies [27]. Agile development is characterized by quick, iterative, and incremental development. Some standard requirements elicitation techniques in agile development are interviews, user stories and rapid feedback. Text mining, an automated technique for generating requirements documents, has gained recent attention [28]. These methodologies have significantly contributed to automating requirements engineering activities and software development.

2.2 Automation of software engineering processes

Automated Software Engineering represents a critical research area within software user requirements, as it centres on automating software processes to enhance the quality and productivity of software development [5]. Automation of software engineering processes involves using tools and techniques to automate various tasks involved in the software development process. These tasks include requirements analysis, software testing, software maintenance, code review, and code analysis. Automation in software engineering processes has become increasingly popular over the years, as it can help improve the quality of software development processes, reduce development time and costs, and increase the efficiency of software development teams. In recent years, there have been many research efforts in the automation of software engineering processes, and several tools and frameworks have been developed to automate various tasks involved in software development, especially in requirements engineering processes. Most of the tools reported in the selected primary studies are products of research efforts in RE. Additionally, other software engineering aspects have their share of the research outputs. Test data generation [29, 30], code generation [31, 32], code analysis [33], software testing [34, 35], and software maintenance [36] have all been areas of significant study. However, this study focuses on works related to automated requirements engineering.

Automated requirements engineering is a field of software engineering that leverages automation techniques to enhance the effectiveness and efficiency of the RE process. Automated RE encompasses various activities within RE, such as requirements elicitation, analysis, documentation, validation, and management, which can be automated using NLP, machine learning (ML), knowledge-based systems, and other artificial intelligence techniques. These techniques and approaches have provided a new paradigm in the RE sub-domain and software development in general. Even though the tools that will help analyse requirements automatically are still evolving through research, especially in using NLP and other AI techniques to improve the quality, efficiency, and consistency of the RE process. The existing commercial graphical CASE tools significantly help document the output of the Analysis and Design phases of software development and aid in identifying incompleteness and inconsistencies within an analysis [18]. Nevertheless, these tools do not contribute to the challenging initial stage of the analysis process, which involves identifying the object classes, attributes, and relationships utilised to model the problem domain.

Therefore, the systematic literature review (SLR) presented in this work aims to analyse and evaluate the available research on automated RE comprehensively. The findings of this review provide a valuable resource for researchers and practitioners seeking to understand better the state of the art in automated RE and its potential applications in software development. In this study, we consider studies that attempt to automate one activity of RE or the other. Typically, these activities have traditionally been carried out manually by human experts, and instead by automation a tool has been employed to do such activities on behalf of humans—for example, requirements elicitation, classification, modelling, checking requirements inconsistencies and duplication, among others. In automated requirements engineering, support tools are used to facilitate the activities of various components of the requirements engineering framework, which constitute the majority of the activities of RE. Over the years, many research efforts have been automating RE activities from elicitation to requirements management. These automation activities cut across various domains, for example, mobile [37], scientific systems [38], security [39], crowdsourcing [40], open-source applications [41], safety–critical systems [42], and train protection [43]. Accordingly, the automated support comprises both core framework activities and requirements artefacts.

2.3 Application of natural language processing in RE

The services a software system should provide to satisfy the needs and preferences of stakeholders are typically described in software requirements, which are often expressed in natural language [24]. For a long time, natural language has played a significant role in requirements engineering and software development in general. However, natural language is often a poor choice for representing requirements because of its innate propensity for ambiguity, inconsistency, redundancy, etc. [44,45,46]. Despite these shortcomings, natural language is best for experimentation and communication. It is a tool that human minds have evolved over millennia to use for just that [47]. Natural language processing techniques facilitate text processing which aids RE activities. Requirements elicitation is supported by extracting relevant lexical entries from the vast textual data generated by elicitation techniques (stakeholder interviews, group meetings, protocol analysis or participant observation) [10]. Some NLP technologies and techniques include rule-based techniques, Part-Of-Speech (POS) tagging and Lexico-Syntactic Pattern (LSP), and Speech act-based analysis techniques, among others.

On the other hand, despite the recent intensive efforts to produce formal specifications and automated toolkits, the practical significance of requirements documents expressed in natural language and the demand for user participation throughout the software development life cycle remain. Nevertheless, the ambitious objective of automation of RE activities is promising and has a stake in the future of requirements engineering. A classic example is the selected primary studies reported in this systematic review. The current state of the art of NLP technology has proven it is possible to automate requirements analysis and save significant time analysts spend [48]. Most studies reviewed in this study have applied one NLP technique or the other to demonstrate how RE activities can be automated.

2.4 Summary of related literature reviews

To understand what has been previously researched and avoid duplication of research efforts in the identification, evaluation, and analysis of literature through a systematic literature review, we searched literature reviews and surveys in this subject area. This process helps us gain insights into the current state of research and establish the need for the SLR. Following an extensive search, we discovered publications that have conducted systematic reviews or surveys on various aspects of software development pertaining to requirements engineering. This section presents examples of literature reviews related to automated RE found in software engineering research literature. The summary of these reviews is presented in Table 1. Consequently, we summarise the distinctions between these previous studies and the present one in the final paragraph of this section.

Table 1 An overview of related work

In 2011, Yue et al. [49] systematically reviewed approaches for transforming user requirements into analysis models. The review focused on studies that converted textual specifications into models for analysis, especially from publications from 1996 to 2008. The analyses established 16 techniques from the literature that were employed by numerous publications reviewed. In analysing the findings, they found that no existing methodologies were sufficiently efficient, could automatically or semi-automatically produce a complete and consistent analysis model, or needed acceptable user efforts to specify requirements. They also noted that the majority of methods did not handle traceability. As a result, they advocated for developing a traceability system to establish and maintain relationships between requirements items and analysis model elements. However, the impact of the transformation approaches on RE phases was not mentioned.

In 2012, Carrillo de Gea et al., [50] surveyed to assess the capabilities of requirement engineering tools. The survey's objectives were to gain insights into the extent to which requirements engineering tools support the requirements engineering process. The findings revealed that most tools were delivered under proprietary licenses and were typically not free. Furthermore, it was observed that requirements elicitation enjoyed more significant support regarding tools, while requirements modelling and management had the least support. The analysis presented are limited to proprietary tools.

Meth, Brhel, et al., [51] analysed the state of automated requirements elicitation through a systematic review. In the RE sub-domain, the review determined the state of automated requirements elicitation at the time. The review covered works on automated requirements elicitation from 1992 to 2012 by identifying and analysing 36 primary studies. The review conceptualises an analysis framework for works in automated elicitation and formulating future research directions. In contrast, we provide analysis of the development techniques and various RE phases the support tools address.

Through a systematic review, Yang et al. [52] studied requirements modelling and analysis for self-adaptive systems. The review aimed to investigate past research modelling techniques, RE activities, requirements quality activities, research topics, and application areas. According to the findings, 16 modelling techniques were used in 11 RE activities, and the most frequently stated application domains were online applications and service-based systems. Additionally, they stated that most of the research was conducted in American and European countries, making them very difficult to apply appropriately in the discourse of requirements modelling in developing countries. Our work is not limited to modelling methods but what specific models were generated as output from the support tool.

The purpose of the study by Abdouli et al., 2016 [53] was to survey works that transform requirements into UML diagrams from early manual procedures in 1996 to automated tools in 2015. The research also examined the methods for transforming requirements into models, using five trends of classification: inspection, NLP, heuristics, patterns/graphs, and ontology. Furthermore, the advantages and disadvantages of the reviewed tools were highlighted. The authors concluded that the trend of integrating AI into RE was promising. However, in this report, the analysis of the tools reported are not systematic.

Dawood and Saraoui [54] conducted a survey study of requirements engineering to UML using NLP. The study comprised of a literature review of existing tools to establish their strength and weaknesses; and a survey with a questionnaire distributed among research groups, academia, and practitioners. The results revealed that most users/organisations still manually generated UML diagrams from NL. The researchers drew their conclusion on the understanding that more research in the field of NLP was necessary to construct UML diagrams automatically or semi-automatically efficiently. The analysis of strengths and weaknesses presented in the literature was deduced by the authors and not conducted systematically.

Schön et al. [55] examined agile RE through a systematic literature review. The review focused on agile requirements engineering, which investigated stakeholders’ involvement in the process in the existing approaches. The methodologies used to offer user perspective and requirement management procedures were also investigated. The review covered empirical works from 1995 to 2015. The findings showed that ASD has a weak foundation for establishing a shared understanding of the user perspective. The review also revealed problems concerning the direct involvement of users and stakeholders in ASD. As a result, they identified four approaches incorporated into ASD by the selected publication to understand user needs better. The approaches were human-centred design, design thinking, contextual inquiry, and participatory design. The review is limited to agile requirements engineering.

Ahmed et al. [56] studied the automatic transformation of natural language to UML. The study focuses on various approaches used in the transformation by highlighting their pros and cons and metrics. The study also proposed a conceptual framework with further improvement guidelines. The study identifies 70 primary studies covering from 1994 to 2021. However, the review did not take into account the impact of these tools on the RE phase and evaluation approaches for the tools selected in the primary studies.

Through a systematic review, Kolahdouz-Rahimi et al. [57] examine requirements formalisation using NLP and ML. The study reviewed 47 relevant studies and reported that heuristic NLP approaches and deep learning are the commonly used techniques for requirements formalisation using NLP and ML respectively. The study also reported difficulties in comparing the performance of different formalisation approaches due to the absence of standard benchmarks cases for requirements formalisation. The study focused on assessing the evaluation criteria without discussing the analysis methodologies.

While conducting an extensive systematic literature review, our study endeavours to compile and analyse existing evidence concerning automated requirements engineering and its potential implications for future research. In contrast to the previously mentioned studies, the present study delves into the specifics of automated RE activities across distinct phases of the RE process. Unlike previous literature reviews, which offered a more generalized perspective, our study scrutinizes the automation efforts pertaining to key RE phases, including elicitation, analysis and specification, validation, and management. Furthermore, we aim to shed light on the diverse technologies and techniques employed in the automation process, as well as the resultant outcomes produced by supporting tools. As a result, our study seeks to establish connection between automation and the different activities of RE within the realm of software development. Our objectives include exploring the extent to which automation has been integrated into various stages of the software development lifecycle, its significance, identifying best practices, evaluation methods, and assessing the value it has contributed to the RE domain compared to traditional RE approaches.

3 Methodology

Evidence-Based Software Engineering seeks to enhance decision-making concerning software development and maintenance by fusing the most recent research's best evidence with real-world knowledge and ethical principles [58]. The systematic literature review, which focuses on identifying, classifying, and evaluating the extent of research in a particular study topic, is a crucial technique for evidence-based software engineering. Accordingly, the systematic literature review is a common and frequently used methodology in software engineering [59]. Consequently, this review aims to identify, assess, and interpret all the available research related to automated requirements engineering support. Therefore, we adhered to the recommendations made in [60] for technical relevance and reliability. Thus, we needed to outline the three critical phases of our systematic review: planning, conducting, and documentation. Figure 3 shows a typical research methodology for systematic literature review, which aided our review.

Fig. 3
figure 3

Research methodology for SLR

3.1 Research questions

This study investigated the empirical evidence of automated RE support. As a review, it focused more on answering the following research questions and their respective motivation, as provided in Table 2. The SLR is limited to the investigated and reported implemented tools to help in requirements engineering activities rather than the technicalities of the approaches used as published in the primary studies.

Table 2 Research questions

3.2 Search strategies

Once the research questions were established, a set of keywords aligned with the research objectives was selected and used for the search. The search strategy was refined to ensure the identification of relevant information that would facilitate an effective and comprehensive investigation of the research questions. Typically, this process involves four steps: selecting appropriate digital libraries, identifying additional search sources, determining relevant search keywords, and establishing the time frame for published articles. For this study, a search was performed across four databases: ACM, Elsevier (Science Direct), IEEE Xplore, and Springer. The decision to utilise these databases was based on their accessibility, extensive coverage, and specific relevance to the topic. These selected databases offer a wide range of scholarly resources that are highly relevant to the research topic.

The major goal of the procedure was to find potential primary study candidate papers. Applying search parameters, the title and the entire article text were searched. Inclusion and exclusion criteria were used to filter the articles published between 1996 and 2022, which were the only ones that were found through the search. Details of the inclusion and exclusion criteria are provided in Sect. 3.3.

Search terms/keywords were utilised to create search strings to get the desired result and make the search process more manageable. A comprehensive search was completed in December 2022 using the final combination of search terms interchangeably. “requirements engineering” OR “requirement elicitation” OR “requirement gathering” OR “software requirements engineering” OR “requirements identification” OR “requirements analysis” OR “requirements specification” OR “requirements modelling” OR “software modelling” OR “modelling” OR “requirements documentation” OR “requirements validation” OR “model extraction” OR “model generation” OR “requirements verification” OR “requirements management”, AND (automat* OR Computer Aided OR Computer Supported Software Engineering OR computer-assisted).

These search strings were applied manually in each of the databases, that is, based on the search functionality provided by the database. Additionally, we treated each database search as a learning and experimenting process. The first author conducted the literature search, data extraction, and analysis, while all authors undertook the review and quality analysis.

3.3 Inclusion and exclusion criteria

After screening, the eligibility for selecting primary studies was determined by applying inclusion and exclusion criteria. Only studies that provided empirical data and findings showcasing the implementation of automated requirements engineering tool assistance for requirements formalization were included to address the research questions.

The inclusion criteria: studies between 1996 and 2022. This date range was defined based on the observation that early work on automated RE emerged in 1995. Further, English language studies; studies relevant to the specified search string; peer-reviewed research, as well as original studies were the other criteria.

While the exclusion criteria included: the primary work that was not published as a chapter, or in conference proceedings or journals; duplicate papers (papers with conference and journal versions). More so, the subject was not directly related to automated RE support; editorials, keynotes, and short papers (less than three pages) were not included.

3.4 Quality assessment

A quality assessment is carried out during the review process to evaluate and validate the primary studies that were previously identified. The fundamental goal of the authors' quality assessment was to ensure (at least to some degree) that our findings would be supported by good-quality empirical research. According to Daun et al. [61] there is no common standard for quality assessment. However, the commonly suggested quantitative approach to quality assessment is by including publications that have been peer reviewed. Therefore, the quality assessment criteria used by [62] were adopted. Accordingly, the quality criteria offer a method for selecting suitable studies that would add to the significance of this research and were critically considered. As shown in Table 3, some of the quality assessment questions related to the primary studies' minimal quality threshold, rigour, credibility, and relevance. The degree to which we could be confident that the results of a particular study could significantly advance the review was measured by these criteria. A dichotomous scale ("yes" or "no") was used to grade the quality assessment criteria, where 1 represents Yes and 0 represents No. As a result, studies with at least one "no" response to the first three questions were disregarded because this review needed to meet a minimum quality threshold. Seven articles were eliminated when the quality assessment criteria were applied. As a result, 85 research in all were chosen as primary studies, accounting for 92.4% of all the studies that had their quality evaluated. Any disagreements were resolved through discussion among the authors.

Table 3 Quality assessment criteria adopted by [62]

3.5 Primary study

After reviewing the studies, we found 85 primary papers ranging from 1996 to 2022. These studies were collected from journal articles, conference proceedings, and book chapters. In the initial stage, 3853 studies were found by applying the search string to the publication databases. The studies were cut down to 425 after title-based selection. Phase 3 saw the short-listed studies reduced to 268 after reading the abstract and eliminating duplicate studies. Following thorough text evaluation, 85 studies were determined to be total. Figure 4 details all four stages of the selection process. Table 9 lists the identified primary studies (Appendix).

Fig. 4
figure 4

Phases of selection process

3.6 Data collection/extraction

Data collection is one of the crucial stages in a systematic review, and it is performed to extract information from each selected primary study to address our research questions. Data were collected manually from each of the primary studies. The authors went through each paper and manually extracted data into a prepared spreadsheet template. Afterward, the extracted data were collectively reviewed, and any conflicts were discussed and resolved. Subsequently, the data were analysed as a group, and a summary is presented in the results section. Furthermore, we have summarised the collected data, which is presented in Table 9 in Appendix. The data extracted were technically divided into three:

  • Foundational details: (title, authors, publication date and location).

  • Publication data: Journal, conference, chapter, date (year), publisher, publication title, volumes, issues, pages, keywords, and abstract.

  • Review data: tool development approaches/techniques, tool input source, RE phase, model proposed, degree of automation, type of requirements, tool evaluation method.

From the preceding, we therefore, present the report and the findings and statistical analysis of the SLR gathered from the selected primary studies in the next section.

4 Results

This section answers the research questions posed in Sect. 3.1 and highlights the review's findings.

4.1 Overview of the studies

As mentioned earlier, a total of 85 studies were identified for critical evaluation concerning automated RE support. The data extracted is summarized in Table 9 (Appendix). The table summarises each study selected, along with the study ID, author(s), reference, study title, year of publication, publisher, name, and citation type.

With respect to the years of publication, the first publication we reviewed was published in 1998. From other publications, works from 2016 and 2019 recorded the highest number, followed by publications from 2009, 2015 and 2017. The distribution of the reviewed papers spanned 1998 to 2022. The distribution of publications over the given period is shown in Fig. 5. This shows a generally increasing trend in the number of publications over time.

Fig. 5
figure 5

Distribution of publication over the years

The findings in Table 4 indicate that the studies were published in various venues. Springer accounts for 38.8% (33) as the highest publishing venue for automated RE papers. Then, IEEE and Elsevier account for 32.9% (28) and 17.6% (15), respectively. On the other hand, ACM accounts for 10.6% (9) of the total selected primary studies.

Table 4 Number of publications in each database

The selected primary studies were mainly journals, conferences, and chapter publications. Of the 85 primary studies selected, 41 appeared in conferences, 33 in journals and 11 in book chapters, as presented in Fig. 6. On journal-specific publications, Requirements Engineering Journal has the highest number of papers associated with automated RE. The number of selected primary studies in each publication venue is shown in Table 5.

Fig. 6
figure 6

Distribution of papers by citation type

Table 5 Classification of the selected publications

Accurately capture the geographical location of authors, we considered each author's location individually, as some of the papers had co-authors from different locations. Notably, most authors were from Europe and Asia, as depicted in Fig. 7. On the other hand, there were relatively few studies on automated requirements engineering from Africa, Australia, Oceania, and South America. Most of these publications are interconnected through citation, forming a network of references, as illustrated in Fig. 8. These connections among the articles through citations play a crucial role in establishing the research context, demonstrating scholarly contribution, strengthening validity and reliability, tracing research evolution, and building a knowledge network within the systematic literature review.

Fig. 7
figure 7

Distribution of authors geographic locations

Fig. 8
figure 8

Citation map of the selected primary studies

4.2 What type of output or models are being generated by the automated tools according to published empirical studies?

One of the communication tools between the RE team and the software development team is the UML models, as they provide a common language and understanding of the requirements. UML models are a valuable tool in the requirements engineering process, as they help to ensure that the requirements are well-defined and accurately capture the needs of the stakeholders. In this study, several UML models were generated as output by the support tools reported in the primary studies. Figure 9 shows some of the models produced by these automated tools. As depicted by the chart, the class diagram (domain model) is the most widely produced model among the tools, followed by the use case diagram and the structured requirements document. However, some studies did not specify any model output, constituting the highest number of primary studies. This category of studies mainly focused on several RE activities, which included improving one RE process or the other. For example, recommendation of omitted steps in use case scenario [16], requirements mining framework [38, 63,64,65], checking inconsistency of requirements [66, 67], requirements validation and review [15, 43, 68, 69], requirements classification [40, 41, 70,71,72,73], audio mining and visualization [74], duplicate requirements [75], automated requirements reuse [76], systematic analysis of NL [42, 47], enhancing the security of RE [37, 39]. In total, UML diagrams generated as output constitute 44.7% (38), while others with no model output made 55.3% (47) of the primary studies.

Fig. 9
figure 9

Distribution of generated output

An automated RE tool receives various inputs to facilitate its process. These inputs are the foundation for the tool to perform its automated functions effectively. Some common inputs to an automated RE tool include natural language requirements; domain knowledge; existing models or artefacts; stakeholder feedback; and metadata. It is important to note that the specific inputs required by an automated RE tool may differ based on its capabilities and functionalities. From our findings, most inputs used by these tools are written in natural language, including unstructured [13, 77, 78], semi-structured text [75, 79, 80]–[82], and structured text [7]. Others are domain knowledge [83], business process models [28] and feedback through voice [74]. In general, natural language text is the most prominent input for most tools, accounting for 94% of their usage. The automated requirements engineering software tools proposed in the primary studies are mostly one shot RE tool and a few editor tools. In conclusion, the value of automated RE generated UML and other outputs lies in improved visualisation, enhanced requirement analysis, early mistake identification, system design and development support, and improved traceability. These advantages contribute to a more efficient, accurate, and collaborative RE process, resulting in better software systems that meets the needs and expectations of stakeholders.

4.3 Which requirements engineering phase is mostly automated?

Requirements Engineering is made of phases, also known as activities. As mentioned earlier and defined by Sommerville [23], the RE phases include requirements elicitation, analysis and specification, validation and management. The tools proposed in the primary studies have automated various tasks within the requirements engineering (RE) process, which can be classified into different RE phases. Although some studies mentioned the specific RE phase their tool supports, we still carefully examine each study for classification. Assigning a primary study to a particular RE phase can be challenging due to the possibility of tool activities spanning multiple phases. The type of the task and how it relates to the broader RE process can be used to categorise an automated RE task into a RE phase. Therefore, we define the following approaches to categorise the primary studies' automated RE tasks into a RE phase:

  • Task Focus: determine the automated task focus to ascertain whether it is requirements elicitation, analysis, documentation, validation, or management. This classification aligns the task to the appropriate phase of the RE process.

  • Input–Output Mapping: examine the automated task's input and output and relate it to the specific phases of RE. For example, suppose the automated task receives unstructured textual requirements and produces structured use case diagrams. In that case, it can be classified as an analysis task that transforms requirements into a more formal representation.

  • Relationship to RE Activities: analyse the relationship between the automated task and the activities typically carried out during the RE phases. For instance, the activity can be classified as an elicitation task if it uses NLP techniques to extract key terms and concepts from requirements documents. This helps to capture and comprehend stakeholder requirements.

  • Impact on Phase Objectives: consider the general goals of each RE phase and how the automated task contributes to accomplishing those goals. If the task's goal is to maintain requirement consistency and traceability, it may align with the validation or management phases, which emphasise quality assurance and requirement tracking.

  • Integration with RE Processes: examine how the automated task fits into the more extensive RE processes or methodology. Determine whether it is a necessary aspect of a given phase or if it spans multiple phases. For example, if the activity involves automated requirements traceability analysis, it may be related to both the analysis and validation phases.

The approaches described above give a general outline for categorising automated tasks within the context of the RE phases, providing a better understanding of where the task belongs within the entire RE process. Therefore, the activity determined the requirements engineering phase for which each primary work was classified. According to published empirical studies, requirements analysis was the most widely automated phase of the RE activities, accounting for 49.53% (53) of the total selected primary studies. Requirements elicitation and management accounted for 23.36% (25) and 14.95% (16), respectively. On the other hand, requirements validation was the least automated RE phase resulting in 12.15% (13) of the total selected primary studies. Figure 10 shows the extent to which each RE phase has been automated. The analysis phase is critical because it ensures a complete grasp of stakeholder needs, refines requirements, fosters collaboration, establishes project scope, identifies risks, and acts as a basis for later development activities. Therefore, a well-executed analysis phase contributes considerably to the software project's success by providing a firm foundation for the development process and facilitating good communication and alignment among stakeholders.

Fig. 10
figure 10

Most automated RE phase

4.4 To what degree is the automation of the requirements engineering support tool?

Automation is a complete or partial replacement of a task that a human operator previously performed, which suggests that automation can vary along a continuum of levels, from the lowest level of entirely manual performance to the highest level of full automation, rather than being all or nothing [84]. Thus, an automated RE support tool could be semi-automated or complete/full automation. However, this study focused only on automated tools semi or complete. Of the 85 primary studies selected, 50 used semi-automated tools, and 35 had complete automation, as represented in Fig. 11. All the literature reported in this review has implemented their respective proposed tools. The majority, representing 58.8% of the selected studies' artefacts, were semi-automated. In this case, the possible interaction between the human analyst and the tool varied. These variations ranged from input pre-processing, refining data during processing to output interpretation and analysis. In contrast, 41.2% of the primary studies carried out full automation. Despite the authors proposing full automation, none proposed complete replacement requirement engineers. However, the output of automated RE provided the human analyst with an artefact to be verified, thereby reducing development efforts, and saving a great deal of time spent on RE activities. Similarly, it was found that giving analysts an initial version of the requirements model based on informal textual descriptions could significantly speed up the development process because domain and modelling experts could begin modifying the requirements model much earlier [85]. In the primary studies, there is no specific mention of the extent of automation. This suggests that the primary studies needed to provide explicit information about the level or degree of automation achieved in their research.

Fig. 11
figure 11

Degree of tool automation

In addition to classifying support tools reported as either semi-automation or full automation, we have explored the use of the ten levels of automation of decision and action selection and the four-stage model of human information processing proposed in [84]. Based on the system description provided in the primary studies, we have subjected each paper to evaluation to determine the extent of the automation based on the levels of automation design and action selection provided in Table 6. This was done against the four types/stages of automation of information processing, i.e., (i) Information acquisition (ii) Information analysis (iii) Decision selection, and (iv) Action implementation. Figure 12 shows the results of the levels of automation of various support tools reported in the primary studies. From the results, information analysis has a higher degree of automation than the other three stages. In summary, information analysis is a critical stage in information processing, and automation technologies like machine learning and other AI technologies have advanced significantly in recent years, making it feasible to automate complex analytical tasks. This trend has led to more automation of information analysis than the other stages, enabling the extraction of valuable insights from data efficiently and making informed decisions.

Table 6 Levels of automation design [84]
Fig. 12
figure 12

Level of automation of Degree of tool automation

4.5 What are the development techniques/approach employed in the development of the support tools?

In Fig. 13, the results are classified by development techniques or approaches employed by authors. Natural Language Processing techniques are the most widely used, accounting for 50.6% (43) of the selected primary studies. In the context of automated RE tools, NLP techniques are used to process and interpret natural language requirements, enabling automated understanding, extraction, and analysis of textual information. Machine Learning techniques recorded 14.1% (12) as a development approach, which involves the development of algorithms and models that can learn from data and make predictions or take actions without explicit programming for tasks relating to requirements classification, clustering, or prediction. On the other hand, domain ontology techniques accounted for 5.9% of the selected studies employed to capture and model requirements knowledge. These ontologies capture domain-specific concepts, relationships, and constraints, providing a shared understanding and common vocabulary for representing requirements within a particular domain. Studies that used text mining techniques are four which represent 4.7% of the studies. Some studies combined more than one technique, for example, a combination of NLP and ML; NLP and Ontology had five studies each, accounting for 5.9% each. Other techniques applied were Information retrieval, Data mining, courteous logic, Tropos methodology, and Casco Accurate Description Language (CASDL). These categories supported different levels of abstraction (in terms of techniques, methodology, and logic). In addition, Table 7 shows the specific technologies used to develop the proposed tools. Accordingly, POS tagging, Parsers are the most widely used NLP technologies in tool development. While Naïve Bayes, support vector machines and neural networks are the most frequently used machine learning techniques for classifying and clustering.

Fig. 13
figure 13

Distribution of development technique/approach

Table 7 Technologies employed in the development of tools support

4.6 How are these tools evaluated?

This question aimed to identify the evaluation approach(s) employed in evaluating the automated RE tools. Table 8 shows the results categorized by evaluating method and RE phase.

Table 8 Evaluation methods

Automated RE tools play a crucial role in analysing and managing requirements. Thus, evaluating these tools is essential to ensure their effectiveness, suitability, and value for the intended purpose. Automated RE tool evaluation involves assessing various aspects of the tool with several factors considered, including functionality, usability, scalability, performance, integration capabilities, support for industry standards, and compatibility with existing tools and processes. To achieve this, several evaluation methods are employed, as reported by the primary studies. The primary studies employ the experimental method, case study evaluation, prototyping/proof of concept and user study. The choice of evaluation method is aligned with the specific objectives of the evaluation. It provides valuable insights into the tool's performance, usability, availability of resources and overall suitability for the intended use.

Most of the primary studies (41) applied a controlled experimental method. This indicates that these studies conducted experiments to evaluate and demonstrate the effectiveness of their approaches. The extent of the experiments varied across studies. Some authors had a single experiment with independent examples or data to explain their proposed method (e.g., studies referenced as [64, 66, 86]). Other studies used numerous in-depth examples based on data from industrial practice, providing a more comprehensive evaluation (e.g., studies referenced as [28, 75, 76]). The case study evaluation method was employed by 25 studies, representing 29.4% of the selected primary studies. Case studies are often conducted on practical live projects or industry applications to assess the proposed approaches' feasibility, effectiveness, and real-world applicability. These studies typically involve in-depth investigations of specific cases or scenarios to provide a more contextualised understanding of the benefits and limitations of the developed tools. An example of industry evaluation using case study evaluation is the study by Arora et al. [7], which conducted an evaluation using four industry requirements documents combined with expert evaluations to extract domain models from textual requirements. The results from the evaluation show that the potential of automated tools in modelling is enormous because their tool achieved approximately 90% accuracy in generating domain model classes. Prototyping, specifically for proof-of-concept purposes, was used in 12 studies. Prototyping involves building a preliminary version or prototype of the proposed system or tool to showcase its functionality, feasibility, and potential applications. These prototypes demonstrate how the methods can be applied in practice. The last evaluation method, user study, was employed by seven studies representing 8.2%. User studies involve gathering feedback and insights directly from end-users to evaluate the proposed tools' applicability, usability, and user experience. Most of the authors evaluated the performance of their artefacts with that of a human expert and compared them against defined/specified functionalities.

5 Discussion

As stated previously, the purpose of this study was to investigate automated RE support and bring to light more discussion supported by extant literature. Eighty-five papers relating to automated RE support were selected for this study. This objective was accomplished through the formulation and answer to research questions. The implications of the findings from Sect. 4 are therefore discussed in this section.

On a general note, it was observed that publications related to automated RE date to 1995, and the state of research has, over the years, improved significantly. The observed improvement in publications on automated RE can be attributed to several factors. Technological advancements in artificial intelligence and machine learning have provided new tools and techniques for automated RE. The increased recognition of the importance of RE in software development has led to more research and development efforts in the field, with a focus on automation. The limitations of traditional manual RE methods have driven the exploration of automated solutions for improved efficiency, accuracy, and scalability. On the other hand, we found that most of the publications focused on functional requirements. This type of requirement is concerned with the functional operations of the system, defining system functionalities and constraints. It is important to note that agile teams in the requirements engineering process are primarily concerned with this type of requirement. Another important aspect of our findings is the geographical location of the authors of our selected studies, as most of the papers were authored by Europeans and Asians. Most of the European contributions came from authors from the United Kingdom, the Netherlands, Germany, Sweden, Norway, Italy, Luxembourg, Spain, Austria, Ireland, Serbia, Switzerland, and Portugal. Asian countries took second place in the contribution, involving authors from China, Malaysia, India, Palestine, South Korea, Pakistan, Jordan, Singapore, Japan, Israel, Turkey, and UAE. North American countries came third, involving only US, Canada, and Mexico authors. Finally, the remaining continents, Africa, Australia, Oceania, and South America, have contributions from one country, involving Morocco, Australia, New Zealand, and Argentina, respectively. The empirical data from the selected 85 studies could not be generalized due to the uneven distribution of authors across geographic regions. Therefore, there is a need for location-based studies like that of [87] to empirically establish state-of-the-art research in most continents regarding automated RE and software engineering generally. Geographical, psychological, and sociological factors may influence how anything is interpreted and understood when described in natural language [88].

Concerning RQ1, we observed that the class diagram was the most widely generated model. This could result from the class diagram representing high-level models that are useful during the requirements analysis and the initial stages of the system design. Class diagrams are also the most-used UML model in software development in general. This model type structurally captures the domain problem by representing them as classes, attributes, relationships, and association cardinalities. Overall, a class diagram was also used for stakeholder discussions, providing bases for discussion among the development team and providing agile team members with a basic understanding of the system's overall structure. Thus, a class diagram is easy to understand but consumes significant time to construct. Nevertheless, automated tools will go a long way in reducing the time spent generating class diagrams. In addition to class diagrams, use case models, structured requirements documents, and entity-relationship diagrams have also been generated as output from various tools. Automated tool support for automatically constructing an analysis model is critical to model-driven development (MDD) because it enables the MDD lifecycle to move quickly from the coding phase to the specification phase when using precise formal languages [14]. Hence, using automated RE tools to generate class diagrams and other UML diagrams has a promising future in the software RE domain and software development in general. Furthermore, the relationship between the Automated RE-generated UML output and the input is one of translation, interpretation, and representation. The UML output reflects the information and structure present in the input data but in a more organised, standardised, and visual form that facilitates analysis, communication, and further development activities. Therefore, input representation and standardisation, integration and interoperability, ambiguity handling, and support for various input sources are issues related to tool input that might contribute to developing more effective input for automated RE tools. Overall, the research implications surrounding the additional values of Automated RE-generated outputs, especially UML outputs, focus on automated RE technique evaluation, integration with development processes, automation of UML model evolution, context-specific adaptations, human aspects, and performance scalability. Addressing these research implications will allow the area of automated RE to progress, providing more robust and effective methods for automating the production of UML models in the RE process.

The primary RE task is requirements analysis and validation, which includes understanding user requirements, classifying them, and modelling static and dynamic perspectives of software requirements [89]. We have identified the most automated RE phase in response to RQ2: requirement analysis. According to empirical evidence, requirements analysis is the most widely automated RE phase. This could be because the phase allows for analysing the requirements outlined. Alternatively, it also allows for stakeholder deliberations. One of the most essential activities in the RE process is requirement analysis. This activity assists developers and other relevant stakeholders in understanding requirements, their overlaps, and conflicts, as well as in managing conflict and creating a unified set of requirements. Usually, the "Requirement analysis" phase of any software development process is often the most important because it serves as the foundation for all subsequent development work [48]. It is also important to note that, at this stage, various UML models are conceived, and the entire system functionalities are defined and analysed. Thus, this calls for more practical research into the automation of this stage to minimise human intervention to the barest minimum. One of the most significant research issues nowadays is automated software engineering, particularly when analysing requirements and modelling [5]. Thus, extracting requirements artefacts in the analysis stage requires a great deal of skill, and this has been the motivating factor for some of the primary studies to automate the analysis phase. For example, in [48], it was found that before modelling a system, a developer must analyse the use case descriptions to identify actors and functionalities to understand the system's dynamics. Similarly, extracting actions and actors from a linguistic viewpoint requires skill and time and is challenging to master [78]. More specifically, the motivation for the automation of the analysis phase is: saving time and reducing development costs [5, 13, 48, 90], leveraging the powers of NLP tools and technologies [14, 18, 19, 80, 91], avoiding or minimising human mistake in reading and analysing volumes of text written in NL [79, 89, 92] and iterative nature of the analysis phase [48]. Therefore, we propose that future research consider an automated analysis framework that will fit with agile development. Furthermore, the automated analysis that will allow for traceability between the textual requirements and the formalised models be developed. This way, developers and customers can easily see which original statements from which some formalised requirements are derived. Thus, investigating the research implications of the automated analysis phase in automated RE can help develop the field by providing insights into the effectiveness, accuracy, and consistency of automated RE tools and their adoption, impact, and improvement. It can direct future research and bridge the gap between academic research and practical application in industry.

Accordingly, we have identified the extent of automation of the RE support tools in response to RQ3. As mentioned earlier, most of the tools had semi-automation, which should allow a reasonable part of the activities of a human analyst to be done automatically. Ironically, this does not in any way replace human expertise. Whether complete or semi-complete automation, there is a need for human expert validation since it is the human that will use the output for the remaining phases of development. The automation has proved to be effective in reducing the workload on the human analyst to manually carried out the RE activities such as model generation [77, 80, 81], inconsistencies in requirement [66, 67], checking duplicate requirements [75] and omissions in requirement [16]. Remarkably, there is still the need for researchers in the field to provide more explicit and specific descriptions of the level of automation that will significantly reduce human efforts and development time. This could include information on the automated tasks, processes, or functionalities covered and the extent to which human intervention or manual effort is required. Another important aspect of automation is the human-in-the-loop considerations. At the same time, semi-automation implies a level of human involvement in the RE process, while full automation suggests minimal or no human intervention. Therefore, research can delve into the role of humans in the loop and investigate their contribution, decision-making authority, and interaction with automated tools. This can provide insights into the optimal balance between automation and human expertise, ensuring that automated RE tools effectively leverage human intelligence while reducing manual effort.

We identified Natural Language Processing techniques as the most used in developing various RE automated tools in response to RQ4. As mentioned earlier, NLP plays a significant role in RE activities because most requirements are written in Natural languages. Some natural language techniques employed by the primary studies include linguistic analysis [93, 94], clustering algorithm [16, 95], pattern matching [38], core NLP [78, 94], NL Pasrser [9, 28, 47, 68, 76, 79, 88, 90, 95], text chunking [24, 94], Part of Speech (POS) Tagger [24, 80, 94] to mention but a few. These techniques have been applied in various RE phases, elicitation, analysis, specification, management, and validation. The NLP helps in the text analysis and context meaning to allow for understanding text for onward requirements analysis. Generally, requirements engineering is considered a critical success factor in software development, and most requirements are written in NL, which is challenging to analyse computationally. It has been noted that the need for adequate techniques and tools for computer-assisted processing of early software requirements is one of the major problems in software development [86]. To address these challenges, NLP techniques have been employed. The NLP techniques allow text written in NL to be analysed through major components; lexical analysis (pre-processing, tokenisation, morphology rules, lexicon), syntactic analysis (parsing and mapping), and semantic analysis (drawing, checking, including) [78]. Therefore, pre-processing, classification, and post-processing (mapping relationships) are common processes used in generating most of the analysis models in the literature. In addition to the NLP techniques, machine learning techniques and their application in automated requirements engineering are evolving with many potentials. The proliferation of digitisation has resulted in vast data that allows machine processing. Machine Learning has been employed in some of the primary studies of tasks related to classification [39, 40, 48, 96] and clustering [78, 97]. The combination of NLP and ML techniques has also proven more effective in automated RE support. Since both technologies perform different tasks, while NLP is used mostly for text pre-processing (lexical analysis), machine learning is used for classification and clustering. For example, the outputs of tools like ReqAligner [75], which assists analysts in identifying duplicate functionality in textual use cases, and DoMoBOT[13], an automated tool for interactive domain modelling, have aided the development of many automated tools. Like other tools reported in the primary studies, these have proven the effective use of NLP and ML techniques in automated RE support. However, the increasing use of NLP and ML approaches in developing automated RE tools has various research implications, including algorithm selection, performance evaluation, data requirements, transfer learning, privacy, human-AI collaboration, and bias considerations. As a result of addressing these implications, researchers can advance the field of automated RE and enhance the capabilities, reliability, and ethical soundness of automated requirements engineering processes.

With respect to RQ5 on how these tools are evaluated, we identified the controlled experimental evaluation method to be the most widely used by authors. The experiment is one of the most widely used evaluation methods in research. In tool development, an experiment has proven effective in evaluating tools, especially at the early stage before industrial case study evaluation. It helps reveal lags and uncovers errors and shortcomings of a tool before subjecting it to an industrial case study. During the evaluation, most of the proposed artefacts are evaluated against the expected functionalities or the manually generated analysis models. For instance, [13] evaluated the automatically generated class diagram against models manually generated from various problem specifications commonly used in universities. Experiments are also cost-effective, and since most authors are academics, their colleagues and students usually constitute part of the experiment in testing or evaluating such tools with minimal budgets. The fact that experimental evaluation is the most widely used approach highlights its significance in assessing the effectiveness and performance of automated requirements engineering tools. Researchers and practitioners should continue to prioritize rigorous experimental evaluations to provide empirical evidence and quantitative data supporting the claims and benefits of these tools. While case studies and prototypes represent a smaller percentage of evaluation methods, their usage indicates their value in providing deeper insights into the practical application and real-world effectiveness of automated requirements engineering tools. This finding suggests that further exploring case studies and prototypes can contribute to a more comprehensive understanding of tool performance in specific contexts or scenarios. The lower usage of user studies as an evaluation method suggests a need for more attention to the user's perspective. User studies can provide valuable insights into user experiences, satisfaction, and usability aspects of automated requirements engineering tools. The distribution of evaluation methods indicates the importance of adopting a balanced approach that includes a mix of evaluation methods. Combining experimental evaluations, case studies, prototypes, and user studies can provide a more holistic understanding of automated requirements engineering tools' strengths, limitations, and practical implications. In terms of comparative evaluation of different tools against each other, this has not been done, partly because there are no existing established requirements cases on which such comparisons could be performed.

6 Conclusion and future work

6.1 Limitation of the review

The limitations of any systematic review are multifaceted, encompassing potential subjective selection of studies for inclusion. This can arise if the selection process needs more transparency or if personal biases influence the decision-making. Limiting the search to specific databases can also be a limitation, as it may result in incomplete coverage of the literature. A flawed protocol can impact the reliability and validity of the review and introduce bias to the findings. To mitigate these limitations, the guidelines provided in [60] were employed to minimize biases in study selection. However, it is worth noting that during the search process, there is always a chance of overlooking relevant studies, as highlighted by [62]. Moreover, the influence of language on search strings in software engineering further complicates the search for studies. To enhance data gathering accuracy and minimize internal validity threats, an iterative selection process was employed. This approach aligns with the rigour and thoroughness principles underpinning the systematic literature review methodology. By employing this iterative process, we can provide a comprehensive and reliable synthesis of the existing evidence on the chosen topic, making the review a valuable resource for decision-making and further research. By establishing specific inclusion and exclusion criteria, search strategy, data extraction methods, and quality assessment criteria, the selection preference was minimized (at least to some extent), focusing on obtaining the most relevant studies. This approach aimed to produce more generalizable results for the systematic literature review, alleviating potential external validity threats.

6.2 Conclusion and future work

This paper presents a systematic literature review on automated requirements engineering. The review was conducted by searching for and classifying all available studies on automated requirements engineering, following established guidelines for conducting a systematic literature review. Eighty-five papers were selected, reviewed, and systematically analysed based on five specified research questions. These questions aimed to explore the state-of-the-art research in automated requirements engineering, focusing on tool output, automation of requirements engineering phases, level/degree of automation, development approaches/techniques, and tool evaluation methods. As a result, the following responses to the research questions were obtained:

  • Researchers increasingly focus on automated requirements engineering, resulting in many publications and various outputs. The most significant output supported by automated tools is UML models (44.7%), while other outputs are primarily related to activities/tasks, such as omission of steps, consistency checking, and requirement validation.

  • Most automated tools (49.53) are developed to automate activities in the analysis phase of requirements engineering, making it the most widely automated phase.

  • Results indicate that 59% of the tools are semi-automated, requiring some human intervention to perform specified tasks.

  • Natural language processing technologies are widely employed in developing automated requirements engineering tools, accounting for 50.6% of the total studies, with POS tagging and Parsers being the most used NLP technologies.

  • Controlled experimental methods are most frequently used (48.2%) to evaluate automated requirements engineering tools, while user studies are the least employed evaluation method (8.2%).

Over the past two decades, significant research progress has been made in the academic space regarding automated requirements engineering. However, there is limited application and evaluation of results in the industry, which would provide valuable real-world feedback. This is mainly due to the experimental stage of automated requirements engineering applications. Another notable finding is the need for comparative evaluation analysis among these tools, which may be attributed to the absence of a common dataset. In conclusion, this systematic review endeavours to aggregate empirical findings from publications on automated requirements engineering, shedding light on current practices. These practices are expected to guide researchers and practitioners in applying and evaluating support tools designed to assist requirements engineering processes/activities. Furthermore, this review introduces a new paradigm in requirements engineering, leveraging artificial intelligence and related techniques to facilitate data-driven requirements engineering.

Our future study will determine the extent of traceability among the produced artefacts. There is a need to create a traceability mechanism to maintain links between requirements elements and the created artefacts. Additionally, we will conduct an interview-based study to explore automated requirements engineering practices by industry practitioners to complement the findings of the systematic literature review. On the other hand, there is an increasing potential for using NLP technologies for automated requirements analysis due to the recent advances in artificial intelligence, which enable robust context-sensitive analysis of natural language texts. Therefore, it is also essential to empirically investigate deeper to facilitate a more comprehensive understanding of an optimal process for automated requirements analysis, for example, identifying the data concepts (classes) and features/attributes in the first stage and as a basis for further detailed analysis of functionality. Finally, there is equally the potential for studies on a comparative evaluation of different approaches on the exact requirements of case studies.