Introduction

Industrial enterprises face demanding challenges concerning the competitiveness and sustainability of their operations (Aheleroff et al., 2022). Global competition, resource scarcity, and increasing customer demands for individualized products necessitate shorter product development cycles and more agile production environments (Ashtari Talkhestani et al., 2019). To tackle these challenges, it is envisioned that production systems become increasingly interconnected, cooperative, and autonomous. Assets such as production machines and products will exhibit a high degree of intelligence, supported by enhanced data collection and information processing capabilities (Monostori, 2014). This vision is commonly referred to as Industry 4.0 (Sakurada & Leitão, 2020). The realization of this vision is expected to yield several benefits, including the optimization of production processes, increased product individualization, improved robustness of production systems, enhanced system transparency, and increased resource efficiency (Cardin, 2019; Monostori, 2014). To realize these benefits and implement autonomous, cooperative, and interconnected production systems, suitable use of information and communication technology is required.

The Digital Twin (DT) and its standardized implementation, the Asset Administration Shell (AAS), are often seen as one of the most prominent paradigms to enable the process of digitization within Industry 4.0 (Sakurada et al., 2022a). Software Agents, particularly Multi-Agent Systems (MAS), are often viewed as a key enabler for autonomous and self acting DTs (Sakurada et al., 2022a; Vogel-Heuser et al., 2020; Sakurada & Leitão, 2020) as well as an enabler for realizing the full potential of Industry 4.0 (Karnouskos et al., 2019).

Therefore, this work investigates two software paradigms commonly viewed as key enablers of Industry 4.0: Software Agents (referred to as Agents) and Digital Twins (Karnouskos et al., 2020).

Agent paradigm

An Agent is “[...] a computer system, situated in some environment, that is capable of flexible autonomous action in order to meet its design objectives” (Jennings et al., 1998). Agents are commonly associated with properties and capabilities such as autonomy, reactivity, proactivity, deliberativeness, persistence, encapsulation, and communicative abilities (VDI, 2010), which are required for interacting with one another in so called MAS. There are a number of advantages that distinguish Agent-based automation solutions from conventional, centralized concepts (Jennings & Bussmann, 2003; VDI, 2010):

  • Conceptual advantages arise from inherent problem decomposition during engineering

  • No single point of failure or communication bottleneck

  • Interaction with the environment and utilization of local knowledge, reducing communication traffic.

  • Flexible communication paths and organizational relationships

  • The distribution of computing power enables high scalability, facilitating flexible and efficient adaptation to changing production requirements and workloads.

  • Adaptive and flexible behavior, allowing for dynamic reconfiguration and coordination of Agents in response to changing conditions and requirements in the industrial environment

  • Independent operation of Agents, enabling MAS to compensate failures of other Agents

  • By coordinating efforts among Agents, decentralized optimization of production processes can be achieved, improving resource allocation, task allocation, and scheduling

Digital twin paradigm

The concept of the DT as a virtual representation of an asset has first been described by Grieves (2014). Over the past decade, various definitions have surfaced in an attempt to delineate and encapsulate this concept more precisely. NASA defined the DT “as an integrated multiphysics, multiscale, probabilistic simulation of an as-built vehicle or system that uses the best available physical models, sensor updates, fleet history, etc., to mirror the life of its corresponding flying twin” (Glaessgen & Stargel, 2012). Moreover, Madni et al. (2019) distinguish the DT from a Computer Aided Design/Computer Aided Engineering model. In this context, they emphasize, among other things, that the DT is a “...specific instance...” of a physical twin that “...combines data...” and is used to “...observe system performance...” and “...enable traceability...” across different life cycle phases. In this work, the definitions introduced by Tao et al. (2018) and Kritzinger et al. (2018) are referred to and used as a guideline in order to establish a common understanding of this concept. Both Tao et al. (2019) and Kritzinger et al. (2018) have presented two definitions and descriptions of the DT, its components, and the objectives of its usage, which are frequently cited and acknowledged by the academic community. Accordingly, the aim here is to cultivate a shared understanding of the DT concept through such referencing. Tao et al. (2018) define the DT as a digital representation of an asset throughout its life cycle, capable of reflecting its static properties, its dynamic behavior as well as its current condition. In this context, a DT can mirror the real world object or system in real time, thus being beneficial for use cases like monitoring, forecasting, analysis and optimization purposes as well as control (Kritzinger et al., 2018; Tao et al., 2018). Interaction between the digital representation, consisting of relevant digital models, and the physical asset is achieved by establishing a bidirectional and automated data flow (Kritzinger et al., 2018). This enables continuous model improvement and synchronization with the real world asset or system (Tao et al., 2018). The DT includes static data such as technical documentation, as well as dynamic data such as mathematical or simulation models that reflect the asset’s behavior (Kritzinger et al., 2018).

In this work the DT is considered to take both, a passive as well as an active role. In the passive role, similar to the digital shadow concept proposed by Kritzinger et al. (2018), synchronization of digital models with the physical asset occurs by establishing a unidirectional data flow. During this process, data is collected, stored, and used for condition monitoring, analysis, or optimization using the existing digital models. However, no immediate or simultaneous influence is exerted on the physical asset. In contrast, in the active role, the DT is fully integrated by establishing a bidirectional data flow and has control over the physical asset. Both roles are taken into account by the authors of this paper for the systematic comparison of the paradigms.

The advantages of using DTs are the following (Kritzinger et al., 2018; Tao et al., 2018, 2019; Pires et al., 2021; Mashaly, 2021):

  • Efficient design and development of Industry 4.0 conform production systems by means of virtual modeling and simulation of products, processes, and systems in the early stages of development or during reconfiguration.

  • Facilitating cooperation among designers, engineers, operators, and manufacturers during the engineering phase, thanks to a digital system model.

  • Improving operational efficiency by providing real time information about the condition, performance, and behavior of physical assets.

  • Enabling condition monitoring using real time operational data and historical data

  • Supporting operational decisions by visualizing operational data.

Motivation

While the previous conceptual descriptions of Agents and DTs suggest certain differences between the two paradigms, a growing number of applications of Agents and DTs for similar purposes can be observed in the literature. For instance, both Agents and DTs have been employed in production scheduling (Klein et al., 2018; Tliba et al., 2023), fault diagnosis (Xu et al., 2019; Seng Ng & Srinivasan, 2010), reconfiguration of manufacturing processes (Leng et al., 2020; Rodrigues et al., 2018), and the control of production resources at the control device level (Leng et al., 2020; Schutz et al., 2011). Furthermore, similar sets of requirements for realizing the vision of Industry 4.0 are addressed by approaches utilizing both paradigms (Seitz et al., 2021; Redelinghuys et al., 2020). Such similarities in the purposes served by Agents and DTs can be indicative of general similarities between the two paradigms. A systematic comparison of Agents and DTs allows for a more in-depth investigation of the extent of this similarity. By identifying applications of the two paradigms that exhibit similarities, researchers specializing in Agents, for example, can gain insights into potential solutions and best practices for a given problem from the realm of DTs, and vice versa. This approach can help researchers to avoid pitfalls identified in one paradigm when working with the other. Analogously to similarities between Agents and DTs, significant differences between the two paradigms can be identified through systematic comparison, providing guidance to researchers, especially those with limited experience, on which paradigm is best suited for solving a particular problem.

In addition to the applications of Agents and DTs for similar purposes discussed previously, some publications employ Agents and DTs synergistically. For instance, Vogel-Heuser et al. (2021) utilize DTs as a standardized knowledge base for Agents, while Ashtari Talkhestani et al. (2019) present an architecture for an intelligent DT and employs Agents to facilitate co-simulation between different DTs. Another example of the concurrent and synergistic use of Agents and DTs involves the application of design patterns from the Agent paradigm to develop “proactive” AASs, resulting in DTs exhibiting Agent-like capabilities, such as proactivity (López et al., 2023). A systematic comparison of Agents and DTs will also aid in establishing guidelines on which applications can benefit from a synergistic utilization of Agents and DTs, where the strengths of one paradigm compensate for the weaknesses of the other. Analogously to synergies, dissonances between the two paradigms can be identified, resulting in guidelines on how not to apply Agents and Digital Twins simultaneously. Based on a systematic comparison, a starting point can be set for a strategic co-evolution or even a prospective integration of the two paradigms.

To summarize the previous paragraphs, three observations should be highlighted:

  1. 1.

    Agents and DTs are sometimes applied for similar purposes.

  2. 2.

    There are use cases in which Agents and DTs are used synergistically.

  3. 3.

    The formal definitions of Agents and DTs indicate significant differences between the paradigms.

This work further explores these observations to determine the similarities and differences of Agents and DTs within actual implementations in industrial automated production systems. By focusing on actual implementations of Agents and DTs, purely conceptual reports are disregarded in this work. This is done to obtain insights into the capabilities required for industrial applications and evaluate their feasibility. The exclusion of purely theoretical reports is based on the premise that concepts can be proposed regardless of their applicability in or suitability for industrial practice.

Therefore, the research questions pursued in this work can be summarized as follows:

  1. 1.

    What are the capabilities and properties exhibited by Agents and DTs?

  2. 2.

    What purposes do Agents and DTs serve within industrial production systems?

  3. 3.

    What are differences, similarities, and potential synergies between Agents and DTs in industrial practice?

To address these research questions, a systematic analysis of reports describing applications of Agents and DTs is conducted. Based on the analysis, the properties and capabilities exhibited by Agents and DTs are identified. Additionally, each application of Agents and DTs is allocated to the elements of the Reference Architecture Model Industry 4.0 (RAMI 4.0), i.e., the layers, hierarchy levels and life cycle phases, which the respective report primarily addresses. Finally, the purposes fulfilled by the Agents and DTs are determined for each report. The precise method for conducting these analyses is described in “Methodology” section.

The insights obtained from the conducted analyses can serve as a basis for practitioners, looking for the most suitable software paradigm to solve specific problems. The focus on actual implementations thereby allows to move beyond conceptual comparisons and investigate what is actually implemented in practice. This in turn can serve as a base for discussions on the conceptual definitions of the paradigms’ definitions. In fact, it has been suggested by other authors that “the Digital Twin would benefit from a more detailed comparison and review in the context of similar and connected fields” (Jones et al., 2020) and that “some authors use the term [Digital Twin] merely as a catchphrase” (Sjarov et al., 2020), indicating the lack of a clear and common definition and standardization, which is investigated within this work. Finally, the results of this analysis provide insights into potential synergies between the two paradigms, thereby identifying future research opportunities.

Related works

Several reviews have been published aiming to describe characteristics of Industry 4.0-related paradigms and/or a comparison thereof. de Paula Ferreira et al. (2020) review simulation in Industry 4.0 and derive “design principles” to classify simulation paradigms, including Agents and DTs. However, they do not conduct a detailed comparison between Agents and DTs (de Paula Ferreira et al., 2020). Müller et al. (2021) present a list of abilities and characteristics to describe industrial autonomous systems, which is then used to exemplify a single use case (Müller et al., 2021). Jones et al. (2020) aim at characterizing the DT paradigm in depth, providing a description of its characteristics, but do not apply the list to compare different DT applications (Jones et al., 2020). Cohen et al. (2023) conduct a review of artificial intelligence applications in DTs and offer recommendations on which artificial intelligence techniques are suitable for different purposes of DT applications. Agents are not within the scope of Cohen et al. (2023). Lehmann et al. (2023) explore the symbiotic relationship between intelligent DTs and MAS. Lehmann et al. (2023) highlight the similarities in terms of specific characteristics of MAS, such as being active, online, goal seeking, and anticipatory. However, their comparison between DTs and MAS is qualitative, and no systematic literature review has been conducted.

Furthermore, research has been devoted to analyzing the alignment of Agents and DTs with RAMI 4.0. Verbeet and Baumgärtel (2020) propose an approach to implement an autonomous Industry 4.0 system using Agents and classify the mandatory and optional capabilities of Agents base on the “layers” and “hierarchy levels” axes of RAMI 4.0. Melo et al. (2023) align the DT concept with the different dimensions of RAMI 4.0 and consider the capabilities of MAS to support the development of Digital Twin ecosystems.

As mentioned earlier, numerous literature reviews exist, but none conduct a systematic comparison of Agents and DTs to identify differences, similarities, and synergies as well as common purposes.

Table 1 Search query used to identify relevant records

Contributions of this systematic comparison

The analysis of related works reveals that the research questions stated in “Motivation” section have not been addressed in previous studies. Therefore, this systematic comparison of Agents and DTs offers the following contributions:

  1. 1.

    Systematic literature review following the most recent Preferred Reporting Items for Systematic Reviews and Meta-Analyses (PRISMA) statement to identify relevant reports concerning the application of Agents and DTs in industrial automated production systems.

  2. 2.

    Comparison of Agents and DTs based on their capabilities and properties as well as their allocation within RAMI 4.0

  3. 3.

    Analysis of use cases of Agents and DTs including the determination of the purposes for which they are applied.

  4. 4.

    Identification of differences, similarities, and synergies between Agents and DTs.

The remainder of this work is structured as follows: “Methodology” section describes the methodology employed to (1) identify relevant reports regarding industrial applications of Agents and DTs and (2) systematically compare them. “Results” section presents the results of the systematic literature review, along with the determination of system properties, capabilities, and purposes. In “Discussion” section, a discussion is conducted on the applied methods and the presented results.

Methodology

The methodology applied in this work consists of two main parts: a systematic review process to identify relevant reports from the existing literature (“Methodology” section) and a method to facilitate a systematic comparison of the reports identified within the systematic review process (“Methodology for the systematic determination of capabilities, properties, and purposes” section and subsequent sections).

Method of the systematic literature review

The systematic literature review adheres to the PRISMA 2020 statement (Page et al., 2021). The following sections outline the search query, the reviewing process, and the criteria for excluding or including records. In this context, the term “record” refers to publications obtained from the search engine based on the search query, while “report” refers to those records that met the inclusion criteria and were subsequently analyzed concerning the application of Agents or DTs.

To identify potentially eligible records, a systematic search was conducted on June 15, 2023, using the Scopus © database (Elsevier, 2022). The search query is presented in Table 1.

Results were limited to English and German publications. Since the Industry 4.0 framework was introduced at the Hanover fair in 2011 (Geisberger et al., 2011), records dating back to 2011 were included. However, this literature review focuses on recent works in the field, i.e., published in the last 5 years, to provide an up-to-date analysis. This is also due to the fact that a significant rise in publications in the field can be seen starting from 2017. To achieve this up-to-date result, the latest works from 2023 are included without limitations on citations since these works have not had time to accumulate such citations. With an increase in the years since publication, an increasing threshold on the number of required citations was applied to ensure that only high quality research is included. Further, to achieve the focus on recent research, a significant increase, i.e., from 5 to 10, in the applied threshold for required citations of works that are older than 5 years has been applied (see Table 2). However, instead of excluding such works all together, the latter acknowledges that the survey should include seminal works from the period of 2011 to 2016 that had a significant influence on the field.

Table 2 Minimum number of citations by year of publication

The reviewing process was structured into three phases, with the same inclusion and exclusion criteria applied throughout the entire process. For a record to be included, it had to fulfill all of the following criteria; otherwise, it was excluded. These three inclusion criteria were evaluated simultaneously:

  1. 1.

    The record must primarily focus on either Agents or DTs.

  2. 2.

    The area of application must be industrial automated production systems.

  3. 3.

    The record must include an application, implementation, or case study. Purely conceptual or review papers were excluded. The implementation described should provide sufficient detail to allow characterization using the method presented in “Methodology for the systematic determination of capabilities, properties, and purposes” section.

The records underwent analysis in three consecutive phases. In phase 1, only the title of each record was analyzed to eliminate those that were clearly out of scope. During phase 2, the title, abstract, and keywords of the remaining records were screened. Possible ratings included 0 (clearly out of scope), 1 (unclear), and 2 (criteria for inclusion fulfilled). If a record received a rating of 1, it was independently screened again by a different reviewer. For all records rated 1 or 2 in phase 2, the full text was obtained and analyzed in phase 3. The analysis followed the method outlined in “Methodology for the systematic determination of capabilities, properties, and purposes” section. If the analysis of the full text revealed that any of the inclusion criteria were not met, the respective record was excluded during the full text screening in phase 3.

To prevent bias during the first two phases, information regarding authors, journals/conferences, and citation counts was omitted. Additionally, different reviewers were assigned to screen each record in every phase. To ensure consistent decisions regarding inclusion and exclusion, a joint analysis of at least 15 reports was conducted in each phase. Furthermore, 10% of the included reports were independently screened by two reviewers in phase 3.

Methodology for the systematic determination of capabilities, properties, and purposes

To compare Agents and DTs, the purposes, properties, and capabilities of both paradigms were used as comparative aspects. Table 3 presents the definitions of the terms purpose, property, capability as interpreted in this work. The terms have been adapted from model-based systems engineering (Weilkiens, 2014) serving to characterize complex systems. In this context, the term property is utilized to characterize the system in its target state, offering an overview or abstract representation of the system’s potential in achieving its intended objectives. On the other hand, capability, synonymous with functionality, delineates the specific skills or abilities that the system must possess, and thus, is more granular and precise compared to properties.

Table 3 Definition of purpose, properties, and capabilities

The sets of capabilities, properties, and purposes which were attributed to the applications of Agents and DTs were derived from a literature analysis and iteratively adjusted in the initial screening. In total 26 capabilities were identified as well as four properties and nine categories of purposes. The process of identifying the capabilities, properties, and purpose of the Agents and DTs in a given report is described in “Identification of capabilities”, “Identification of properties”, and “Identification of purposes” sections, respectively.

Identification of capabilities

The capabilities of Agents and DTs were identified based on text passages describing the implementation of the paradigms in the corresponding reports. This analysis focused on the description of the actual implementation of Agents and DTs, usually found in the methodology section of a report. This means that capabilities which were addressed in the introduction of a report but not in the section describing the actual implementation were not considered.

The set of capabilities exhibited by Agents and DTs is presented in Table 4. Initially, the set of capabilities was iteratively adjusted to encompass all the functions and behaviors of Agents and DTs. The set of capabilities was created irrespective of the paradigm, i.e., no capability is considered as exclusive to one paradigm a priori. The exact definitions of the capabilities as applied in this work are stated in Tables 6, 7, 8 and 9.

The following text passage provides an example of identifying the capabilities exhibited by a MAS. In the report, Agents are utilized as a decision support for human operators in planning the adaptation of production resources. The passage describes how different options for adapting the production resources to fulfill a given product order are presented to a human operator: “To support the decision maker in the last phase of the adaptation process, the previously found adaptation options (’ flexibility ’) are compared by their KPIs and only a preselection (’ prioritization ’) of adaptation options is given to the user (’ human interaction ’) who is then capable of selecting the most appropriate one depending on the context. The preselection is done via negotiation (’ negotiation ’) between the different debater roles (’ internal system interaction ’) in the Agent system. Suitable approaches for that can be found in the field of multi criteria decision making (’ decision making ’). In most cases, the selected adaptation option by the user requires the adaptation (’ adaptability ’) of the underlying model of the manufacturing machines.” (Marks et al., 2017) It is worth noting that this text passage contains a notably high number of indicators of capabilities and was chosen for illustrative purposes.

It was assumed that the MAS described above exhibits the capabilities highlighted in the quoted text passage. No distinction was made regarding the level of detail or the number of indicators for each capability. Thus, an application of Agents or DTs either exhibits a given capability (1) or does not exhibit it (0).

Table 4 Capabilities for the comparison

Identification of properties

Four properties were evaluated: autonomy, intelligence, sociability, and fidelity, as defined in Table 5. The decision to differentiate between intelligence and autonomy as separate categories was made due to the fact that there can be technical systems that exhibit intelligent and partially autonomous behavior while still being fully controllable by external sources (see also (Müller et al., 2021) for this distinction). However, the fact that a system is controllable from the outside contradicts common definitions of autonomy. Another example is an autonomous system that is unable to cooperate with its peers. The system might be very autonomous in the sense that it can handle unforeseen events etc. but it lacks capabilities in the area of sociability. Thus, it makes sense to analyze sociability and autonomy separately. The different properties are not mutually exclusive and partially overlap with each other. The chosen properties are not exclusively associated with one paradigm, e.g., intelligence, which is a key property of Agents, is also increasingly exhibited by DTs (Ashtari Talkhestani et al., 2019).

Table 5 System properties and their descriptions
Table 6 Capabilities within autonomy
Table 7 Capabilities within sociability
Table 8 Capabilities within intelligence
Table 9 Capabilities within fidelity

To facilitate a comparison of the properties of Agents and DTs on an ordinal scale, a property fulfillment score is introduced and determined. The property fulfillment score is calculated based on the previously defined set of capabilities. For each capability, it was determined whether and to what extent it contributes to the fulfillment score of a given property. This determination is inspired by established measures of the degree of autonomy in technical systems, which are described in the following paragraphs. Similarly, the relevant capabilities for the properties of intelligence, sociability, and fidelity could be determined and their contribution to the property fulfillment score could be established.

There are several attempts, mainly domain dependent, to determine the autonomy of a system:

In 2007, the “ALFUS framework” was published, which defined a framework for autonomous capabilities of unmanned aerial systems. Huang et al. (2007) propose measuring the autonomy of the system based on the decreasing level of human intervention required. The levels of robot autonomy, described by Beer et al. (2014), consider the degree of required human intervention in sensing, planning, and acting primitives. The authors propose nine levels, ranging from manual task execution to full autonomy within each of the primitives (Beer et al., 2014).

The SAE standard J3016 describes different levels of autonomy for autonomous driving systems (Shi et al., 2020). These levels are based on the allocation of responsibility and task execution that are either executed by the driver or the automation system. The tasks include motion control, sensing of the environment and all necessary sub-tasks to fulfill a secure transportation mission. The increasing autonomy mainly refers to the decreasing reliance on a driver as a fallback solution in case of unforeseen events and operating in unfamiliar domains. The levels range from 0 (no driving automation) to 5 (full driving automation). The “Industry 4.0 platform” has adapted these levels to industrial production applications, focusing on the degree of required assistance by the user as well (Federal Ministry for Economic Affairs and Energy, 2019). Therein, the “Industry 4.0 platform” provides an indication of which capabilities a system requires to achieve a given level of autonomy.

The above mentioned frameworks associate the level of autonomy that a system exhibits with the degree to which the respective system is capable of executing autonomous behavior. Thus, as a higher level of autonomy requires more complex capabilities, a system with a higher level of autonomy reflects the possession of more complex capabilities. To avoid the pitfall of associating a high level of autonomy to a system that possesses a number of complex capabilities but lacks other, more basic ones, the methodology of this contribution directly maps the capabilities to so called Complexity Degree (CD). The CDs are integers from 0 to 5. They rate how complex the implementation of a given capability is and how strongly it contributes to the property fulfillment score of each of the four properties. Thus, each capability is analyzed in the context of each property. It should be noted though that each of the four subsets of the different capabilities and associated CDs does not contain all 26 capabilities, since not every capability is relevant to the respective property.

The fulfillment score of a property is calculated using Eq. (1). In Eq. (1), x is the number of capabilities which contribute to the property, \(n_i\) represents the complexity degree of the i-th capability, and \(cap_{p,i}\) takes the value 1 if the i-th capability is exhibited by the Agents or DTs in report p and takes the value 0 otherwise. This means that a property fulfillment score of 100% is achieved if all capabilities contributing to the property are exhibited by the Agents or DTs in a given report.

$$\begin{aligned} \frac{1}{\sum \limits _{i=0}^{x} n_i} \cdot \sum \limits _{i=0}^{x} \text {cap}_{p, i} \cdot n_i \end{aligned}$$
(1)

In the following, each property and the underlying capabilities that contribute to property fulfillment scores of all four properties are described and assigned a CD based on a short literature analysis.

Autonomy and its associated capabilities

The set of capabilities which contribute to the autonomy of a system is based on the definitions reviewed by Müller et al. (2021) who argue that the absence of human intervention, self determination, perception of the environment, decision making capabilities, goal orientation and the ability to adapt as well as proactive planning and execution of actions are important aspects of a system’s autonomy. This is in line with other definitions, for example in Vogel-Heuser et al. (2020) or Meyer et al. (2009). Therefore, the capabilities listed in Table 6, sorted by their associated CD, have been extracted from the literature as contributors to autonomy.

Sociability and its associated capabilities

Distributed systems should possess the ability to optimize their interactions and communicate with other systems. According to Wooldridge (2002), sociability can be classified into different degrees, which determine the complexity of the capabilities described in this work. Table 7 defines the set of capabilities of sociability.

Intelligence and its associated capabilities

Meyer et al. (2009) provide an overview of intelligent products, define intelligence and present levels for the classification of intelligent products. In this work, these findings are generalized to intelligent production resources. The minimum requirement to be classified as intelligent is the ability to handle information sent by external systems, such as sensors. Products that can derive problems from the received information are classified at the second level of intelligence. At the third level, resources possess the degree of intelligence to manage themselves and make their own decisions. These levels of product intelligence serve as a basis for determining the CD of the capabilities associated with intelligence. Table 8 defines which capabilities constitute intelligence.

Fidelity and its associated capabilities

Both Agents and DTs should possess the ability to accurately analyze, describe, and reflect the physical system(s) they are associated with. This often requires a high fidelity of models used to describe the system. Table 9 lists the capabilities of which fidelity consists.

Identification of purposes

The utilization of Agents or DTs in the context of production always serves a specific purpose. Therefore, based on the literature analysis presented, several categories of purposes were identified. These categories are the main focus when applying Agents and DTs in the context of industrial automated production systems. The analysis conducted for the first research question (“Motivation” section) aims to identify whether certain purpose categories are more commonly associated with one paradigm or the other. This will subsequently address possible correlations between required capabilities and specific purpose categories. The analysis primarily focuses on the life-cycle phases of commissioning and operation of existing applications of Agents and DTs. To identify these purpose categories, an initial list was compiled based on the existing literature, and this list was further extended and updated during the initial screening process.

For the initial definition of purpose categories, the work of Leitão (2009) and Colombo (1998) was consulted. These works introduced an architecture for manufacturing control systems, and the individual elements of this architecture were interpreted as purposes in this work. The major purpose categories in manufacturing control systems include planning, scheduling, dispatching and control. Typically, these tasks are executed sequentially in the mentioned order. Monitoring is used to track essential information about processes, products, and resources from the shop floor or manufacturing system. This information is then used as a feedback loop to improve planning, scheduling, dispatching, and control activities (Leitão, 2009). The results of monitoring can also be utilized for diagnosis, error identification and correction, and predictive maintenance which are grouped under the category diagnosis and fault management. Other purpose categories frequently associated with Agents and DTs, but not covered by Leitão (2009), include user assistance, virtual commissioning, and general process optimization. Table 10 provides a summary of the individual purpose categories along with a brief description. The purpose of an application involving Agents or DTs was determined by the reviewer after thoroughly reading a report and assigned to one of the nine purpose categories listed in Table 10.

Table 10 System purposes and their descriptions

Allocation of applications in the reference architecture model industry 4.0

Fig. 1
figure 1

RAMI 4.0 (DIN, 2016)

The use cases described in the reports were further classified based on the individual elements of the axes of RAMI 4.0 (DIN, 2016) (see Fig. 1). This classification was performed to enable both the categorization and aggregated comparison of all use cases for each paradigm.Since both Agents and DTs are widely applied in Industry 4.0 use cases, RAMI 4.0 serves as a suitable framework for classifying these paradigms. RAMI 4.0 allows for the description of an Industry 4.0 component throughout its lifecycle within the context of Industry 4.0 (DIN, 2016). The classification within RAMI 4.0 aims to reveal insights about frequently mentioned layers, hierarchy levels, and life cycle phases in applications of each paradigm. A comprehensive description of the six layers, seven hierarchy levels, and four life cycle phases can be found in (DIN, 2016).

To allocate reports describing the application of Agents and DTs within RAMI 4.0, an examination was conducted to determine whether a specific element of one of RAMI 4.0’s three axes was relevant to the application described in the respective report. For example, a single element could be the functional layer, at the work center level in the usage phase of an instance. If a particular element was identified as being of significant importance for the use case, it was marked as the “focus element”. Elements that were addressed within a report but were not considered the focus of the report were still extracted to analyze the application of the paradigms across all elements.

As the focus of this work is the analysis of implemented Agents and DTs, all implementations are considered to be instances. Therefore, the elements of the “Lifecycle & Value Stream” axis are always considered as an instance in the usage phase. Consequently, the lifecycle and value stream axis was not explicitly extracted from the reports and will not be discussed in (“Results” section).

Results

This section presents the results of the analysis conducted using the information extracted from the reports included in the systematic literature review. “Results of the systematic literature review” section provides a brief overview of the results obtained from the literature review. Subsequently, “Capabilities and properties of agents and digital twins” section describes the capabilities and properties associated with the applications of Agents and DTs within the reports. “Allocation of paradigms’ applications in RAMI 4.0” section illustrates the placement of Agents’ and DTs’ applications within RAMI 4.0 and offers an interpretation of the findings. “Purposes of agents and digital twins” section compares system purposes of applications of both paradigms. A comprehensive overview of all the results can be found in Table 12.

Results of the systematic literature review

As shown in Fig. 2, the three-phase approach outlined in “Method of the systematic literature review” section, led to the exclusion of 981 records in phase 1 and 344 records in phase 2. The full text review in phase 3 resulted in the exclusion of 90 reports (R1: 41, R2: 27, R3: 22). Consequently, a total of 145 reports were included for the identification of system properties, capabilities, and purposes (as explained in “Methodology for the systematic determination of capabilities, properties, and purposes” section) and for the subsequent systematic comparison. Out of these 145 reports, 58 reports focus on Agents, 86 reports focus on DTs, and one report focuses on both Agents and DT (Xia et al., 2021). The analysis of the latter report was divided into separate analyses of the DT implementation and the Agent implementation, with the results being incorporated into the overall results of the respective paradigm (see also Table 12).

Fig. 2
figure 2

PRISMA 2020 flow diagram according to Page et al. (2021)

Capabilities and properties of agents and digital twins

The following sections describe Agent’s and DT’s capabilities (“Agents’ and digital twins’ capabilities” section) and properties (“Agents’ and digital twins’ properties” section).

Agents’ and digital twins’ capabilities

Figure 3 shows the rate of occurrence of the investigated set of capabilities in Agents and DTs. The rate of occurrence is calculated by dividing the number of implementations of one paradigm exhibiting a specific capability by the total number of reports focusing on that paradigm. For example, out of 59 reports presenting Agents, 12 implementations exhibit the capability of learning, which amounts to 21% as shown in Fig. 3.

Fig. 3
figure 3

Rate of occurrence of capabilities in Agents and Digital Twins

The analysis reveals that some capabilities are rarely found in both Agents and DTs. Proactivity, uncertainty handling, and mobility are capabilities that were scarcely addressed in the investigated reports. It should be noted that although Agents and DTs may possess these capabilities, the respective reports did not discuss them, suggesting that they may not be essential for the specific use-cases.

There are also sets of capabilities that are predominantly found in either Agents or DTs. A capability is considered “Agent-specific” or “Digital Twin-specific” if it occurs at a rate of 50% or higher in one paradigm but not in the other. Therefore, the Agent-specific capabilities include encapsulation, internal system interaction, context awareness, decision making, external system control, coordination, cooperation, and flexibility (in descending order). It is expected that Agents possess these capabilities, as they are commonly applied as MAS, thus requiring capabilities such as internal system interaction, cooperation, and encapsulation. However, the higher occurrence rate of the capability of external system control in Agents compared to DTs is surprising, considering that a two-way connection between an asset and a DT is often described as a requirement in DT definitions (Kritzinger et al., 2018; Tao et al., 2018).

The DT-specific capabilities include simulation, condition monitoring, real time capability, and visualization. This observation aligns with common definitions of DTs. However, it is surprising that the capability of simulation occurs in less than 70% of DT use cases, considering that simulation models are often regarded as a core feature of DTs. Additionally, the capability of adaptability occurs less frequently than expected, with a rate of less than 20%. This indicates the challenge of detecting asset changes and adjusting the DT accordingly to maintain accuracy. The same applies to the capability of synchronization, which occurs in less than 40% of cases.

Fig. 4
figure 4

Comparison of characteristic capabilities of Agents and Digital Twins

This initial analysis reveals substantial differences between the paradigms of Agents and DTs. However, it is also observed that almost no capability is exclusive to one paradigm (with the exception of mobility, which was identified in only three applications of Agents, as seen in Fig. 3). This observation is further supported by Fig. 4, which illustrates the occurrence of Agent-specific and DT-specific capabilities in both paradigms.

In the case of Agents, it is observed that the majority possess at least five out of the seven Agent-specific capabilities. Two-thirds of Agents possess at least one DT-specific capability. Similarly, approximately half of the analyzed DTs possess at least one Agent-specific capability. Five DT applications possess as many as five Agent-specific capabilities. The distribution of DT-specific capabilities among DTs is relatively uniform. Twelve DT applications possess no DT-specific capabilities, while 20 applications possess all four capabilities.

Agents’ and digital twins’ properties

Figure 5a illustrates the property fulfillment score of DTs for the mentioned properties based on the screened capabilities. It is evident that fidelity is the most distinctive property of DTs, as indicated by its median score of 56% and the top quartile at 77% fulfillment.

Fig. 5
figure 5

Properties of DTs and Agents and their fulfillment score

This aligns with the objective of DT applications, which aim to accurately represent physical assets. On the other hand, sociability has a low property fulfillment score, with a median of 9% fulfillment. This is consistent with the definition provided in “Digital twin paradigm” section, where social behaviors such as cooperation or communication are not emphasized. However, it is surprising that autonomy also has a low property fulfillment score. Definitions of DTs often emphasize their capability to independently interact with physical assets, but the screening results indicate that this is rarely implemented in practice.

Additionally, intelligence has a low fulfillment score, with its median being at 13%. Also in this case, definitions like (Tao et al., 2018; Reiche et al., 2021; Jazdi et al., 2021) claiming that intelligent components exist within the DT, differ from actual applications.

In contrast, Agents exhibit a different property profile, as shown in Fig. 5b. The majority of the screened applications only demonstrate limited fidelity, suggesting that Agents may not be capable of representing physical assets in great detail. However, in terms of intelligence, autonomy, and sociability, Agents exhibit significantly higher scores than DTs.

Table 11 Average fulfillment score of properties by complexity degree of capabilities

The data displayed in Table 11 shows for each property and paradigm to what extent the capabilities of each CD have been achieved. For example, a score of 16% for fidelity capabilities of CD1 means that on average an Agent-based implementation exhibited 16 % of the CD1 fidelity capability subset. As there is only the capability visualization for CD1 fidelity, this also means that 16 % of the Agent-based implementations exhibited this capability. However, in case of, for example, the two CD3 capabilities for the property autonomy, decision making and mobility, the average score of 39 % for Agents only refers to the average fulfillment and does not necessarily mean that 39 % of the implementations exhibited both capabilities. This is also visible in Fig. 3, showing that mobility is less frequent in Agents than decision making.

Sociability, which scored the highest with a median of 64 % in reports featuring Agents, is distributed similarly across the CDs (see Table 11). Therefore, most screened reports featuring Agents had the required basic capabilities for interaction, as well as more complex capabilities such as coordination and cooperation. This finding aligns with the definition provided in “Agent paradigm” section.

The results for intelligence in Table 11 show that Agents excel in medium complexity capabilities (CD 2 and 3) such as decision making and social aspects, which aligns with common definitions of Agents. However, higher CDs of intelligence, involving learning and a high degree of autonomy, are less frequently achieved. Nevertheless, Agents still score significantly higher in these more complex degrees of intelligence compared to DTs.

Autonomy, being one of the core properties of Agents, is also notably strong compared to DTs. However, it is important to note that most approaches exhibit low degrees of autonomy, while the higher degrees, consisting of capabilities like flexibility, proactivity, and adaptability, are rarely achieved (see Table 11). This may be due to the divergence between the definitions of Agents and the requirements of production, which was the focus domain of this work. In production environments, high degrees of autonomy commonly associated with Agent-based concepts are often neither desirable nor necessary. For example, in many applications, Agents are responsible for making decisions based on local information to control the routing of orders and workpieces through the system. Within such constrained environments, it is often enough to incorporate dynamic states of machines and the environment (e.g., context awareness) in simple rule based decision processes. This is in contrast to environments where the Agents encounter situations that are fully unknown and thus no general applicable rules can be provided, corrective actions cannot be taken in time by humans due to latencies, or the environment exposes the Agent to significant threats. In such cases, higher degrees of autonomy would be required. Generally, Agents exhibit significantly higher scores of autonomy compared to DTs.

Allocation of paradigms’ applications in RAMI 4.0

As described in “Allocation of applications in the reference architecture model industry 4.0” section, part of the analysis is to investigate which elements within RAMI 4.0 are addressed in the analyzed reports. The results of this analysis are presented in the following paragraphs.

Figures 6 and 7 show for each element of RAMI 4.0 at what rate the respective elements occurred as the focus layer of the reports (Figs. 6a, 7a) and at what rate the respective elements occurred within the reports, in general (Figs. 6b, 7b). The value of the rate of occurrence (indicated by the tone of the color in each RAMI 4.0 element) is determined by dividing the number of occurrences in the respective RAMI 4.0 element by the total number of reports focusing on the respective paradigm. The visualization of the results is adapted from Verbeet and Baumgärtel (2020).

Figure 6 presents the allocation of Agents’ applications in RAMI 4.0. Figure 6a displays only the focus elements, showing that the focus of Agents’ applications lies primarily on the integration to business layers as well as the field device, control device, station, and work center hierarchy levels. The most frequently selected focus element is functionalstation. The most frequently selected focus element is functional - station. The top three focus elements are found within the functional layer, specifically at the control device through work center hierarchy levels. The asset layer is never focused on, nor are the product and enterprise hierarchy levels, or the business layer. This indicates that Agents are primarily applied within industrial production for establishing communication between different units of a production system and executing functions to facilitate the correct operation of the system. Interestingly, the processing of information is rarely the focus of Agents, suggesting that Agents are commonly applied in industrial applications where information processing is either a relatively simple task or performed by means other than Agents. As Agents typically do not run on field devices or control devices (even though such applications exist (Ulewicz et al., 2012)), their frequent allocation to the station and work center levels are as expected.

Fig. 6
figure 6

Placement of Agents’ applications in RAMI 4.0

Figure 6b depicts the rate of occurrence of RAMI 4.0 elements without differentiation of focus element. It is evident that almost all Agents’ applications (94 %) feature the RAMI 4.0 element of functional - station, which is the most commonly selected element. The distribution of applications throughout RAMI 4.0 extends to the edges of the axes, indicating that the application of Agents requires consideration of many aspects within RAMI 4.0, essentially up to the enterprise level.

Figure 7 presents the categorization of DTs’ applications. As seen in Fig. 7a, DTs show a focus on the communication, information, and functional layers, as well as the station and work center levels. The combination of the work center level and the information layer appears to be particularly focused in DTs’ applications. However, the surrounding layers are also covered, and there is no layer or level that is not addressed.

Fig. 7
figure 7

Placement of Digital Twins’ applications in RAMI 4.0

Figure 7b displays the categorization of DTs’ applications, considering all elements. The results align with the focus element categorization, as the same layers and levels are most commonly found. However, there is a broader spread of observed RAMI 4.0 elements across both layers and levels, covering almost every combination of layer and level. This indicates a wide range of application areas for DTs.

Comparing the placement of Agents’ applications (Fig. 6) and DTs’ applications (Fig. 7) it is evident that both paradigms mainly focus on the communication, information, and functional layers, with a primary emphasis on the station and work center hierarchy levels. DTs’ applications exhibit a stronger presence towards the edges of the axes compared to Agents’ applications. Another interesting observation is that DT applications have a strong focus on the processing of information, whereas Agents’ applications are primarily focused on executing functions within the planning and control of production processes. This aligns with the definition presented in “Digital twin paradigm” section. However, it is worth noting that for Agents, one might have expected their cooperative and social nature to also benefit use cases in the field of information processing and management. Nevertheless, such use cases appear to be less common in practice.

Purposes of agents and digital twins

This section presents the results of the analysis regarding the purposes connected to the applications of Agents and DTs. As described in “Identification of purposes” section, nine categories of purposes have been identified that Agents and DTs are meant to fulfill: process optimization, user assistance, virtual commissioning, planning, scheduling, dispatching, control, monitoring, and diagnosis and fault management.

Figure 8 shows how often Agents and DTs are applied to serve a purpose each one of the nine categories. Five out of the nine purpose categories are predominantly addressed by one paradigm: user assistance, process optimization, and virtual commissioning are specific to DTs, while scheduling and dispatching are mostly associated with Agents.

Fig. 8
figure 8

Rate of occurrence of purpose categories in Agents and Digital Twins

On the other hand, planning, control, monitoring, and diagnosis and fault management are areas where both Agents and DTs are applied. This indicates that Agents are primarily used during the operational phase of a production system’s life cycle, whereas DTs are also utilized in the engineering and commissioning phases, demonstrating the wider range of use cases for DTs observed in “Allocation of paradigms’ applications in RAMI 4.0” section.

During the operational phase, Agents are employed for production planning, scheduling, and dispatching. Production control is a category that is evenly distributed between Agents and DTs. Monitoring tasks are predominantly performed by DTs, while diagnosis and fault management is carried out by both Agents and DTs, with Agents being the more commonly used paradigm. These observations align with the findings from “Capabilities and properties of agents and digital twins” section, which indicated that there are certain areas where Agents and DTs exhibit strong similarities, while other areas are more specific to each paradigm.

The observation that the DT paradigm is applied to industrial use cases spanning from the engineering phase (process optimization) to the operational phase (e.g., control) confirms the observations presented in “Allocation of paradigms’ applications in RAMI 4.0” section, indicating that the DT paradigm encompasses a wider range of elements within RAMI 4.0. Similarly, as discussed in “Allocation of paradigms’ applications in RAMI 4.0” section, Agents are primarily utilized for executing functions within the station and work center levels. By referring to Fig. 8, these functions can now be attributed to the domains of production planning, scheduling, dispatching, as well as diagnosis and fault management.

Discussion

This work presented a systematic comparison of applications of Software Agents and DTs in industrial automated production systems. The goal of this study was to identify similarities and differences between the two software paradigms, as well as potential synergies between them. The comparison was based on a systematic literature review conducted according to the PRISMA 2020 statement. The focus of the review was on actual implementations of Agents and/or DTs. The two software paradigms were compared in terms of their purposes, properties, and capabilities. Additionally, the study investigated which elements of the RAMI 4.0 architecture are addressed when applying Agents and DTs.

The remainder of this section discusses the methods employed in this study. Subsequently, the results are discussed, and future research opportunities are identified.

Discussion of applied methods

The set of reports analyzed in this study depended largely on the chosen search term and database. Scopus was used as experience shows that it yields the highest number of reports. The search term described in “Methodology” section was formulated to capture reports in the context of automated industrial production systems. Special care was taken to ensure a neutral phrasing of the search term to avoid any bias towards a specific software paradigm, application, use-case, or RAMI 4.0 element. The balance of 58 reports on Agents to 86 reports on DTs, with one report describing both an Agent’s and a DT’s application (Xia et al., 2021), indicates that this goal was satisfactorily achieved.

It is worth noting that the 3,063 results obtained from Scopus, out of which 1500 reports were excluded due to low citation count, represent only a fraction of all research conducted on Agents and DTs in the context of industrial automated production systems. However, it is estimated that increasing the number of reports would have a minimal impact on the quality of the results, as the sample size of 3,063 is already large and the presented results are clear, based on observations that are unlikely to be due to statistical variations. Therefore, future works on comparing Agents and DTs can utilize the same set of reports as this work or follow the same methodology as outlined in “Method of the systematic literature review” section. However, when applying the methodology in the future, the applied minimum number of citations per year should be carefully adjusted, if reports deemed eligible in this work are to be included even if their number of citations does not increase over time.

Another aspect of the literature review that can be discussed is the number of reviewers per report. Most reports were analyzed by one reviewer, with 15 reports being analyzed by all reviewers to align the review process before the major screening effort. Additionally, 10 % of the reports were analyzed by one additional reviewer after the initial screening to ensure consistency. The results of the comparative screening remained consistent throughout all phases. Minor deviations were observed between individual reviewers, such as whether an implementation of the DT had an individual capability or not. However, these deviations were uncommon, and the results consistently portrayed a clear picture across all reviewers. Having multiple reviewers with different specializations, as was the case in this study, helps prevent bias in a particular direction.

It was decided during the design of this study to only consider reports that presented actual implementations of Agents and DTs for industrial applications. This decision was made to obtain a sense of what kind of capabilities are required and useful in the industrial context and what capabilities are realistically implementable. Purely conceptual reports were omitted because it was deemed that concepts can be proposed irrespective of the need for a concept and its applicability in industrial practice.

To compare Agents and DTs, a set of capabilities, properties, and purposes was created. These sets were developed based on a review of existing literature.

The challenge in establishing a set of capabilities was to encompass all the tasks, features, and functions fulfilled by an Agent or DT. The review process demonstrated that the established list of capabilities adequately met this requirement, although it should be acknowledged that the set of capabilities could be adjusted by adding new elements or combining existing ones. An issue encountered in identifying the capabilities of a given application was that reports often use different terminology or certain capabilities were only implicitly observed. In such cases, the interpretation of the reviewer became necessary, and it was in these instances that most divergences between reviewers occurred. A crucial aspect of the chosen approach was to provide clear definitions for each capability, along with corresponding examples, to establish a common understanding among the authors and readers.

The applied methodology for the calculation of the fulfillment score of the different properties enabled a quantitative comparison of the paradigms. It was also utilized to outline the fulfillment of the different CDs within each property. However, the fact that the CD was also utilized to determine the weight of each capability for the property fulfillment score puts a strong emphasis on the more complex capabilities. This emphasis could also be put on the basic or fundamental capabilities, which would lead to different results. The argument for the latter approach would state that, for example, systems are not autonomous if they do not possess the basic capability of encapsulation. However, the analysis in this contribution followed established metrics like those described in “Identification of properties” section that emphasise more complex autonomous capabilities.

The set of nine categories of purposes presented in “Identification of purposes” section fully encompassed the purposes pursued by the analyzed reports. In some cases, multiple categories of purposes were assigned to a single application of Agents or DTs. However, this was only done in a few cases where the authors explicitly stated the application of a paradigm for multiple purposes. On the other hand, some authors did not explicitly state the purpose of the Agents and DTs they implemented. In such cases, the categories of purposes were identified based on general statements in the reports.

In addition to identifying the capabilities of different use cases, a classification of the two paradigms was made in accordance with the different dimensions of RAMI 4.0. The objective was to identify commonalities and differences. Since RAMI 4.0 is a widely recognized reference architecture for Industry 4.0, in which both paradigms are increasingly used, it provided an appropriate basis for the analysis. Furthermore, RAMI 4.0 was considered particularly suitable because it allows multidimensional classification along relevant dimensions. This multidimensional classification facilitated a more comprehensive understanding of these paradigms in relation to Industry 4.0. The use of existing standards to characterize the dimensions promoted the establishment of a uniform and consistent understanding of the paradigms. However, it should be noted that a certain bias is inherent in the fact that the analysis was limited to the dimensions prescribed by RAMI 4.0. This means that other relevant aspects may have been overlooked. Other architectural models and classifications could have also enabled the classification of the paradigms. Aheleroff et al. (2021) present an architectural model for the DT based on RAMI 4.0. Another architectural model worth mentioning is the “China Intelligent Manufacturing System Architecture”, which is very similar to RAMI 4.0. It consists of dimensions such as “System Hierarchy,” “Lifecycle,” and “Intelligent Functions,” and efforts are being made to align both architectures (Federal Ministry for Economic Affairs and Energy, 2018). It can be assumed that using this alternative reference architecture, which shares strong similarities with RAMI 4.0, would have yielded similar results.

Generally, it would have been desirable to compare the results of this systematic comparison of Agents and DTs with previous, similar works on the topic. However, as stated in “Related works” section, there exist no other similar comparison-based analyses to which the results could be compared. Despite the lack of comparable analyses, the methodology applied in this work is based on PRISMA and leads to reproducible results.

Discussion of results

The results revealed both differences and similarities between Agents and DTs. Regarding the capabilities of the two paradigms, it was found that simulation, condition monitoring, real time capability, and visualization are DT-specific capabilities that imply a high degree of fidelity. On the Agent side, the dominant capabilities are encapsulation, internal system interaction, context awareness, decision making, external system control, coordination, cooperation, and flexibility. This means that Agents tend to achieve medium to high scores in intelligence, autonomy, and sociability.

While both Agents and DTs exhibited paradigm-specific capabilities, all capabilities except for mobility occurred in both paradigms. Moreover, more than half of both Agents’ and DTs’ applications possessed capabilities that are characteristic of the other paradigm. This indicates that there is no clear boundary between the two paradigms.

The distribution of capabilities further demonstrated that the typical capability profile of Agents was more uniform than that of DTs. This uniformity was also evident in the analysis of the purposes for which Agents are applied. In the industrial context, Agents are predominantly utilized during the operational phase for planning, scheduling, dispatching, control, and diagnosis and fault management of production operations. This observation aligns with the allocation of Agents’ applications in the RAMI 4.0 architecture, which shows their primary deployment on the functional layer, specifically on computers near the production process (i.e., at the station and work center levels). This uniformity can be attributed to the existence of established tools and de facto standards in the field of Agents, such as the Java Agent Development Framework and the Agent Management Specification. This allows different MAS to exhibit a similar structure which also requires a similar capability set within MAS.

A different pattern emerged in applications of DTs. The purposes served by DTs were more diverse, ranging from the engineering phase (process optimization) to real-time control of production operations. The allocation of DT applications within RAMI 4.0 also demonstrated greater diversity, with a focus on information processing at the work center level. Furthermore, the set of capabilities found in DTs was less uniform compared to Agents, as less than half of the analyzed DT applications exhibited over 50 % of the DT-specific capabilities. All of these findings indicated comparatively less standardization and uniformity in DT applications within the industrial sector. This observation is consistent with the conclusions of other authors: There lacks, [...], a clear, encompassing architecture covering necessary components of a DT to realize various use cases in an intelligent automation system. (Ashtari Talkhestani et al., 2019). Whether this lack of standardization Similar to the Software Agents paradigm, there is a standard for DT implementations, the AASs. However, this work revealed that implementations of AASs are rare compared to custom DTs implementations.

Summarizing the results and discussion presented earlier, the most significant differences between Agents and DTs align with their definitions provided in “Agent paradigm” and “Digital twin paradigm” sections. It was observed that DTs are primarily applied to manage information related to production processes and present it to the user, while Agents are mostly employed to plan and manage the operation of production processes. This is evident from the purposes served by the paradigms (Fig. 8) and the capabilities they possess (Fig. 3). However, a clear similarity exists in that both Agents and DTs are software programs that run closely to and interact closely with production resources (Figs. 6, 7). On a more detailed level, it was observed that DTs primarily possess capabilities that enable them to accurately represent production resources, while Agents possess capabilities for cooperative planning and operation of production resources (“Capabilities and properties of agents and digital twins” section). This is also reflected in the properties observed in Agents and DTs (“Agents’ and digital twins’ properties” section). Furthermore, it is noted that there are almost no capabilities exclusive to either Agents or DTs, and they showed strong resemblance in terms of their ability to interact with other technical systems, interact with human operators, and learn about their environment. Additionally, Agents and DTs are applied for similar purposes, such as controlling production resources, planning, monitoring, diagnosing production operations, and managing faults.

Finally, several synergies that represent potential for future research activities can be highlighted. The concept of Industry 4.0 envisions the emergence of autonomous, interconnected, and intelligent production systems that access, exchange, and process large amounts of data (“Introduction” section) (Monostori, 2014). Analyzing this vision and the findings of this work, it is evident that aspects related to the autonomy, intelligence, and interconnection of production systems are primarily addressed by Agents. This includes capabilities encompassing these properties, such as cooperation, decision making, reactivity, and flexibility (“Identification of properties” section). It is worth noting that interconnection, which refers to the degree of interaction between (sub)systems, is part of the sociability property as interpreted in this work. On the other hand, other aspects of envisioned future production systems, such as providing real-time data, documenting the evolution of production (sub)systems, and processing information, are typically addressed by DTs. DTs commonly employ capabilities such as simulation, real-time capability, visualization, and condition monitoring. Based on the analysis presented in this work, it can be concluded that the capabilities and properties of both Agents and DTs should be combined to realize the current vision of future production systems. Agents and their corresponding methods, tools, and standards should be utilized for the autonomy-related and cooperative aspects of managing production systems, while DTs provide the necessary informational background and real time connections to their corresponding assets. The analysis of Agents’ and DTs’ properties supports this conclusion, as a combination of Agents and DTs promises high degrees of intelligence, fidelity, autonomy, and sociability. Therefore, the existing knowledge on Agents and DTs should be integrated to apply the two paradigms synergistically. Despite the potential of concurrent utilization of Agents and DTs, relatively little research has been conducted on this topic, as demonstrated in “Motivation”section. However, some works have already leveraged the advantages of both Agents and DTs, similar to the description above (Vogel-Heuser et al., 2021; Göppert et al., 2023). To promote this vision, the connection and collaboration of DTs and Agents need to be standardized.

Summarizing this section, this systematic comparison can serve as a starting point for establishing guidelines on how to apply Agents and Digital Twins in industrial production applications in the future. Based on this comparison and future research, roadmaps can be established for the co-evolution or even the prospective integration of the two paradigms. To foster potential synergies between the two paradigms, identified within this work, their interaction and integration should be standardized. Several purposes served by Agents and Digital Twins (e.g., planning, control, and diagnosis and fault management) have been identified, where experiences, pitfalls, and best practices established within one paradigm can potentially be transferred to the other paradigm.

Supplementary information

The dataset related to this article can be found at https://doi.org/10.5281/zenodo.8120624 (Reinpold et al., 2023).