Dynamic Optimization of Role Concepts for Role-Based Access Control Using Evolutionary Algorithms

Anderer, Simon; Kempter, Tobias; Scheuermann, Bernd; Mostaghim, Sanaz

doi:10.1007/s42979-023-01805-1

Dynamic Optimization of Role Concepts for Role-Based Access Control Using Evolutionary Algorithms

Original Research
Open access
Published: 24 May 2023

Volume 4, article number 416, (2023)
Cite this article

Download PDF

You have full access to this open access article

SN Computer Science Aims and scope Submit manuscript

Dynamic Optimization of Role Concepts for Role-Based Access Control Using Evolutionary Algorithms

Download PDF

Simon Anderer¹,
Tobias Kempter¹,
Bernd Scheuermann¹ &
…
Sanaz Mostaghim²

971 Accesses
Explore all metrics

Abstract

To ensure the security of current information systems, role-based access control (RBAC) is a widely used concept. For this purpose, based on an initial assignment of permissions to users, permissions are grouped to roles, which are then assigned to users. The corresponding (NP-complete) optimization problem, the so-called role mining problem (RMP), aims at finding a minimal set of roles and a corresponding assignment of those roles to users. Previously, the RMP has been considered as a static optimization problem. However, the application of RBAC in real business use cases requires the inclusion of dynamically occurring events that reflect changes in the business environment of companies, as well as events that result from direct user interaction with the role mining process. Therefore, in this paper, we provide a comprehensive overview and classification of the most relevant events for role mining and present methods for integrating them into the framework of an evolutionary role mining algorithm. The functional performance of these methods as well as the overall performance gain of dynamic role mining compared to the static approach is then examined in a series of experiments.

A review on genetic algorithm: past, present, and future

Article 31 October 2020

Exponential distribution optimizer (EDO): a novel math-inspired algorithm for global optimization and engineering problems

Article 30 January 2023

Evolutionary algorithms and their applications to engineering problems

Article Open access 16 March 2020

Use our pre-submission checklist

Avoid common mistakes on your manuscript.

Introduction

Nowadays, the security of IT systems in companies and organizations depends on a variety of factors. They must provide a good protection against external risks and attacks like malware and phishing. One threat to corporate IT systems that should not be underestimated is internal malicious use, whereby employees manipulate data and exploit their position in the company to enrich themselves. Such cases are referred to as occupational fraud. It has been shown that fraudulent or erroneous behavior by employees can lead to significant business losses [1].

One approach to addressing this problem is to apply thorough authorization management and access control mechanisms. One widely used approach to this is role-based access control (RBAC). Instead of assigning permissions directly to users, they are grouped into roles, which are then assigned to users [2]. The corresponding optimization problem is called the role mining problem, which is about finding a minimum number of roles and a corresponding assignment of permissions to roles and roles to users, and was shown to be NP-complete [3]. The underlying permission-to-user assignments are usually assumed to be invariant in time. However, this assumption does not match the requirements of real-world use cases. For instance, employees change positions and departments and join or leave the company. Such behavior leads to structural changes, meaning the assignment of permissions to users must be understood as a dynamic construct. Furthermore, users of role mining software should be able to influence the role mining process, e.g., by specifying preferences or by manually manipulating solutions. This leads to additional events that need to be dynamically included into the role mining process.

To address this, we have created a definition of the Dynamic Role Mining problem and explored different strategies for assigning roles to new employees of an organization [4]. The goal of this research is to provide a comprehensive overview of dynamically occurring events that are relevant in the context of RBAC and their integration into evolutionary algorithms for role mining. For events arising from structural changes of an enterprise, corresponding event handling methods are provided and investigated in detail. Moreover, all presented event-handling methods are evaluated in a series of experiments to examine the advantages of dynamic optimization compared to static role mining.

The remainder of this paper is organized as follows: Sect. “Role Mining in Static Environments” presents the basic role mining problem and corresponding solution strategies. In Sect. “The Dynamic Role Mining Problem”, we introduce to dynamic optimization problems in general before defining the Dynamic Role Mining Problem. Sect. “Role Mining In Dynamic Environments” provides a broad overview of different event types relevant for role mining and their integration into the framework of an evolutionary algorithm. Furthermore, for events triggered by structural changes in business environments, corresponding event-handling methods are presented. In section “Experiments and Evaluation”, we evaluate the results of the experiments performed. Section “Conclusion and Future Works” provides a conclusion of the presented research and points out avenues for future research.

Role Mining in Static Environments

This section introduces the Sect. “The Basic Role Mining Problem”, presents some of the most common solution strategies, and illustrates how evolutionary algorithms can be adapted to solve the RMP. These provide the framework for integrating the dynamically occurring events considered in the following sections.

The Basic Role Mining Problem

A first definition of the RMP was proposed by Vaidya et al. as Minimum Biclique Cover Problem [3]. However, for the application of evolutionary algorithms, the matrix-representation of the RMP is more suitable:

$U=\{u_1,u_2,...,u_m\}$ a set of $M=|U |$ users
$P = \{p_1,p_2,...,p_n\}$ a set of $N=|P |$ permissions
$R = \{r_1, r_2,...,r_k\}$ a set of $K=|R |$ roles
$UPA \in \{0,1\}^{M \times N}$ the targeted permission-to-user assignment matrix, where $UPA_{ij}=1$ implies, that permission $p_j$ shall be assigned to user $u_i$.
$UA \in \{0,1\}^{M \times K}$ a possible assignment of roles to users, where $UA_{ij}=1$ implies, that role $r_j$ is assigned to user $u_i$.
$PA \in \{0,1\}^{K \times N}$ a possible assignment of permissions to roles, where $PA_{ij}=1$ implies, that permission $p_j$ is assigned to role $r_i$.

Based on the presented matrix representations, a definition of the Basic Role Mining Problem in its matrix decomposition version can be provided.

The Basic Role Mining Problem (Matrix Decomposition Version)

Given a set of users U, a set of permissions P and a permission-to-user assignment matrix UPA, find a minimal set of Roles R, a corresponding role-to-user assignment matrix UA and a permission-to-role assignment matrix PA, such that each user has exactly the set of permissions granted by the UPA matrix:

$$\begin{aligned} {{\text {Basic RMP}}} = {\left\{ \begin{array}{ll} \text {min } &{} |R |\\ \text {s.t.,} &{} {\text {UA}} \otimes {\text {PA}} = {\text {UPA}}. \end{array}\right. } \end{aligned}$$

(1)

where $\otimes $ denotes the Boolean Matrix Multiplication:

$$\begin{aligned} ({\text {UA}} \otimes {\text {PA}})_{ij} = \bigvee _{l=1}^k(UA_{il} \wedge PA_{lj}). \end{aligned}$$

A role concept $\pi =\left\langle R^\pi , {\text {UA}}^\pi , {\text {PA}}^\pi \right\rangle $, consisting of a set of roles $R^\pi $, a role-to-user assignment ${\text {UA}}^\pi $ and a permission-to-role assignment ${\text {PA}}^\pi $, denotes a candidate solution for a given Basic RMP. It is called a feasible solution, if it satisfies ${\text {UA}}^\pi \otimes {\text {PA}}^\pi ={\text {UPA}}$. For the Basic RMP, in particular, a feasible solution is also denoted 0-consistent.

The objective of the role mining problem is to find a feasible solution that involves a minimum number of roles. An upper bound on the minimum number of roles can easily be calculated based on the given permission-to-user assignment matrix UPA. If the number of users is less or equal than the number of permissions $m \le n$, one role is created for each user, which is assigned exactly the permissions that are also assigned to the considered user. Hence, ${\text {UA}}=I_m$ and PA = UPA, where $I_m$ denotes the m-dimensional identity matrix. Since ${\text {UA}} \otimes {\text {PA}} = I_m \otimes {\text {UPA}} = {\text {UPA}}$, this solution complies with the 0-consistency constraint. Furthermore, $|R |= m$. If $n < m$, for each permission one role is created, which is assigned the considered permission only. The created roles are then assigned to users according to UPA. Hence, UA = UPA and ${\text {PA}} = I_n$. Since ${\text {UA}} \otimes {\text {PA}} = {\text {UPA}} \otimes I_n = {\text {UPA}}$, this solution also complies with the 0-consistency constraint. Furthermore, $|R |= n$. Therefore, the minimum number of roles is bounded by ${\text {min}}\{m,n\}$.

Figure 1 shows an example of the schematic representation of the UPA, UA and PA matrix. For better visualization, black cells indicate 1’s and white cells represent 0’s. This representation technique is also used in the further course of this paper to illustrate binary matrices.

There are many other variants in which the RMP is modified, for example by relaxation of the 0-consistency constraint or the inclusion of more business-driven objectives like administrative costs [5] or compliance aspects and license costs [6]. A detailed survey on the different variants of the RMP is provided by Mitra et al. [7].

General Solution Strategies

The role mining problem is a well-studied problem in literature. Accordingly, there are many established solution techniques. An overview of different RMP variants and solution strategies can be found in Ref. [7]. Therefore, only the most important contributions and solution strategies are summarized below. A widely used approach to tackle the RMP is to group permissions. From these groups a set of roles is created. Different methods are used for this grouping. These roles are then assigned to users to obtain appropriate role concepts [8,9,10,11,12]. Often, the role mining problem is approached by mapping it to other well-known problems in data mining, such as the minimum tiling problem [3], the Set Cover Problem [13] or the Minimum Biclique Cover Problem [14]. Other approaches are based on graph optimization [15] or formal concept analysis [16].

Evolutionary Algorithms for the RMP

Since the RMP is NP-complete, evolutionary algorithms (EAs) are a common strategy to search for good solutions. In particular, Saenko and Kotenko have published several approaches to coping with the RMP based on EAs [17,18,19]. Other publications that also use the concept of EAs for the RMP can be found, for example, in Refs. [20] or [21]. In our contribution, we also use an evolutionary algorithm called addRole-EA, as presented in Ref. [22], as the basis for integrating the event handling methods developed. Therefore, the following paragraph is devoted to a brief introduction to evolutionary algorithms in the context of the role mining problem.

EAs represent a population-based optimization strategy following the principle of survival of the fittest. At the beginning, an initial population of individuals, each representing one possible solution of the RMP is generated, e.g. at random. From this, the best individuals, in terms of a predefined fitness function, are selected for mutation (modification of the genome of an individual) and crossover (exchange of genetic information between individuals), which leads to the creation of additional individuals. For the Basic RMP, the number of roles $|R^\pi |$ of an individual constitutes its fitness value. Again, only the best individuals are selected to survive, thus being part of the next generation. This procedure is repeated iteratively, until a stopping condition is met (usually a maximum number of iterations, a maximum number of iterations without improvement of the global best fitness value or a given solution quality). A top-level description of a general evolutionary algorithm is given in Fig. 2. A more detailed introduction to evolutionary algorithms is provided, for example, in Ref. [23].

In the addRole-EA, each individual coincides with a possible role concept $\pi =\left\langle R^\pi , {\text {UA}}^\pi , {\text {PA}}^\pi \right\rangle $. Since in real-world use cases roles usually contain a small number of permissions and only a small set of roles is assigned to each user, it is logical to use the sparse format for matrices to represent UA and PA to save memory space. The encoding of an individual in the addRole-EA is illustrated in Fig. 3.

The unique feature of the addRole-EA is its addRole-method. This allows the addition of new roles to $R^\pi $, ${\text {UA}}^\pi $ and ${\text {PA}}^\pi $ of an individual $\pi $, such that the 0-consistency constraint is fulfilled at all times. Subsequently, all roles, that became obsolete by the addition of the new roles, are deleted from the individual, ideally resulting in a reduction of the total number of roles. For mutation, new roles are created (e.g. by the intersection of the permission sets of different users or by merging or splitting of old roles), which are then assigned to individuals by means of this method. In recombination, the roles of different individuals are exchanged, again using the addRole-method. The addRole-EA is a steady-state evolutionary algorithm. For replacement an elitist selection scheme is applied based on the total number of roles (Basic RMP). A detailed description of the addRole-EA is provided in [22].

The Dynamic Role Mining Problem

This section introduces to the Dynamic Role Mining Problem. At first dynamic optimization problems and corresponding sources of dynamics are defined in general. Subsequently, a definition of the Dynamic Role Mining Problem is derived.

Dynamic Optimization Problems

Dynamic optimization problems are characterized by objective functions or restrictions that change with time. These changes are triggered by dynamically occurring events with direct or indirect influence on the specifications of the optimization problem or the associated optimization process.

One possible source of dynamics consists in events triggered by external factors. A good example of this are tour and route planning problems. If the corresponding Vehicle Routing Problem is not adapted properly to dynamically occurring delivery requests or cancellations and changing travel times between destinations due to uncertain and varying traffic conditions, significantly worse or even infeasible optimization results are obtained [24].

Another source of dynamics consists in the interaction of users of optimization software, so-called decision makers (DM), with the corresponding optimization process. To classify events triggered by the interaction of a DM with an optimization process, König and Schneider distinguish between direct and indirect manipulations [25]. Direct manipulations imply the modification of solution candidates, while indirect manipulations comprise changes of optimization objectives or constraints as well as adaption of parameters of the applied optimization algorithm. Nascimento considers interaction possibilities with evolutionary algorithms for dynamic optimization. Additionally to direct and indirect manipulation, it is allowed for dynamic focusing on manually chosen sub-problems Furthermore, interaction possibilities aiming at the specifications of evolutionary algorithms, like deliberate inclusion of certain individuals into the population of an evolutionary, algorithm are presented [26].

A survey on optimization in dynamic environments is provided by Cruz [27] offering the following formal definition of a dynamic optimization problem in its most general form:

$$\begin{aligned} {\text {DOP}} = {\left\{ \begin{array}{ll} \text {optimize } &{}f(x,t)\\ \text {s.t.,} &{}x\in F(t)\subseteq S, t\in T. \end{array}\right. } \end{aligned}$$

where:

$S\in {\mathbb {R}}^n$, S is the search space.
$t\in T$ is the time.
$f: S\times T\rightarrow {\mathbb {R}}$ is the objective function, that assigns a numerical value (f(x, t)) to each possible solution $(x\in S)$ at time t.
F(t), is the set of feasible solutions $x\in F(t)\subseteq S$ at time t.

It can be seen that in this definition the objective function and the constraints are time-dependent. In the following, this concept is applied to role mining resulting in a definition of the Sect. “The Dynamic Role Mining Problem”.

The Dynamic Role Mining Problem

Although it would appear natural due to the multitude of changes that occur in business environments, there is little research on dynamic role mining. A first approach was presented by Saenko and Kotenko [19]. Changes concerning the assignment of permissions to users are aggregated into a matrix ${\text {UPA}}_1$. After a certain period of time, the ${\text {UPA}}_1$ matrix is compared to the original user-permission-assignment ${\text {UPA}}_0$ and the corresponding role concept $\pi _0 = \left\langle R_0, {\text {UA}}_0, {\text {PA}}_0 \right\rangle $, which is currently implemented at the considered company. Based on that, the so-called RBAC Scheme Reconfiguration Problem is defined. It consists of finding a new role concept $\pi _1 = \left\langle R_1, {\text {UA}}_1, {\text {PA}}_1 \right\rangle $, where ${\text {UA}}_1 = {\text {UA}}_0+\Delta {\text {UA}}$ and ${\text {PA}}_1 = {\text {PA}}_0 + \Delta {\text {PA}}$ such that:

$$\begin{aligned}&{\text {RBAC Scheme Reconfiguration Problem}}=\\&\quad {\left\{ \begin{array}{ll} \text {min } &{}\left\| \Delta {\text {UA}} \right\| _1 + \left\| \Delta {\text {PA}} \right\| _1 \\ \text {s.t.} &{}{\text {UA}}_1 \otimes {\text {PA}}_1 = {\text {UPA}}_1. \end{array}\right. } \end{aligned}$$

Hence, the RBAC Scheme Reconfiguration Problem is about finding a new role concept $\pi _1$ which fulfills the conditions defined by ${\text {UPA}}_1$ and contains as few changes as possible compared to the old role concept $\pi _0$. However, only permission changes of already existing employees are considered. New employees or employees leaving the company are not taken into account in this approach. Another disadvantage is the aggregation of changes over a certain period of time. Events such as the arrival of new employees or events triggered by user interaction, however, require an integration into the optimization process, if possible in real time. To reflect this, we provide a definition of the Dynamic Role Mining Problem:

$$\begin{aligned}&{\text {DynRMP}}=\\&\quad {\left\{ \begin{array}{ll} \text {min } &{}|R(t) |\\ \text {s.t.} &{} {\text {UA}}(t) \otimes {\text {PA}}(t) = {\text {UPA}}(t), t\in T. \end{array}\right. } \end{aligned}$$

Analogous to the definition of general dynamic optimization problems in Sect. “Dynamic Optimization Problems”, ${\text {UPA}}(t)$ as well as ${\text {UA}}(t)$ and ${\text {PA}}(t)$ are modeled time-dependent. Solving the DynRMP, therefore means finding an optimal role concept $\pi (t)=\left\langle R^\pi (t), {\text {UA}}^\pi (t), {\text {PA}}^\pi (t) \right\rangle $ for each point in time $t\in T$.

Role Mining in Dynamic Environments

In this section, we describe and classify the various events relevant to role mining. Subsequently, it is explained, how the presented events can be integrated into an evolutionary algorithm. For events emerging from structural changes in business landscapes, we present detailed event handling methods.

Events

In the following, the main events relevant for dynamic role mining are presented and classified. There are two different sources that can trigger events relevant in role mining: (1) interactions of a decision maker (DM) with the role mining software, (2) changes in the company’s structure or staffing.

Interaction Events

This section provides an overview of user interaction events. These events are given identifiers I01 to I17 listed in Tables 1, 2, 3 and 4. They can be classified into the categories defined by König and Schneider [25] or Nascimento [26] (see Sect. “Dynamic Optimization Problems”). The first category contains events which lead to a direct manipulation of the solution candidates (see Table 1). Generally, the main issue here is editing, adding or removing roles. (I01–I06). In the context of compliance regulations, it is possible that certain combinations of permissions must not be assigned to the same role. This is addressed by interaction I07, which allows a DM to manually specify or edit segregation of duty (SoD) conflicts, i.e. critical combinations of permissions. For more details about SoD-conflicts. Refer to Ref. [6], where benchmark instances are provided that include compliance restrictions. At this, critical combinations of permissions are aggregated in a compliance matrix C. To assess the severity of a SoD-conflict in C, a corresponding weight vector is introduced. From this, a compliance score can be calculated for each individual.

Table 1 List of events induced by manipulating solutions

Full size table

Another interaction possibility of a DM with the optimization process is to adjust the parameters of the evolutionary algorithm used. Furthermore, specific mutation or crossover operators, that have turned out to be strong in certain situations, can be selected manually. Since these indirect interactions strongly depend on the algorithm used, Table 2 provides only a few examples.

Table 2 List of events induced by adapting parameters

Full size table

If a DM is already satisfied with the optimization results achieved for certain areas, like the users of certain departments of the company or other sets of users, or if the DM would like to enforce the optimization in other areas, the focus of the optimization process can be adjusted. For example, the optimization focus could be set on a certain set of users (I14). In addition, a DM can exclude users and roles, that he or she is already satisfied with, from further role optimization and thus reduce the problem size. An overview of such interaction events is given in Table 3.

Table 3 List of events induced by adjusting the optimization focus

Full size table

EAs bear the risk of getting stuck in local optima. For this reason, it might be interesting to store some solution candidates from previous iterations. This way, it is possible to return to the stored solution candidates, thereby avoiding the necessity of a complete restart of the optimization process. In dynamic optimization in particular, the fitness landscape is subject to change over time. Hence, it is possible that some of the stored individuals may have better fitness values than the individuals of the current population. Therefore, injecting such stored individuals into the current population appears to be a promising approach. This is also referred to as memory-based evolutionary algorithms which have been covered in previous publications. A survey on memory-based evolutionary algorithms is provided by Branke [28].

Here it is proposed to adopt the concept of memory-based evolutionary algorithms to the domain of role mining with user interaction. In this scenario, the DM can store interesting role concepts into a so-called role concept repository. The DM may also analyze and possibly deploy them later. The resulting interaction possibilities are listed in Table 4. If we consider multi-objective role mining, in which the 0-consistency constraint is relaxed or business-driven objectives like license costs [6] or administrative costs [5] are included, further interaction events can arise like the weighting or ranking of the different optimization objectives or setting thresholds for certain objectives. However, this paper focuses on single-objective role mining problems, and such events are not examined any further.

Table 4 List of events induced using a role concept repository

Full size table

Structural Events

In addition to events triggered by a DM, there are events which are caused by changes in the structure of a company as employees change positions and responsibilities, or as they join or leave the company. An overview of such structural events (S01–S04) is shown in Table 5.

Table 5 List of events induced by structural change

Full size table

Since these events are described in more detail in Sect. “Event Handling”, they will not be discussed further at this point.

Event Handling

To process the events defined in Sect. “Events” close to real time, it is important to forward them to the optimization process immediately after occurrence. For this purpose, the iterative course of evolutionary algorithms is of great advantage, as it can be checked at the beginning of each iteration, whether one or more events are currently pending. If necessary, the corresponding event-handling methods can then be executed to adapt the individuals of the current population of the EA to the new conditions of the business environment. Figure 4 shows the alteration of the sequential process of the EA for the integration of the event-handling methods.

Aggregation of Users

There are some differences between static and dynamic problems regarding the encoding of role concepts. Most companies fill a majority of their positions at least twice to prevent hindrances in the operational process in case of vacation, illness or leaving of employees. Users, that are assigned the same set of permissions, are often aggregated to classes prior to the actual role mining process [13, 22]. Each class of users and the associated permission set can be represented by one row of the UPA matrix to reduce the problem size. To be able to identify all users even after aggregation, each user is assigned a unique UserID. Subsequently, the UserIDs corresponding to each user class are stored in a separate user mapping. After optimization, all users of one user class are assigned the same set of roles. In case of static role mining, especially in the presented Basic Role Mining Problem, the cardinality of these classes is disregarded. However, in the dynamic case, it plays an important role, as users join or leave the company over time. For this purpose, the cardinality of each user classes is added UserCount to each row of the UPA (and UA) matrix. Furthermore, a temporary users list $U_{\text {temp}}(t)$ is introduced. It can be considered a technical auxiliary tool to ensure that users, who are known to be leaving the company or changing positions, can still be provided with permissions for a certain period of time, independent of the ongoing role optimization process. The set of permissions of a user results from the permissions which he or she is assigned by the currently implemented role concept as well as from the permissions of $U_{\text {temp}}(t)$. In this way, the user known to be leaving the company can continue to do his or her work until the day of departure, without affecting the further role optimization process.

Figure 5 shows an example of a user mapping as well as the temporary users list $U_{\text {temp}}(t)$ corresponding to the exemplary UPA matrix of Fig. 1. It currently includes 7 users. The users u101, u102 and u103 belong to the first user class and are assigned $p_1$, $p_2$ and $p_3$. Users u104 and u105 belong to the second user class and are assigned $p_3$, $p_4$ and $p_5$. User u106 belongs to the third user class and is assigned permissions $p_4$, $p_5$ and $p_6$. User u107 will leave the company in the foreseeable future and is therefore no longer part of the optimization process, but is still assigned permissions $p_2$ and $p_4$ through means of the $U_{\text {temp}}(t)$.

In the following, event handling methods corresponding to the events S01-04 are presented with focus on the update of ${\text {UPA}}(t)$, ${\text {UA}}(t)$ and ${\text {PA}}(t)$. These are illustrated in the framework of a continuous example building on the situation in Fig. 5.

User Joins Company (S01)

The knowledge that a new employee will join a company triggers event S01 and the associated event handling method. Since it is usually known in advance that a new user will be joining a company, a distinction is made between the occurrence of the information and the actual entry of the user. To make the best use of this lead time, the event can be further segmented, such that the information on the future arrival of the new user is included into the optimization process as soon as it occurs, whereas the role concept is adapted when the new user joins the company, see Fig. 6.

To update the ${\text {UPA}}(t)$ matrix accordingly in S01a, it is necessary to distinguish between two cases:

Case 1: Permission set of new user equals permission set of existing user.

In case the new user is assigned exactly the same permissions as at least one of the already existing users, the processing of the event is straight-forward, as there is already a user class corresponding to the new user. Therefore, only the UserCount of the corresponding user class must be increased. ${\text {UPA}}(t)$, ${\text {UA}}(t)$ and ${\text {PA}}(t)$ remain unchanged.

An example for the handling of this case of S01a can be found in Fig. 7. In this case, a new user u108 joins the company, which was illustrated in Fig. 5, and is assigned permissions $p_1$, $p_2$ and $p_3$ and is thus categorized into the first user class $C_1$.

Case 2: Permission set of new user does not equal permission set of existing user.

If there is no user with the same permissions as the new user in the current company structure, a new user class must be created. This is done by adding a new row with $UserCount=1$ to ${\text {UPA}}(t)$ and ${\text {UA}}(t)$. The new row of ${\text {UPA}}(t)$ corresponds to the permissions assigned to the new user. The new row in ${\text {UA}}(t)$ contains the roles which are assigned to him or her. Again, two cases must be distinguished:

Case 2.1: Permission set of new user can be covered completely by existing roles.

In this case, existing roles are assigned to the new user to provide him or her with the required permissions. At this, under consideration of the 0-consistency constraint, all roles possible can be assigned to the new user. However, this may result in the user being assigned some permissions multiple times across different roles. Thus, it might be worthwhile to assign only a subset of those roles to the new user. To address this, we presented and investigated different strategies to assign roles to new users in [4].

Figure 8 shows an example of this case, where a new user u109, being assigned $p_1$, $p_2$, $p_3$ and $p_4$, joins the company. Since there is no other user being assigned the same permissions, a new user class $C_4$ is created for the new user. In addition, all permissions required for u109 can be covered by assigning roles $r_1$, $r_2$ and $r_4$.

Case 2.2: Permission set of new user cannot be covered completely by existing roles.

It is possible that, even after assigning all possible roles, regarding the 0-consistency constraint, permissions of the new user still remain uncovered. In this case, a new role must be created for the new user and added as new row to ${\text {PA}}(t)$ and as new column to ${\text {UA}}(t)$. The new role can either be assigned all of the new users’ permissions or only the permissions, which remain uncovered after assigning the existing roles, are used to constitute the new role. This is also examined in Ref. [4].

Figure 9 shows an example of a new user u110, being assigned permissions $p_1$, $p_3$ and $p_4$. Again, there is no other user, that is assigned the same set of permissions, such that a new user class $C_5$ is created for u110. By assigning role $r_2$ to u110, permission $p_3$ can be covered. Based on that, a new role $r_5$ is created containing the remaining, uncovered permissions $p_1$ and $p_4$ for user u110.

User Leaves Company (S02)

Analogous to the first event type, a distinction is also made in S02 between the occurrence of the event information and the actual exit of the employee. As soon as the information about the imminent departure of a user is transmitted, this information is included into the optimization process. For this purpose, the ${\text {UPA}}(t)$ matrix needs to be updated and the affected user is moved to $U_{\text {temp}}(t)$. As soon as the user leaves the company, he or she is removed from $U_{\text {temp}}(t)$ and the currently best role concept is implemented, see Fig. 10.

Again, the update of ${\text {UPA}}(t)$ required for S02a leads to the consideration of two different cases:

Case 1: Permission set of leaving user equals permission set of remaining user.

In this case, other users remain in the user class of the leaving user after his or her departure. The handling of this case is similar to case 1 of S01. ${\text {UPA}}(t)$, ${\text {UA}}(t)$ and ${\text {PA}}(t)$ remain unchanged. Only the UserCount of the user class corresponding to the leaving user must be reduced by 1. Additionally, the leaving user is moved to $U_{\text {temp}}(t)$.

Figure 11 shows the exemplary departure of user u102, belonging to user class $C_1$. Since there are other users in $C_1$, only the corresponding UserCount is updated and u102 is moved to $U_{\text {temp}}(t)$.

Case 2: Permission set of leaving user does not equal permission set of remaining user.

The case in which the leaving user is the only member of the corresponding user class requires more detailed examination. At first, ${\text {UPA}}(t)$ and ${\text {UA}}(t)$ have to be updated by removing the row corresponding to the user class of the leaving user. In addition, it must be checked, if there are roles uniquely assigned to the leaving user. If this is the case, the corresponding rows of ${\text {PA}}(t)$ as well as the corresponding columns of ${\text {UA}}(t)$ are removed. Again, the leaving user is moved to $U_{\text {temp}}(t)$.

Figure 12 shows the departure of u110. Since u110 is the only member of the corresponding user class, $C_5$ and the corresponding rows in ${\text {UPA}}(t)$ and ${\text {UA}}(t)$ are removed, while u110 is moved to $U_{\text {temp}}(t)$. In addition, role $r_5$, which is uniquely assigned to u110 can be removed from ${\text {UA}}(t)$ and ${\text {PA}}(t)$.

Change of Job Position (S03)

Another change in the structure of a company results from positions changes of users, which usually take place within the context of relocations and promotions. To enable a user to continue his or her previous work for a certain transition period, the permissions of the old job position must not be withdrawn immediately. At the same time, the user must already be assigned the permissions of the new job position to be able to perform the new tasks. This can lead to a state in which, a user is allowed to execute transactions, that may not normally be controlled by the same person. To mitigate these compliance conflicts, it is of highest importance to ensure that the permissions of the old job position are revoked after the transition period has expired. The sequential handling of S03 is illustrated in Fig. 13. At this, the mechanisms of S01 and S02 are used in parallel. As soon as the information about the impending change of position becomes known, the user concerned will be added to the optimization process using the methods of S01, but with unchanged UserID, while the user corresponding to the old job position is moved to $U_{\text {temp}}(t)$, again with unchanged UserID using the methods presented for S02. At the beginning of the transition period, the current best role concept is implemented. Hence, the user is assigned the permissions of the new job position directly from the implemented role concept. In addition, the permissions corresponding to the old job position are assigned to the user by $U_{\text {temp}}(t)$. At the end of the transition period, the user is removed from $U_{\text {temp}}(t)$ such that only the permissions required for the new job position remain in the user’s permission set.

Permission Request (S04)

In the day-to-day business of companies, it is possible that users lack certain permissions to perform the tasks at hand. If this is the case, a permission request can be submitted to report which permissions are required additionally. Subsequently, this is reviewed by a supervisor and either approved or rejected. Even if this event provides far less lead time compared to S01-03, it can be worthwhile to include it into the optimization process as soon as the permission request is submitted, see Fig. 14. The corresponding user in ${\text {UPA}}(t)$ is replaced by a new user with the requested additional permissions, but unchanged UserID, using the mechanisms of S01 and S02. At the time of the decision on the permission request, the currently best role concept $\pi ^*(t)$ can be implemented in case of approval. In case the permission request is rejected, the changes to ${\text {UPA}}(t)$ are revoked.

Experiments and Evaluation

In this section, the difference between dynamic and static optimization is discussed in more detail. In practice, once a good role concept has been found and implemented, the roles contained in this role concept usually remain unchanged over time. In many cases, this leads to a large number of unnecessary roles, which contradicts the minimization objective of role mining. In dynamic role mining, however, dynamically occurring events can be integrated into the optimization process, such that roles can be adapted to the new circumstances. In the following, this is examined for the four events triggered by structural change (S01-04) and the associated event-handling methods presented in the previous section.

Preparation of Benchmarks

The experiments were conducted on two benchmark instances of the PLAIN_x-Benchmark of RMPlib [6]. The first benchmark instance considered, PLAIN_small_02 (PS_02), includes 50 users and 50 permissions. The second benchmark instance, that we considered for our experiments, PLAIN_small_05 (PS_05), comprises 100 users and 100 permissions. The number of roles, which was used to create the benchmark instances, can serve as reference value and upper bound for the optimum number of roles. This amounts to 25 roles for PS_02, while 50 roles were used to create PS_05 [6]. To simulate users joining the company (S01) or new job positions (S03), the instances were initially reduced by a certain number of users. For this, users were randomly removed before each experiment was conducted based on uniform distribution until the desired number of users remained. The users removed from the benchmark instances and their associated permissions, can then be added as new users (S01) or considered to be associated to new job position for users that remained in the benchmark instances (S03). To simulate event S02, random users are selected who are to be leaving the company. For permission requests (S04), users request random subsets of the permissions, which are not assigned to them at the time of the event occurrence.