1 Introduction

ALs are flow manufacturing systems that are composed of stations and a material handling system. Products move between consecutive stations to be assembled. Specific tasks are performed at each station within a certain cycle time. Assigning tasks to stations to achieve certain objectives within a set of constraints is an assembly line balancing problem (ALBP).

ALs can be differentiated according to the assembled product variety, such as single-model, mixed-model and multi-model, and according to their layout, such as straight, U-shaped, two-sided, and parallel, etc. In parallel assembly lines (PAL), multiple ALs are positioned close and parallel to each other. When a single line is not sufficient to satisfy the demand, PAL is commonly used to increase capacity (Gökçen et al. 2006). The advantages of PAL over single assembly lines are i) improved productivity (fewer station allocations and reduction of idle time due to the use of common stations), ii) reduced failure sensitivity (when one line malfunctions, the rest of the lines may continue production) and iii) increased system flexibility (if the demand changes, each line can have a different cycle time) (Lusa 2008).

Advanced industrial organizations focus on producing multiple model types rather than concentrating on the production of a single product in order to meet a significant portion of customers’ needs (Hoseinpour et al. 2020, 2021). Mixed-model AL (MMAL) is widely utilized to meet various and changing market demands. The production characteristics of the models are similar. Therefore, setup time is not required or can be ignored between different product models. Two problems should be addressed in MMAL. These are the line balancing problem (assigning tasks to stations) and the model sequencing problem (determining the production sequence of the models). These two problems are closely related (Miltenburg 2002). The sequencing of the models impacts the efficiency of the line balance, whereas the optimality of the model sequencing impacts the line balancing results (Kim et al. 2006).

In mixed-model PAL (MPAL), each line has a set of model mixes that remain constant throughout production. Different model combinations may occur in different production cycles at a common station on two neighboring lines because of the manufacturing sequence of product models. Özcan et al. (2010) addressed this problem called the mixed-model PAL balancing and model sequencing (MPALB/S) problem. In the MPALB/S problem, line balancing and model sequencing (B/S) are dealt with simultaneously.

The integration of robots in manufacturing has seen a significant increase in recent years due to advancements in technology. These machines offer a range of benefits, including manufacturing flexibility and safety, improved product quality, enhanced precision and accuracy, and reduced dependence on skilled labor (Gao et al. 2009; Goel and Gupta 2020). However, they result in a rise in energy consumption, potentially causing harmful effects on the environment. In recent years, the rise in energy prices, disruptions in energy supply, legal regulations and climate change have highlighted the importance of energy conservation. Manufacturers are focusing on reducing energy consumption and developing more environmentally friendly production systems. This can be accomplished by implementing energy-efficient technologies, utilizing environmentally friendly energy sources and optimizing manufacturing processes.

In ALs, assembly tasks can be done by humans, robots or a collaboration of both. The assembly line where robots are used to perform assembly operations is called a robotic assembly line (RAL) (Chutima 2022). Robots with various technical specifications and capabilities may vary in processing time for a particular task. This also means that the energy consumed can vary between robots.

This paper introduces the mixed-model parallel robotic assembly line balancing and model sequencing (MPRALB/S) problem with energy consideration as a new research topic in the ALBP literature. Mixed-model parallel RAL (MPRAL) is a system in which robots replace human operators in the MPAL system. The MPRALB/S problem is an extension of the parallel RAL balancing (PRALB) (Çil et al. 2017b; Soysal-Kurt and İşleyen 2022) and MPALB/S (Özcan et al. 2010) problems, and its main characteristics are similar to them. The energy-oriented MPRALB/S problem deals with three sub-problems: i) the assignment of tasks to stations, ii) the assignment of robots to stations, and iii) the model sequencing with the lowest energy consumption. The goal is to find the best line balance while solving these sub-problems to optimize the objectives of the problem.

The authors have conducted an extensive review of the literature and found no research paper on MPRAL. Furthermore, there are very few studies that consider the energy consumption of mixed-model RAL (MRAL). Energy costs have an important place in operating costs today, it has become necessary to consider energy consumption as well as production efficiency. This paper addresses the MPRALB/S problem with energy consideration based on these findings.

The objectives of the problem are to minimize the joint cycle time and average energy consumption of MPRAL simultaneously when the number of stations is given. The MPRALB/S problem is illustrated with a numerical example. The problem has a large solution search space due to both robot and model diversity. As diversity increases, the area to be searched also increases exponentially. The problem is complex and difficult to solve. Therefore, a method based on NSGA-II is developed to solve the problem. An improvement is applied to the NSGA-II that eliminates solutions with the same objective function values. This new proposed version is called MNSGA-II. A benchmark data set for the MPRALB/S problem does not exist, hence new data sets are generated. The algorithm’s parameters are set through the Taguchi method. To assess the performance of MNSGA-II, NSGA-II (without improvement) and RSA are utilized.

In this paper, Sect. 2 gives a summary of the relevant research in the field. Section 3 outlines the problem and its assumptions and presents a mathematical formulation. Section 4 gives a numerical example to illustrate the MPRALB/S problem. Section 5 introduces the proposed MNSGA-II. Section 6 contains the results of the computational study. Finally, Sect. 7 gives conclusions and suggestions for future research.

2 Literature review

MPRAL is a mixed-model version of parallel robotic assembly lines. The PRALB problem (PRALBP) was first addressed in the literature by Çil et al. (2017b). Subsequently, Soysal-Kurt and İşleyen (2022) considered PRALBP with two objective functions. PRALBP is a new research topic and there is a growing interest in PRALBP in the literature. There is no paper in the literature that investigates the MPRAL system and MPRALB/S problem considering energy consumption.

MPRAL offers various benefits such as the simultaneous production of different product models on the same line, automated and faster operations compared to traditional assembly lines, and reduced labor and resource requirements due to common station utilization. Considering that energy expenses represent a significant cost factor for businesses, balancing MPRAL with a focus on energy consumption and achieving production with lower energy usage can support businesses in achieving their environmental sustainability goals and help them gain a competitive advantage. Considering all these factors, it is inevitable that balancing and sequencing problem of MPRAL with energy consumption will receive more attention from researchers and manufacturers in the future.

This section reviews the studies on MPAL, RAL, MRAL and energy consumption in RAL, which are the most relevant topics for this paper. Cycle time and energy consumption are represented by CT and EC, respectively.

Gökçen et al. (2006) introduced the parallel ALBP (PALBP), which focuses on simultaneously balancing multiple assembly lines using common resources. To solve the PALBP, they proposed a mathematical model and heuristic method. After this study, several research studies on PAL and their variants have been reported in the literature. The problem known as MPALB/S was introduced by Özcan et al. (2010). This problem is concerned with optimizing the efficiency of production lines and balancing the workload among stations. Chutima and Yothaboriban (2017) addressed MPAL balancing problem to optimize multiple objectives hierarchically. However, they did not consider the model sequencing problem. Kucukkoc and Zhang (2014a) addressed the mixed-model parallel two-sided AL balancing and sequencing (MPTALB/S) problem. To solve the problem, they used an agent-based ant colony optimization algorithm (ABACO). Kucukkoc and Zhang (2014b) proposed a mathematical model and an ABACO for the MPTALB/S problem. Kucukkoc and Zhang (2017) presented the mixed-model parallel U-type AL B/S problem and used a heuristic for this problem. A detailed review of the studies on PALBP is given by Aguilar et al. (2020).

The studies of Rubinovitz and Bukchin (1991) and Rubinovitz et al. (1993) are the pioneers of the robotic assembly line balancing problem (RALBP). Following these studies, many different versions and solution methods of RALBP have been developed in the literature. Levitin et al. (2006) and Gao et al. (2009) addressed the RALBP to minimize the cycle time. Aghajani et al. (2014) aimed to minimize the sum of the CTs of all models for a mixed-model two-sided RAL. Rabbani et al. (2016) investigated the multi-objective mixed-model U-type RALBP including CT, robot purchasing costs, robot setup costs and sequence dependent setup costs. Çil et al. (2017a) addressed a mixed-model straight RAL to minimize the sum of the CTs of all models. Li et al. (2018) focused on the B/S of a mixed-model straight RAL to minimize the makespan. Aslan (2023) developed a variable neighborhood search algorithm to minimize the CT for the mixed-model two-sided RALBP.

Nilakantan et al. (2015) proposed two models for straight RAL, energy-based and time-based. While the objective of the energy-based model is to minimize EC, the objective of the time-based model is to minimize CT. Li et al. (2016) studied the two-sided RAL balancing problem to minimize CT and EC. Z. Zhang et al. (2019) addressed the U-type RAL to minimize CT and EC. They presented a mathematical model and used a Pareto artificial bee colony algorithm. Soysal-Kurt and İşleyen (2022) developed a hybrid firefly algorithm to simultaneously minimize the CT and EC for the PRALBP. They also presented a mathematical model. Sun et al. (2020) developed a hybrid estimation of distribution algorithm to minimize the CT and EC in a straight RAL. Zhou and Wu (2020) focused on minimizing the number of station and EC in the straight RAL. The EC includes the energy consumption of the robots during operation and standby, as well as the energy consumption during the transportation of workpieces between stations and the changeover between tasks. Nilakantan et al. (2017) aimed to maximize line efficiency and minimize the carbon footprint caused by the EC of the robots for straight RAL. B. Zhang et al. (2021) studied the mixed-model U-type RAL B/S problem to minimize the makespan and EC. Belkharroubi and Yahyaoui (2022) addressed the mixed-model straight RAL to minimize EC. Zhang et al. (2020) studied the MMAL B/S problem to optimize EC and line efficiency. They proposed a mathematical model and cellular genetic algorithm. Ngampanich and Chutima (2022) (2022 focused on the MPAL balancing problem where tasks are performed by normal workers, disabled workers and robots. The problem has multiple objectives including the energy load variance of workers, tax savings and PM 2.5 emissions. They used the EC of the robots to calculate PM 2.5 emissions. However, they did not address the model sequencing problem. Studies dealing with energy consumption in assembly lines are illustrated in Table 1.

Table 1 Papers addressing energy consumption

3 Problem description

In the MPRAL system, a set of mixed-model straight RALs are arranged in parallel. The manufacturing processes of the product models are very similar. As in PAL, there are separate stations and common stations. During a production cycle, robots at separate stations work on a single model, whereas robots at common stations can work on two different models.

MPAL utilizes the advantages of PAL and MMAL. The MPRAL system proposed in this paper combines the advantages of RAL and MPAL. Some of these advantages include i) the ability to meet different customer demands and flexibility due to the ability to produce mixed product models in lines; ii) increased productivity and lower material handling costs due to the use of common resources at common stations; and iii) improved product quality and increased productivity due to the use of robots in assembly tasks.

A model combination that may emerge at a common station in a MPRAL may change in subsequent production cycles. The reason for this change is the order in which the product models enter the lines. In this case, it would be appropriate to consider the model sequence when balancing the lines. This is a MPRALB/S problem where balancing and sequencing problems are addressed together. As a result, more efficient solutions can be obtained by addressing these two problems together rather than separately.

Figure 1 depicts a MPRAL system. Three product models are produced in both lines. Stations 3 and 4 are separate stations. Stations 1, 2 and 5 are common stations, and during a production cycle, the robots operate on one model from each line. The product model processed at a station in a production cycle is replaced by the product model from the previous station in the next production cycle. In this case, the combination of the models occurring at a common station on parallel adjacent lines may vary.

Fig. 1
figure 1

The layout of MPRAL

In this paper, the objectives of the MPRALB/S problem are to minimize the average energy consumption of the lines and joint cycle time simultaneously, considering the model sequences, when the number of stations is fixed. The reason for considering the energy consumption as an average is that different model combinations occur at stations in different production cycles due to the production of mixed product models. In this case, the amount of energy consumed in each of the production cycles may be different. Therefore, this study considers the total energy consumption as the average of the energy consumption in all production cycles.

3.1 Problem assumptions

The following are the assumptions of the MPRALB/S problem considered in this study (Kucukkoc and Zhang 2014a; Nilakantan et al. 2015; Özcan et al. 2010; Soysal-Kurt and İşleyen 2022):

  • Production is carried out with two or more parallel straight RALs and begins and ends at the same time on all lines.

  • The production characteristics of the products are similar. There are multiple product models in each line.

  • Each line has a set of product models, and this set remains constant throughout production.

  • It is possible to work on both sides of any line.

  • The model sequences are based on the concept of the minimum part set.

  • The precedence relations of the product models are known. For each line, a combined precedence relations diagram (Macaskill 1972) is used. A combined precedence relations diagram \(\left({P}_{h}\right)\) is created by combining the precedence relations diagrams of all models belonging to a line.

  • When assigning tasks to stations, the precedence relations must be met.

  • The workload of a station cannot exceed the joint cycle time.

  • Common tasks of the product models on a line must be assigned to the same station. Some tasks may have different processing times or be equal to zero for certain product models.

  • Each task cannot be subdivided and must be assigned to only one station.

  • Robots are capable of performing any task. The processing times of the tasks depend on the type of robot and are deterministic.

  • There is only one robot at each station.

  • There are no restrictions on the selection of robot types assigned to stations. A robot type can be allocated to multiple stations.

  • Depending on the type of robot, the amount of energy required to complete a task varies.

  • Only the energy consumed by the robots is taken into account when calculating the energy consumption of the stations.

  • A robot’s energy consumption consists of energy consumption in operation and standby mode.

  • Material handling, loading and unloading, set-up and tool-changing times are included in the processing times of the tasks.

3.2 Notations

The following are the notations used in this paper:

\(h\): index for lines \(\left(h=\mathrm{1,2},\dots ,H\right)\)

\(i, j\): index for tasks \(\left(i=\mathrm{1,2},\dots ,{I}_{hm}, j=\mathrm{1,2},\dots {J}_{hm}\right)\)

\(r\): index for robot types \(\left(r=\mathrm{1,2},\dots ,R\right)\)

\(k\): index for stations \(\left(k=\mathrm{1,2},\dots ,K\right)\)

\({m}_{hm}\): index for product models on line h \(\left(m=1,\dots ,{M}_{h}\right)\)

\(H\): total number of lines.

\(Nt\): total number of tasks on all lines.

\(R\): total number of robot types.

\(K\): total number of stations.

\({M}_{h}\): total number of product models on line h.

\({P}_{h}\): set of precedence relations \(\left({i}_{h},{j}_{h}\right)\), where task i is an immediate predecessor of task j in line h.

φ: index for production cycle \(\left(\varphi =1,\dots ,\phi \right)\), where \(\phi ={MS}_{max}\)

\({\alpha }_{k-h}^{\varphi }\): product model produced on line h at station k in production cycle φ.

\({t}_{h{\alpha }_{h-k}^{\varphi }ir}\): processing time of task i of the model produced on line h at station k in production cycle φ with robot r.

\({D}_{hm}\): demand, over the planning horizon, for model \({m}_{hm}\) produced on line h.

\({cd}_{h}\): greatest common divisor of product model demands \(\left({D}_{hm}\right)\) for line h.

\({MPS}_{h}\): minimum part set or model mix of line h \(\left({d}_{h}={d}_{h1},\dots {d}_{h{M}_{h}}\right)\)

\({MS}_{h}\): model sequence of line h.

\({d}_{hm}\): demand for model \({m}_{hm}\) in model mix of line h.

\({S}_{h}\): total number of products in one \({MPS}_{h}\) for line h (the length of \({MS}_{h}\) for one \({MPS}_{h}\)).

\(S\): total number of products (sum of \({S}_{h}\) values of all lines).

\(LCM\left({S}_{1},\dots {S}_{H}\right)\): least common multiple of \({S}_{h}\) values.

\({MS}_{max}\): maximum number of model combinations that can appear at a common station.

\({TS}_{k}\): set of tasks assigned to station k.

\(CT\): joint cycle time for all lines.

\({OPC}_{r}\): operation energy consumption of robot r per unit time.

\({SPC}_{r}\): standby energy consumption of robot r per unit time.

\({ST}_{k\varphi }\): workload of station k in production cycle φ.

\({OEC}_{k\varphi }\): operation energy consumption of station k in production cycle φ.

\({SEC}_{k\varphi }\): standby energy consumption of station k in production cycle φ.

\({EC}_{k\varphi }\): total energy consumption of station k in production cycle φ.

\({TEC}_{\varphi }\): total energy consumption of all lines in production cycle φ.

\(TEC\): average energy consumption of all lines for all production cycles.

3.3 Formulation

This section gives the formulation of the MPRALB/S problem. The formulation is adapted from the studies by (Kucukkoc and Zhang 2014a; Özcan et al. 2010; Soysal-Kurt and İşleyen 2022).

A model sequence for a line is based on the minimum part set (MPS) concept (Bard et al. 1992). The minimum part set for line h \(\left({MPS}_{h}\right)\) is obtained by dividing the model demands on line h by the greatest common divisor of the demands \(\left({cd}_{h}\right)\). The vector \(\left({d}_{h}={d}_{h1},\dots {d}_{h{M}_{h}}\right)\) represents the model mix (minimum part set) for line h. \({MS}_{h}\) is the model sequence of line h (Özcan et al. 2010).

$${d}_{hm}=\frac{{D}_{hm}}{{cd}_{h}}\quad m=1,\dots ,{M}_{h }\quad h=1,\dots ,H$$
(1)

The length of \({MS}_{h}\) for one \({MPS}_{h}\), namely, the total amount of product in one \({MPS}_{h}\), is calculated as:

$${S}_{h}=\sum_{m=1}^{{M}_{h}}{d}_{hm}\quad h=1,\dots ,H$$
(2)

In MPRAL, the same model combination may not appear in all production cycles. Therefore, the system should be analyzed by dividing it into multiple production cycles \(\left(\varphi =1,\dots ,\phi \right)\). By dividing the system into \({MS}_{max}\) production cycles, all model combinations that may appear in MPRAL should be examined with the model mix of each line (ϕ = \({MS}_{max}\)). Each consecutive \({MS}_{max}\) production cycle gives the same results. The following calculation is used to determine the maximum number of possible model combinations that may occur at a common station (Kucukkoc and Zhang 2014a):

$${MS}_{max}=LCM\left({S}_{1},\dots {S}_{H}\right)\quad h=1,\dots ,H$$
(3)

The workload of a station may be different for each production cycle. In this case, the station time is calculated according to the model combination that appears at the relevant station in a production cycle. The workload of a station is calculated as follows:

$${ST}_{k\varphi }=\sum_{i\in {TS}_{k}}{t}_{h{\alpha }_{k-h}^{\varphi }ir}\quad \forall k;\, \forall \varphi$$
(4)

After calculating the workloads of all stations for all production cycles, the maximum station time is determined and assigned as the joint cycle time of the MPRAL. The cycle time is determined as follows:

$$CT=max\left({ST}_{k\varphi }\right)\quad \forall k; \,\forall \varphi$$
(5)

During a production cycle, the operation energy consumption of a station is obtained by multiplying the operation power consumption of the robot assigned to that station by the station time:

$${OEC}_{k\varphi }={OPC}_{r}.{ST}_{k\varphi } \quad\forall k; \,\forall \varphi$$
(6)

During a production cycle, the standby energy consumption of a station is determined by multiplying the standby power consumption of the robot assigned to that station by the standby time:

$${SEC}_{k\varphi }={SPC}_{r}.\left(CT-{ST}_{k\varphi }\right) \quad\forall k; \,\forall \varphi$$
(7)

During a production cycle, the total energy consumed at a station consists of the energy consumed during operation and standby:

$${EC}_{k\varphi }={OEC}_{k\varphi }+{SEC}_{k\varphi } \quad\forall k;\, \forall \varphi$$
(8)

The total energy consumption of the MPRAL during a production cycle is the sum of the energy consumption at all stations:

$${TEC}_{\varphi }=\sum_{k=1}^{K}{EC}_{k\varphi }\quad \forall \varphi$$
(9)

The average energy consumption of the MPRAL is obtained by dividing the sum of the energy consumed in all production cycles by the total number of production cycles (ϕ):

$$TEC=\sum_{\varphi =1}^{\phi }{TEC}_{\varphi }/\phi$$
(10)

4 An illustrative example of the MPRALB/S problem

A small-scale example is used in this section to clearly explain the MPRALB/S problem. In this example, there are two lines, seven tasks on each line, six stations and three robot types. The MPSs for Lines 1 and 2 are \({MPS}_{1}=\left(1, 2\right)\) and \({MPS}_{2}=\left(1, 2\right)\), respectively. Since this study introduces and explains the MPRALB/S problem, the processing times and precedence relations of the product models of Line 1 are also used for Line 2. Table 2 contains a dataset for Merten’s problem. The table shows the processing times (s) and precedence relations of the tasks by robots according to product models. The operation power consumptions of the robots per unit time are 0.4, 0.35 and 0.3 for Robots 1, 2 and 3, respectively. \({OPC}_{r}\times {t}_{h{\alpha }_{k-h}^{\varphi }ir}\) is used to calculate a robot’s operation energy consumption. This study assumes that a robot’s standby power consumption is 10% of its operation power consumption (Nilakantan et al. 2015).

Table 2 Immediate predecessors and processing times (s) according to product models for a Merten’s problem (two product models, three robot types and seven tasks)

A solution to Merten’s problem for the MPRALB/S problem is shown in Fig. 2. In this figure, the numbers on the lines represent the indexes of the tasks, and the letters represent the product models. Stations 1 and 2 are separate stations, and the others are common stations. Table 3 shows the station workloads, station energy consumptions and model combinations that appear in the production cycles for a line balance of the mentioned problem.

Fig. 2
figure 2

A line balance for the example problem (\({MS}_{1}={\text{BAB}},\) \({MS}_{2}={\text{DCD}}\))

Table 3 Model combinations, station workloads (s) and energy consumptions (kJ) for all production cycles

Since different model combinations may appear in each production cycle in MPRAL, the number of production cycles is considered \({MS}_{max}\). \({MS}_{max}=LCM\left(3, 3\right)\) and the maximum number of production cycles examined is three. Namely, the calculations after three production cycles will be the same as for those three production cycles.

The model sequence of a line is placed in the table starting from the last station of the line to the first station. The first element in the model sequence of that line is placed at the last station of a line. The second element in the model sequence is placed at the previous station of the last station. The process of placing the model sequences in the table continues as mentioned until the first station of the corresponding line.

According to Table 3, the robots assigned to the stations from Station 1 to Station 6 are Robot 3, Robot 3, Robot 1, Robot 3, Robot 3 and Robot 2, respectively. Take Station 6, a common station, as an example. This station is the last station in both lines. In the first production cycle \(\left({\alpha }_{6-h}^{1}\right)\), the first elements of the model sequences of both lines appear at this station. These are model “B” for Line 1 and model “D” for Line 2. In the remaining production cycles (\({\alpha }_{6-h}^{2}\) and \({\alpha }_{6-h}^{3}\)), they appear as “AB” for Line 1 and “CD” for Line 2 according to the order in the model sequences. The tasks assigned to Station 6 are Task 3 on Line 1 and Task 3 on Line 2. Robot 2 is assigned to Station 6. During the production cycles 1, 2 and 3, the workloads of the station (ST) are 100, 84 and 100 s, respectively. The operation energy consumptions of the station (OEC) are 35, 29.4 and 35 kJ. The standby energy consumptions of the station (SEC) are 0.245, 0.805 and 0.245 kJ. The total energy consumptions of the station (EC) are 35.245, 30.205 and 35.245 kJ. The longest station time in all production cycles for all stations is the joint cycle time of the MPRAL, and therefore, the cycle time (CT) for this solution is 107 s (ST11).

Table 4 shows the energy consumption of the stations and MPRAL for each production cycle. For example, Station 1 consumes 32.1 kJ, 28.05 kJ and 28.05 kJ energy during production cycles 1, 2 and 3, respectively. The average energy consumption of this station is 29.4 kJ. In addition, the energy consumed in the MPRAL is 195,555 kJ, 181,695 kJ and 185,565 kJ during production cycles 1, 2 and 3, respectively. As seen in the table, the total energy consumption of the MPRAL (TECφ) is different in all production cycles. This is because different model combinations appear in all three production cycles. Therefore, the energy consumptions for all production cycles of the MPRAL are averaged. The average energy consumption (TEC) of the MPRAL in the mentioned problem is calculated as 187.605 kJ.

Table 4 The energy consumption at the stations and the total energy consumption of the MPRAL for all production cycles (kJ)

5 Proposed solution approach

5.1 Solution representation

Three vectors are used to represent the solutions: task assignment vector, robot assignment vector and model sequencing vector. The task assignment vector’s size is equivalent to the total number of tasks (Nt). Each position in the vector corresponds to a specific task and contains a station index. The robot assignment vector consists of K elements. Each location on the vector corresponds to a station and contains a robot type index. In the model sequencing vector, the length is S, and a product model is written into each location in the vector. A location in the vector indicates the sequence in which the product model enters the line.

The vectors representing solutions are shown in Fig. 3. For the task assignment vector, “Task” and “Station” refer to an index of a task and an index of the station where the related task is performed, respectively. According to the combined precedence relations diagram generated separately for each line, the first \({I}_{h}\) elements are for tasks on Line 1, and the next elements are for tasks on Line 2. In the robot assignment vector, “Station” and “Robot” refer to an index of a station and an index of the robot type allocated to the corresponding station, respectively. In the model sequencing vector, the first \({S}_{h}\) elements are for product models on Line 1, and the next elements are for tasks on Line 2. “Sequence no” refers to the sequence in which a product enters the line, and “Model” refers to the product model entering the line in the corresponding sequence number. The example in Fig. 3 is considered for two lines, two robot types, four stations and four product models. The total number of tasks on the lines is 14. In the task assignment vector, the first seven elements (tasks) are for Line 1 and the rest are for Line 2. The MPSs for Lines 1 and 2 are \({MPS}_{1}=\left(1, 1\right)\) and \({MPS}_{2}=\left(1, 1\right)\), respectively. In the model sequencing vector, the first two elements are the model sequencing for Line 1, and the rest are for Line 2. For instance, Tasks 6 and 7 on Line 1 and Task 6 on Line 2 are performed by Robot 1 at Station 4. In the model sequencing vector, the first two elements are for Line 1 and the next two elements are for Line 2. The model sequences entering the lines are AB for Line 1 and ED for Line 2.

Fig. 3
figure 3

A solution representation

5.2 Modified NSGA-II

This paper employs NSGA-II, which was introduced by Deb et al. (2002), to address the MPRALB/S problem. In traditional NSGA-II, when the algorithm terminates, the population may contain many solutions with the same objective function. This situation causes the solution diversity in the population to be insufficient. The main reason for this can be explained as follows. In NSGA-II, there are parent population (PP) and offspring population (OP). The OP is formed after crossover and mutation are performed on the parents selected from the PP by methods such as tournament and roulette wheel. Then, the PP and OP are combined. The number of solutions in the combined population (2*pop_size) is twice the predefined population size (pop_size). A new population is generated by selecting solutions from the combined population (CP) according to the fast non-dominated sorting and crowding distance method (FNSCDM) (Deb et al. 2002) until the population size (pop_size) is met. This population will be employed in selection, crossover and mutation operations to generate a new OP.

In the creation part of the OP, if the crossover and mutation probabilities are not met after the selection of the parents, the parents remain unchanged and are transferred to the OP. Since these solutions, which remain unchanged, appear in both the PP and OP, multiple solutions that have the same objective functions arise in the CP. In this case, duplicate solutions can be found in the final population.

This study proposes an improvement to address the issue of the NSGA-II getting trapped in local optima, which can arise from having duplicate solutions. The new proposed version is called MNSGA-II and works as follows (Algorithm 1). The initial population of the algorithm is also the PP and is created with randomly generated solutions until the population size is satisfied. Individuals with the same objective functions in the initial population are identified, one of these individuals is left in the population at random, and the others are eliminated from the population. New individuals are added to the population until the predefined population size is met. These new individuals are obtained as follows. An individual is randomly selected from the population. A new solution is created by applying mutation to this individual, and the new solution is included in the population. Thus, the initial population is created.

After this stage, offsprings are obtained by selecting parents, crossover and mutation. Then, the PP and OP are combined, and individuals with the same objective functions in the CP are found. One of these individuals is left in the CP at random and the others are removed. New individuals are added to the CP until it reaches twice the predefined population size. The procedure of creating new individuals is the same as that applied to the initial population. The FNSCDM is used to select solutions from the CP, which are then added to the new population.

The tournament selection method can be used for parent selection for crossover in NSGA-II. In this study, the selection of parents by tournament method is as follows. From the PP, three individuals are selected at random. One of these individuals is selected according to the FNSCDM and defined as the first parent. This selection procedure is also repeated for the selection of the second parent.

Algorithm 1
figure a

Pseudocode of MNSGA-II

5.3 Crossover and mutation operations

The crossover and mutation operations are performed if the predetermined crossover and mutation probabilities are satisfied. In this study, the crossover operation is applied for all three vectors. Single-point crossover is applied for crossover. The crossover operator for the model sequencing vector is performed as follows (Fig. 4). Parent 1’s model sequence on Line 1 is Offspring 1’s model sequence on Line 1. Parent 2’s model sequence on Line 2 is Offspring 1’s model sequence on Line 1. Thus, Offspring 1 is obtained. For Offspring 2, Parent 2’s model sequence on Line 1 is Offspring 2’s model sequence on Line 1. Parent 1’s model sequence on Line 2 is Offspring 2’s model sequence on Line 2.

Fig. 4
figure 4

Single-point crossover for the model sequencing vector

A mutation is applied to the offspring after the crossover. For mutation, three neighborhood operators are applied. These are swap, insert and alteration operators. The following is the framework of the neighborhood search. The algorithm randomly selects one of the three vectors to search for neighborhoods. Then it chooses Line 1 or Line 2 at random. By randomly applying one of the neighborhood operators, it generates a new neighbor solution. The alteration operator can be applied to the task and robot assignment vectors. The swap and insertion operators can be applied to all three vectors.

The neighborhood operators can be described as follows (Fig. 5). By using the swap operator, the values at two randomly selected locations in the vector are exchanged. With the insert operator, a location is selected at random, and the value at that place is inserted into a different location. By using the alteration operator, a location is selected at random and a different value is assigned to it.

Fig. 5
figure 5

Neighborhood operators for the robot assignment vector

In this study, a repair mechanism is used to make newly generated solutions feasible if they do not meet the precedence constraints of the tasks. If a task is assigned to a station after its successor task, their station indices are swapped (Nilakantan et al. 2017). If the vector does not contain all station indices, the missing indices are inserted into the vector at random until all are included, ensuring that every station has at least one assigned task (Soysal-Kurt and İşleyen 2022).

6 Computational study

A computational study is conducted in this section to solve the multi-objective MPRALB/S problem. The study includes a comparison of the proposed MNSGA-II with NSGA-II and RSA (Li et al. 2016). NSGA-II is implemented to test the effectiveness of the proposed improvement in MNSGA-II. Also, RSA is used for comparison because it is effective at solving multi-objective problems. The RSA’s neighborhood search mechanism is performed similarly to the MNSGA-II’s mutation procedure.

New datasets are generated for the MPRALB/S problem, as there is no existing dataset in the literature. The processing times of Line 1 in the paper by Çil et al. (2017b) are taken as a basis to generate the processing times of the product models. The processing times in the relevant paper are used as the processing times for model “A” on Line 1. The processing times of model “B” on Line 1 are generated by multiplying the processing time of model “A” for robot Ri by random values in the range [0.8 × tir, 1.2 × tir]. The data set generated for Line 1 is also used for Line 2. The power consumption of the robots per unit time is taken from the studies by (Nilakantan et al. 2015; Soysal-Kurt and İşleyen 2022). The precedence relations between tasks can be found on the website http://assembly-line-balancing.de/.

Table 5 gives the problem instances used in the study. For instance, the expression Hahn/53–53 in the table indicates that there are 53 tasks on Line 1 and 53 tasks on Line 2 in the Hahn problem. The number of stations is certain and fixed for each problem. The minimum part sets for the lines are \(\left({MPS}_{1}\right)-\left({MPS}_{2}\right)\). Algorithm parameters are set with Taguchi method on Minitab 19. All the algorithms are encoded in Python and run on Anaconda3-Spyder with an Intel(R) Core(TM) i5-6400 CPU, 2.70 GHz, and 8 GB RAM.

Table 5 Problem instances examined in the paper

6.1 Parameter setting with Taguchi method

The Taguchi method is useful for determining the best combination of parameter levels when there is multiple parameter and levels. It can be applied when the number of full factorial tests for determining parameter levels is high and can reach a conclusion with fewer tests by employing orthogonal arrays. The test results are evaluated with a signal-to-noise (SN) ratio.

Hypervolume ratio (HVR) is a significant measure that assesses the solution diversity and proximity to the Pareto front. Therefore, the response of the experiments is the value of (1-HVR). The objectives of the MPRALB/S problem have different measurement units. To calculate the HVR more consistently, the Pareto solutions are normalized according to the objective functions. The true Pareto front, which is the set of optimal solutions for a given problem, is required to calculate the HVR. The true Pareto front, which cannot be determined in this problem, is composed of the non-dominated solutions obtained after running all the experiments.

For parameter setting, Hahn problem with ten stations is used. The termination criterion for the algorithms is Nt × Nt × 40 ms CPU time, and all experiments are repeated five times. MNSGA-II has three parameters, each with three levels. Thus, the orthogonal array L9 is used, and the design of the experiments is presented in Table 6.

Table 6 Orthogonal array for parameter setting

The parameters and levels of the MNSGA-II are as follows:

  • Population size (pop_size): 30, 60, 90

  • Probability of crossover (p_cros): 0.7, 0.8, 0.9

  • Probability of mutation (p_mut): 0.3, 0.4, 0.5.

Figure 6 and Table 7 show the parameter setting results of the MNSGA-II. Table 7 is the response table for the SN ratios to determine the best levels of the parameters. “Delta” and “Rank” indicate the effect of the parameters on the response. The largest Delta value gives the most important parameter. The ranks indicate the order of effect of the parameters. Thus, the most important parameter is the population size (pop_size) with the largest Delta value, followed by the probability of mutation (p_mut) and the probability of crossover (p_cros). In the main effects plot (Fig. 6), the highest average SN ratio gives the best level for each parameter. Therefore, the best parameter levels for solving the problem are pop_size = 30, p_cros = 0.9 and p_mut = 0.3. The parameter values for MNSGA-II are also used for NSGA-II.

Fig. 6
figure 6

SN ratios mean effect plot

Table 7 Responses for SN ratios

The same procedure as for the MNSGA-II is applied for the parameter setting of the RSA. The followings are the parameters and levels:

  • Initial temperature (T0): 0.5, 0.75, 1.0

  • Cooling rate (alpha): 0.90, 0.95, 0.98

  • Number of solutions searched in each temperature (N): 10, 20, 30

  • Number of moves before restart (NR): 50, 100, 150.

The best parameter levels for the RSA are T0 = 0.75, alpha = 0.95, N = 30 and NR = 50.

6.2 Performance measures

In this paper, the performance of the algorithms is evaluated through three widespread evaluation metrics: hypervolume (HV) ratio, the ratio of non-dominated solutions (RP) and generational distance (GD). The followings are descriptions of the metrics:

Hypervolume ratio. It is calculated by dividing the HV of a Pareto frontier (S) by the HV of the true Pareto front (P) and is expressed as follows (Nilakantan et al. 2017):

$$HVR=\frac{volume\left({\bigcup }_{i=1}^{size\left(S\right)}{v}_{i}\right)}{volume\left({\bigcup }_{j=1}^{size\left(P\right)}{v}_{j}\right)}$$
(11)

\({v}_{i}\) and \({v}_{j}\) refer to hypercubes. The worst values for all objectives are represented by a reference point W. It is used to determine a Pareto set’s volume. The diagonal corners of a hypercube are on the objective vectors of the Pareto frontier and W. The higher the HVR of a Pareto set, the closer it is to the true Pareto front.

The ratio of non-dominated solutions. It is the ratio of the number of solutions non-dominated by the other non-dominated solutions in the union of sets to the number of non-dominated solutions produced by the algorithm (Soysal-Kurt and İşleyen 2022; Zhang et al. 2019).

$$RP\left({S}_{i}\right)=\left|{S}_{i}-\left\{x\in {S}_{i}|\exists y\in S : y\succ x\right\}\right|/{S}_{i}$$
(12)

where \(y\succ x\) means y dominates x, \(\left|{S}_{i}\right|\) is the number of solutions in the obtained Pareto set \({S}_{i}\) by the algorithm, and S is the union of all sets. It is the number of non-dominated solutions produced by the algorithm in the union of sets divided by the number of non-dominated solutions produced by the algorithm. A higher value of this metric indicates a better set of solutions.

Generational distance. It is used to measure the distance between a Pareto set and the true Pareto front and is calculated as the average Euclidean distance between the solutions in the Pareto set and their nearest point on the true Pareto front. A lower GD value indicates a better solution set. GD is described as (Van Veldhuizen and Lamont 2000):

$$GD=\frac{\sqrt{\sum_{i=1}^{n}{d}_{i}^{2}}}{n}$$
(13)

where n is the number of solutions in the Pareto set obtained by an algorithm, and \({d}_{i}\) is the Euclidean distance between solution i and its nearest point on the true Pareto front.

The non-dominated solutions obtained by the algorithms are normalized to compute the metrics consistently. Furthermore, the true Pareto fronts for each test problem consist of non-dominated solutions that occur after the union of the Pareto sets obtained by all algorithms.

6.3 Comparison of algorithms

This section focuses on evaluating the MNSGA-II, NSGA-II and RSA in terms of HVR, GD and RP metrics. For each problem instance, the experiments are run ten times due to the randomness, and the termination criterion of the algorithms is Nt × Nt × 100 ms CPU time.

Table 8 shows the results of the performance metrics for each problem instance. Evaluation of the findings reveals that in terms of the HVR metric, MNSGA-II achieves the best values among all algorithms in 12 cases, RSA in 10 cases and NSGA-II in 3 cases. In terms of the RP metric, both MNSGA-II and RSA obtain the best values among all algorithms in 12 cases and NSGA-II in 3 cases. Regarding the GD metric, MNSGA-II achieves the best values in 13 cases, RSA in 11 cases and NSGA-II in 2 cases. Additionally, it is observed that in multiple problem instances, certain algorithms achieve identical metric values.

Table 8 Comparison of the algorithms by performance metrics

When comparing the performances of MNSGA-II and NSGA-II to assess the effectiveness of the proposed improvement in MNSGA-II, it is clear that MNSGA-II outperforms NSGA-II across all performance metrics. MNSGA-II shows superior performance in approximately 81% of the cases for the HVR, 71% of the cases for the RP and 76% of the cases for the GD.

Figure 7 shows the non-dominated solutions obtained from the algorithms for six problem instances with different sizes. All algorithms can obtain many solutions within ten runs, but figures include the non-dominated solutions from all runs. The non-dominated solutions are indicated by blue circles for MNSGA-II, green squares for NSGA-II and red triangles for RSA. The computational results and Fig. 7 clearly demonstrate the superiority of the proposed MNSGA-II over NSGA-II. Moreover, it can be inferred that MNSGA-II and RSA show comparable performance in solving the problem instances.

Fig. 7
figure 7

Pareto fronts of the MNSGA-II, NSGA-II and RSA

To explain the trade-off between the objective functions from a managerial perspective, consider the non-dominated solutions generated by MNSGA-II in the “Gunther with 10 stations” problem illustrated in Fig. 7c. The Pareto front of the MNSGA-II has six non-dominated solutions, labeled as “p1” to “p6”. The “p1” solution has a cycle time of 340 s and an average energy consumption of 1829.16 kJ, whereas “p6” has a cycle time of 357 s and an average energy consumption of 1684.65 kJ. By moving from “p1” to “p6” on the Pareto front, there is an opportunity for 7.9% energy savings with a longer cycle time of 5%. On the other hand, some positions on the front offer energy savings at a minimal cost to cycle time, such as moving from point “p4” to point “p5” which results in a 26.36 kJ energy savings while only extending the cycle time by 2 s. Furthermore, a shift from solution “p2” to “p3” on the Pareto front results in a 55.69 kJ reduction in average energy consumption, with an increase of two seconds in the cycle time. When this approach is applied to large-scale problems, significant energy savings can be achieved, and it can guide managers who want to reduce the negative environmental impacts of production while determining the most appropriate production plan.

In addition, MNSGA-II, NSGA-II and RSA are statistically compared using performance measures. An initial ANOVA test is utilized, and it is determined that the performance metric values are not normally distributed. Therefore, the non-parametric Friedman rank-based test is conducted to assess whether there is a statistically significant difference between the algorithms. Since higher values are better for HVR and RP, if the rank value for one algorithm is higher than the rank values of the other algorithms, this algorithm can be considered to perform better than the others. For GD, the opposite is true as lower value is better. Figure 8 shows the Friedman ranks of HVR, RP and GD. For the HVR metric (Fig. 8a), the Friedman ranks indicate that MNSGA-II performs the best, followed by RSA and NSGA-II. In the case of RP (Fig. 8b), the ranks remain RSA, MNSGA-II and NSGA-II. However, it’s important to note that the rank values for MNSGA-II and RSA are nearly identical, suggesting their comparable performance. Lastly, when considering the GD metric (Fig. 8c), the rankings are MNSGA-II, RSA and NSGA-II. These ranks provide valuable insights into the relative performance of these algorithms across different evaluation criteria.

Fig. 8
figure 8

The Friedman ranks of HVR (a), RP (b) and GD (c)

According to Friedman test statistics, p values are 0.001, 0.003 and 0.003 for HVR, RP and GD, respectively. The results at a 95% confidence level indicate that the performance of at least one algorithm is different from the others. The Wilcoxon signed rank test is applied to determine which algorithms’ performances are different. The p-values for the Wilcoxon test are given in Table 9. A p value less than 0.05 means that there is a significant difference.

Table 9 Statistics for the Wilcoxon signed ranks test

According to the statistical test results, the following conclusions can be drawn. i) The performances of NSGA-II and MNSGA-II are different and MNSGA-II outperforms NSGA-II. ii) The performances of RSA and NSGA-II are different and RSA outperforms NSGA-II. iii) There is no statistically significant difference between the performances of MNSGA-II and RSA.

7 Conclusion

Robots have been widely used in the industrial sector in recent years. While they increase production efficiency significantly, they also increase energy consumption and carbon emissions. In manufacturing, robots are commonly utilized on assembly lines. Consideration of energy consumption when balancing a robotic assembly line can result in significant energy savings. In MPRAL, where multiple product models are produced in a mixed sequence, finding an optimal line balance and model sequence is quite difficult. As a result, this paper aims to minimize cycle time and energy consumption in the MPRALB/S problem, and it is the first attempt to simultaneously address balancing and sequencing problems in MPRAL.

The MPRALB/S problem aiming to minimize the cycle time and energy consumption is illustrated with a numerical example. A MNSGA-II is developed to solve the problem. This algorithm eliminates solutions with the same objective function values in the population. In order to determine the parameter values of the algorithm, the Taguchi method is employed. The algorithm’s performance is then compared to NSGA-II and RSA. The results of the computational study show that MNSGA-II is superior to NSGA-II. Furthermore, MNSGA-II and RSA produce solutions with similar levels of diversity and proximity to the true Pareto front.

Line balancing and model sequencing are two closely related problems for a MRAL. The performance of both problems is affected by each other’s results. The assignment of robots to stations by considering their energy consumption complicates the process of determining the best solution for the MPRALB/S problem. There are some benefits of simultaneously balancing and model sequencing of MPRAL to minimize cycle time and energy consumption, such as increased line efficiency and reduced energy consumption. Since processing times may vary from one model to another, assigning suitable tasks to stations can reduce station idle times, resulting in increased productivity. Therefore, as the energy consumed in the MPRAL will also decrease, a sustainable line is obtained and an economic improvement is provided for the company.

As the addressed problem introduces a new field of research to the assembly line literature, it provides opportunities for researchers to further develop this field. Any robotic assembly line balancing problem is known to be complex. It becomes more complicated when the model sequencing and balancing problems are addressed simultaneously. It takes more time to find an effective solution. In future studies, solution approaches such as mathematical models, heuristic and metaheuristic methods that find effective solutions in a shorter time for more complex model mixtures can be developed.