Introduction

Today’s competitive environment impels companies to improve the quality of their products proactively, so the design of experiments (DOEs) can be one of the most efficient methods for this purpose. The experimental design helps us find the effects of the controllable and nuisance factors on one or more responses. Finding the combination of controllable factor levels, namely treatment, which leads to the most appropriate process outputs is one of the common challenges in quality engineering researches.

The Taguchi method is a common strategy in the robust design and involves designing experiments with the use of orthogonal arrays for finding the treatment which optimizes a given performance measure, typically a signal-to-noise (SN) ratio.

The Taguchi method was initially proposed for single response problems (Taguchi and Chowdhury 2000; Maghsoodloo et al. 2004; Robinson et al. 2004; Zang et al. 2005). However, in real industrial problems, there are more than one responses and their simultaneous optimization is of the interest.

Multiple-response optimization (MRO) problems have been studied by several researchers. Existing methods in this field are classified into three basic categories by Ortiz et al. (2004). The performance of each category depends on the complexity of the problem. In the first category, overlying contour plots of each response is applied for finding the space where each response satisfies its specification limits. Myers and Montgomery (2002) expressed that this method is applicable only in the situations where the numbers of controllable factors are very few. In the second category, the most important response for the decision maker (DM) is used as the main objective and the rest of responses are considered as constraints. However the approaches in this category do not conform to the basic idea of multiple-response optimization which is the simultaneous optimization of all responses.

The third category, which contains a major proportion of studies in MRO, consists of three main steps to find the optimal treatment. In the first step, a model for describing the relation between controllable factors and responses is built. In the second step, an aggregation approach of responses is applied. Finally, in the third step, an optimization method is used for optimizing the single response which is obtained from the last step. In the mentioned categories, Ordinary Least Square (OLS) is one of the common methods for building the model of relation between controllable factors and responses (first step). Although in some researches such as Bashiri and Moslemi (2013), robust optimization methods are applied; one of the major shortcomings of the studies in this category is that when the mean square error (MSE) of the regression model is a high value, the ability of the model to describe the relationship of the response variables and the controllable factors would be poor (Erzurumlu and Oktem 2007). For overcoming this problem, ANN can be used as a proper substitute method for response estimation. Some authors have compared response surface and regression models with ANN in model building and the preciseness of ANN has been verified in their results (Tsao 2008; Desai et al. 2008; Namvar-Asl et al. 2008; Gauri and Pal 2010).

Furthermore, Niaki and Hoseinzade (2013) used ANN for forecasting S&P indices where the experimental design was used for tuning the parameters of the neural network.

A numerous number of researchers studied about using artificial neural networks for solving MRO problems. The summary of existing approaches is illustrated in Figure 1, where the studies are classified according to the techniques each of them has used.

Figure 1
figure 1

A taxonomy of methods for multiple-response optimizations using ANN (Li et al. 2003 , Hsu 2004 , Ozcelik and Erzurumlu 2006 , Chang 2006 , Chang 2008 , Chang and Chen 2011 , Hesieh and Tong 2001 , Noorossama et al. 2009 , Hsu 2001 , Sibalija and Majstorovic 2012 , Caporaletti et al. 1999 , Liao and Chen 2002 , Liao 2004 , and Liao 2005 ).

In MRO problems, different multi-criteria decision making methods have been used to determine the optimum treatment (for a review see Amiri et al. (2012)). Data envelopment analysis (DEA) is one of these techniques which have been used in several researches. Caporaletti et al. (1999) proposed a pure input DEA model for the nominal-the-best (NTB)-type responses using y ¯ ij y i 2 and S ij 2 as input variable. In this study, just the experimented treatments have been evaluated. Liao and Chen (2002) proposed an input-oriented basic DEA ratio known as the Charnes, Cooper and Rhodes (CCR) model introduced by Charnes et al. (1978) that uses the normalized mean responses as input variables when the responses are the NTB or the smaller-the-better (STB) type; also, Goel et al. (2007) proposed a new method in multiple-response optimization using the Pareto optimal solution.

In the model of Liao and Chen (2002), when the responses are LTB type, the normalized mean responses are considered as the output variable. Herein again, only the real experimented treatments and their corresponding responses are considered. Liao (Goel et al. 2007) also used a back propagation (BP) neural network (trained with the data of the actual treatments) to estimate the SN ratio of responses for all treatments and then efficient treatments are determined by the CCR DEA model, considering normalized SN ratio as outputs.

In Liao (Goel et al. 2007), the same DEA model is used but all possible treatments are estimated using a BP neural network. In their proposed approach, the most treatment is not selected.

Gutierrez and Lozano (2010) used a similar approach to find the efficient treatment and then sieve the most efficient among the efficient ones.

In the mentioned literature of the studies which have used DEA for the determination of the optimum treatment, only the response variables have been focused on, while the main philosophy of DEA is maximization of the overall process outputs versus the minimization of the total consumed inputs.

Many real world processes can be exampled where the controllable factors are STB type. For instance, in a plastic molding process, one of the factors is the barrel temperature, of which the less value is more preferred. The higher temperature impels more electricity consumption and equivalently more costs. By considering other similar STB controllable factors as inputs and the SN ratio of response variables as the outputs of the process, the main philosophy of DEA would be realized. The final results of DEA for such a problem would assure us that the most appropriate quality of the process is obtained by the least consumption of input variables.

In this paper, a new approach is developed based on the neural network and data envelopment analysis where controllable factors are the STB type and the responses are uncorrelated. Besides, it is. In this method, the ANN is used to estimate the response values for unpracticed treatments in a way that the multiple responses are obtained simultaneously. In this paper, the used neural network is tuned by the method proposed by Bashiri and Geranmayeh (2011). In their method, the Central Composite Design (CCD) and the Genetic Algorithm (GA) have been used to tune and determine the optimum parameters of the neural network, specifically the number of the layers and the number of neurons in each layer.

After tuning the ANN, the unpracticed treatments with smaller intervals of factor levels are generated and their corresponding responses are estimated by the ANN.

In the next step, by using DEA, the efficiency value of each treatment is computed. Finally, the most efficient treatment is determined using the maximin weight model approach.

The remainder of the article is organized as follows: in the next section, the steps of the proposed approach are described. In section 3, the proposed method is implemented in a real world case study and the steps of the proposed method are explained thoroughly. Finally, section 4 summarizes and draws the conclusion.

Proposed approach

This section describes the proposed methodology which is illustrated in Figure 2.

Figure 2
figure 2

Flow chart of the proposed approach.

Experimentation phase

In the first step of any design of experiment problems, the responses and their descriptions such as their types should be determined. Also, the controllable factors and their corresponding levels should be specified according to the technical knowledge of the process and the execution limitations. By knowing the required information of responses and the controllable factors, a Taguchi design is chosen and conducted and the response data are collected. The outputs of this step are the SN ratio values computed for each response in each treatment. Note that the responses are assumed to be uncorrelated.

Tune-effective parameters of ANN for estimation of responses

For obtaining desirable results by using Artificial Neural Networks, tuning the parameters of ANN seems to be necessary. For example, the number of layers and the number of neurons in each layer are effective parameters in the performance of neural networks. Some authors have used the design of experiments (DOE) for determining the best combination of effective parameters of ANN. Khaw et al. (1995) used the Taguchi method as well as two simulated data collections to determine the effective parameters of ANN, which caused to increase the velocity and convergence of the Back Propagation (BP) algorithm. Also, other similar researches have been proposed in this field, such as the studies of Kim and Yum (2004), Sukthomya and Tannock (2005), Tortum et al. (2007), Packianather et al. (2000), and Peterson et al. (1995). Bashiri and Geranmayeh (2011) proposed a method for tuning the parameters of the artificial neural network based on CCD and genetic algorithm. Because of the accuracy and generalization capability of this approach, in this study, we applied this method for tuning the parameters of the neural network with some necessary changes in determining the neural network’s performance criteria. The condition of the problem determines the proper performance criteria of the neural network.

For training ANNs, the data are divided into three subsets: training, validation, and testing sets. In this study, root mean square error (RMSE) of the test and validation data are considered as ANN’s performance criteria in estimation of responses in the multiple response optimization problem. In this step, the optimum number of hidden layers and the number of neurons in hidden layers are obtained.

Train ANN using tuned parameters and estimate the unpracticed treatments

In the previous step, optimal parameters of ANN for this problem are obtained. So, the neural network is ready to be trained and estimate the response values of unpracticed treatments.

After the training phase, neural network builds a model and can estimate other treatments which are not experimented. Since the trained neural network’s response estimation is not affected by the number of factor levels, new levels are defined between the initial factor levels. This procedure improves the accuracy of the solution.

In mentioned studies, the neural network is used to estimate the SN ratio or the mean square deviation (MSD) of responses. But in the cases where a nuisance factor exists, the effect of the nuisance factor is neglected and the nuisance factor is treated as a replicate. In this study, for solving this problem, ANN is applied for estimating responses, not the SN ratios or MSDs of responses. As Chang (Ozcelik and Erzurumlu 2006) applied ANN in dynamic multiple response experiments, factors and nuisance factors are considered as inputs for training of the neural network.

Evaluate the efficiency of all treatments using DEA

DEA is used to compute the relative efficiency of a group of competing decision-making units (DMUs), while there are several inputs and outputs for each DMU (Tbanassoulis 2001). The relative efficiency is the ratio of the weighted sum of outputs to the weighted sum of inputs.

Thus, if a DMU wants to have an upper efficiency, the input data must be minimized and the output data must be maximized. If it is assumed that n available DMUs should be studied, and each of them has m inputs and s outputs, the efficiency of the j th DMU will be computed by Equation 1:

Maximize E 0 = r = 1 s u r y r 0 s . t . i = 1 m v i x i 0 = 1 , r = 1 s u r y rj i = 1 m v i x ij 0 , u r 0 , r = 1 , , s , v i 0 , i = 1 , , m , ,
(1)

where u r is the weight of output r, v i is the weight of input i, y rj is the value of output r from DMU j, and x ij is the value of input i from DMU j and where DMU0 is the DMU under study.

In this paper, each treatment is considered as a DMU, each controllable factor is considered as an input variable and finally each response variable is considered as an output.

For using DEA in MRO problems, input variables should be STB type and response variables should be LTB type; however this condition does not necessarily hold in all processes. For this reason, before applying DEA, it should be checked that the controllable factors are STB type variables and the responses are LTB-type ones.

Although there might be different types of responses in an MRO problem, since we used the Taguchi method, the responses are transformed into the SN ratio values using Equations 2, 3, and 4 and we know that the SN ratio is a LTB-type variable. For the controllable factors, according to their properties, we can assure that whether a controllable factor is STB type or not.

x ij = 10 log 10 1 l k = 1 i y ijk 2 0 y ijk for the smaller the better response
(2)
x ij = 10 log 10 1 l k = 1 i 1 y ijk 2 0 y ijk for the larger the better response
(3)
x ij = 10 log 10 y ¯ ij 2 S ij 2 0 y ijk for the nominal the better response
(4)

Let the SN ratio be x ij for the j th response at the i th trial, for i = 1, …, m, j = 1, …, n. y ijk is the observed data for the j th response at the i th trial, in the k th repetition, y ¯ ij = 1 l k = 1 l y ijk (the average observed data for the j th response at the i th trial), S ij 2 = 1 l 1 k = 1 l y ijk y ¯ ij 2 (the variation of the observed data for the j th response at the i th trial) for, i = 1,…,m and j = 1,…,n and k = 1,…,l. Now, x ij is normalized as Z ij (0 ≤ Z ij  ≤ 1) by Equation 5 to avoid the effect of adopting different units.

Z ij = X ij min X ij , j = 1 , 2 , , n max X ij , j = 1 , 2 , , n min X ij , j = 1 , 2 , , n
(5)

At the end of this step, an efficiency value is computed for each treatment which represents that how well the input variables, controllable factors, have been minimized and how well the output variables, SN ratios of responses, have been maximized. The efficiency value is a measure ranging from 0 to 1 and sometimes more than one treatment would have the efficiency equal to 1.

Use maximin weight model to choose among the efficient treatments

In DEA method, it is common that more than one DMU would be selected as the efficient one. For the determination of the most efficient DMU, another mathematical modeling should be done. The maximin weight model was presented by Wang et al. (2009) for ranking DEA efficient units. This model is solved for all of efficient DMUs, the model is represented in Equation 6.

Maximize w s . t . i = 1 m v i = 1 , r = 1 s u r y ^ ro i = 1 m v i x ^ i 0 = 0 , r = 1 s u r y ^ rj i = 1 m v i x ^ ij 0 , j = 1 , , n u r w , r = 1 , , s , v i w , i = 1 , , m , ,
(6)

where w is the maximin weight that could keep DMU0 efficient, x ^ ij i = 1 , , m ; j = 1 , , n is the normalized input obtained using Equation 7.

x ^ ij = x ij j = 1 n x ij , i = 1 , , m ; j = 1 , , n
(7)

If there would be k efficient units, by solving maximin weight model represented in Equation 7 for each efficient unit, a group of maximin weights, w i 1 , w i 2 , w ik , are obtained and the DMU with the largest value of w is considered as the most efficient treatment.

Illustrative example

For verifying the applicability of the proposed method, a real case study of an injection molding process, in a TV and monitor production line, is conducted. In the case under study, two responses are of interest and the responses are the sizes of the upper side and the lower side of a cabinet front of a monitor. The specification limits of responses are represented in Table 1. In this table, the response types are determined so that we would be able to choose among the formula of computing SN ratios from Equations 2, 3, and 4.

Table 1 Response definitions for the injection molding process example

According to the initial experiments and also operational experiences, five controllable factors are screened to set up the experimental design. The description of each controllable factor and also their corresponding levels are illustrated in Table 2.

Table 2 Factors and their levels for the injection molding process example

According to the information given in Table 2, the L18 Taguchi design is selected for design of experiments. Since two similar machines can be used for production of the mentioned part, two nuisance factors are considered.

As it can be easily seen in Table 2, the controllable factors are mainly from the type of injection pressure and injection speed or processing time. According to maintenance considerations of injection machines, a production process with less needed injection pressure and less needed injection speed would incur less erosion to the machines, so all pressure-related factors and speed-related factors are STB type. Besides, as much as the required time for a process is shorter, the production rate would increase, so a shorter processing time is more desired. By considering these aspects, all the mentioned controllable factors can be considered as STB-type inputs. An efficient treatment is a setting in which not only the quality requirements are met, but also the abovementioned preferences of the operation are satisfied. Some of the L18 orthogonal Taguchi experimental design results are reported in Table 3.

Table 3 Summary of experimental results for injection molding process example

Tune-effective parameters of ANN for estimation of responses

In the training phase, factor level values and noise factor levels are considered as inputs and corresponding response values are considered as outputs of neural networks. As results, there are 36 data divided into three groups of training, testing, and validation data. An important point in choosing testing and training data is that considering one replicate as training and other replicate as a testing data causes erroneous judgment. In this study, the 5th, 10th, and 16th experiments in condition of the first noise factor and second noise factor are considered as testing and validation data, respectively, and others as the training data. The algorithm and its significant parameters considered for training the neural network are reported in Table 4. In the training phase of the ANN, the linear function is selected as the transfer function for the last layer and the tangent sigmoid is used for other layers. By considering the dimension of training data, axial points for designing of experiments by CCD is determined in Table 5. For more details of the tuning method, see Hsieh and Tong (2001).

Table 4 Algorithm and its parameters considered for training the neural network
Table 5 Parameters and their levels studied in experiments based on CCD design in ANN parameter tuning

By performing such an experiment, the optimal setting for the parameters of the neural network are determined as reported in Table 6.

Table 6 Optimum values of effective parameters in performance of ANN

Train the ANN using tuned parameters and estimate unpracticed treatments

In this step, data are divided into testing, training, and validation data subsets as mentioned in the previous section, and finally, the training of ANN is conducted using the tuned parameters obtained from the previous step.

It should be noted that the initial conducted experiment whose results are reported in Table 3, just included 4 treatment results from the 18 practiced treatments, while by using ANN, many other unpracticed treatments and corresponding results can be estimated in the current step. For this regard, new levels are defined between initial levels. These levels are shown in Table 7.

Table 7 Factors and their new defined levels for the injection molding process example

According to the levels defined in Table 7, there are 54 × 3 = 1,875 possible treatments (because of the five levels for factors B, C, D, E, and three levels for factor A) which their responses can be estimated by the trained ANN. The responses of these factor level combinations are estimated using the trained neural network. The estimated responses for new defined treatments are reported in Table 8.

Table 8 Estimated values by neural networks for actual experiments for the injection molding process example

Evaluate the efficiency of all treatments using DEA

In this step, we assume each of the estimated treatments reported in Table 8 as a certain DMU. The SN ratios of estimated responses are computed and then normalized in each treatment using Equations 3 and 5. In this step, evaluation of the efficiency for each treatment can be conducted by solving Equation 1.

Matlab software (MathWorks, Inc., Natick, MA, USA) is used to solve 1,875 linear models to determine the efficiency of each treatment. Finally, the efficiency value for 12 of them is obtained as equal to 1. In the next step, the most efficient treatment would be selected among these 12 treatments.

Use maximin weight model to choose among the efficient parameter combinations

In this step, minimum weighted restriction is applied to determine the most efficient treatment among all. Efficient treatments and weights which are obtained by solving Equation 6 are illustrated in Table 9. As mentioned before, the treatment with maximum weight is selected as the most efficient solution of the case study. The most efficient treatment is obtained as A = 45%, B = 55%, C = 40%, and D = 80% and E = 6 s.

Table 9 The maximin calculated weight for efficient combinations

The combination and corresponding values with the most efficient treatment are denoted in italics. For verifying the obtained results, the real results of the most efficient treatment are performed for a batch of 30 parts and the most efficient and the extracted results are compared with previous experimented data. Results of the comparisons are illustrated in the Table 10, in which SN1 and SN2 are signal-to-noise ratios of the first and second responses, respectively. Note that the most efficient treatment result has been considered as the 19th run. At first, efficient treatments among these 19 treatments are determined, then the most efficient treatment is selected. As it is observed, in this stage, the previous most efficient treatment is selected as the most efficient one. It shows that the obtained treatment as the most efficient one is theoretically and practically the most possible efficient treatment.

Table 10 Validating results

For increasing the applicability of the final results, one-way and two-way sensitivity analyses of efficiency under different values of related factors have been studied. For this purpose, lots of experimented data is necessary to track the sensitive parameters; also, efficiency calculation requires lots of computations for the estimated response values. So, an artificial neural network is applied as the efficiency estimator of different treatments. For this purpose, an ANN is trained by using level values of 1,875 treatments as inputs and efficiency of them as outputs. Then, the sensitivity analysis is done under different values of controllable factors in one or two ways using the trained efficiency estimator neural network. Results of the sensitivity analyses are illustrated in Figure 3. It can be observed that controllable factors A and D can be changed in intervals of about (42 to 45) and (79 to 81), respectively, without changing the selected treatment efficiency score; however, factors B, C, and E have inverse relation to the efficiency score in the analyzed amplitude.

Figure 3
figure 3

Sensitivity analysis of experimental efficiency under different values of factors a, b, c, d, e, respectively.

Two-way sensitivity analysis has been illustrated in Figure 4; it shows that there are interactions between any pairs of controllable factors which affect the efficiency score and that simultaneous analysis of controllable factors instead of individual analysis is more desirable. This verifies that the proposed efficiency analysis of treatments in multiple-response optimizations is a preferred technique than others especially when the controllable factors are as smaller-the-better type.

Figure 4
figure 4

Two-way sensitivity analysis of experimental efficiency under different values of factors a-d, a-b, b-d, a-c, respectively.

Conclusions

In this study, a four-step approach was presented to find the optimal treatment in multiple-response optimization problems. After conducting a Taguchi-designed experiment and collecting the response data, by using a tuned neural network, the responses of unpracticed treatments were estimated considering new levels defined for the controllable factors. Each treatment was assumed to be as a DMU and the smaller-the-better-type controllable factors were assumed as the input variables, whereas the computed SN ratios of responses were assumed to be as the outputs for the DEA modeling. Then data envelopment analysis was applied to obtain the efficiency of each treatment. Finally, the maximin weight model was applied to find the most efficient treatment.

In the proposed method, for assisting the economic aspects of the process improvement, the type of controllable factors was accounted for, and using this approach, the overall output quality of the process was maximized while the usage of input variables was minimized. For validation of the study, the proposed method was applied in a plastic molding process as a real case and the results were compared and analyzed. Sensitivity analysis of the efficiency deviation, which was the useful tool for analyzing results, was presented by applying another neural network. The analysis showed that the proposed approach was a proper tool in discrete multiple response optimization especially for the STB controllable factors. As illustrated in this paper, the proposed approach was based on the response values and it did not consider the variations of responses; so, using other approaches which consider the variation can be future studies of STB-type controllable factors in MRO problems.