1 Frame of reference

In engineering design, there is a need to infuse the knowledge of downstream activities into the design process to iteratively improve model formulation and approximation.

1.1 Features of engineering-design problems

A designer concurrently needs to consider the interactions among subsystems and tradeoffs among their conflicting requirements while simultaneously meeting multidisciplinary constraints, such as cost, physical properties, manufacturing capacity, etc. (Wang 1994). This means that the goals may have different units, and therefore, the evaluation of the tradeoffs can be difficult. Hence, satisficing solutions are desired; these solutions meet multiple design requirements and are relatively insensitive to uncertainties (Wang et al. 2018).

1.2 Studies and gaps on multi-component, multi-goal problems

1.2.1 Regarding problem formulation

Previati and coauthors (2019) use topology to allocate the shared part of the design space to each component and identify the optimal structural layout while maintaining its original geometry. You and Smith (2016) use atomic theory and fuzzy clustering to configure modules and manage multiple design objectives. Chiu and coauthors (2021) use this concept of Design for Assembly to model a packaging system as a three-objective problem combining integer programming and goal programming.

Regarding solution algorithms: (Fishburn 1974) review the literature in Lexicographic methods and categorize them in two trends, mathematical and pragmatic. However, while applying Lexicographic methods is that the utility theory based on human preferences are certain, descriptive, and rational. Therefore, the definition of “best” alternative depends on the context and evolve during the design. Avigad and Matalon (2011) solve multiple single-objective problems with shared components using an evolutionary algorithm. The authors compare various approaches to search for common components of multiple systems or modeling objects. Katsikopoulos (2011) explores the psychological heuristics (as an alternative to optimization) in making inferences and finds out it has mathematical basis and advantages comparing with optimization when making decisions about the future. Meier et al. (2016) explore particular subsets of the Pareto-optimal solutions for time–cost tradeoffs in product development using managerial work policies. Moon et al. (2014) use multi-objective particle swarm optimization (MOPSO) to determine variables for optimal platform strategy for designing a product family.

In this paper, we categorize multi-component or multi-goal problems into two categories—focusing on improving the performance of the solution algorithms and/or design improvement. In the papers on solution algorithms, the main focus is on sorting near-Pareto frontiers (Deb et al. 2002; Seada and Deb 2014). The performance of an algorithm is evaluated by criteria such as solution optimality, diversity of solutions, and the computational complexity of the algorithm (Soltani et al. 2002). Whereas in the papers on design improvement, the authors focus on improving the model formulation or acquiring more knowledge about the interrelationships among the components (Tang et al. 2010). Some typical methods in both categories are shown in Table 1. In our paper, the combination of these two approaches is addressed.

Table 1 The features and limitations of some classic multi-objective (multi-goal) solution algorithms and methods

The limitations of the existing literature on multi-component, multi-goal problems can be summarized as follows.

  • Searching the near-Pareto frontier of optimal solutions that are sensitive to model errors and variations. Optimal solutions are sensitive, because a problem needs to meet the necessary and sufficient Karush–Kuhn–Tucker (KKT) conditions at an optimal solution point, and the probability that a model error or variation violates the KKT conditions is relatively high. Another type of solution, a satisficing solution, only requires the problem to meet the necessary KKT condition. Thus, these solutions are relatively insensitive to model errors and variations. Numerical comparisons between optimal solutions and satisficing solutions are given in Chapter 2 of (Guo 2021).

  • Lack of decision support about using solutions in different areas of the design space in various situations. When there are too many choices but little knowledge on how to select them, decision-makers may be overwhelmed and end up with wrong decisions (Toffler 1970).

  • Relying on domain knowledge to combine or prioritize multiple objectives (Smith et al. 2015), decompose the problem (Zhang and Li 2007), or set targets for the goals (Sabeghi et al. 2016). For some problems, domain knowledge may not be correct or for a new design, domain knowledge may be not be available

Given these limitations, in this paper, we want to find a domain-independent method to capture and reuse the knowledge of a multi-goal engineering-design problem to facilitate the exploration and selection of design scenarios.

To answer this, data analysis is used to obtain insight for decision support for combining goals, especially when domain knowledge is insufficient. We hypothesize that by exploring goal combinations based on their interrelationships, the rate of achievement of each goal can be improved, is more understandable, and diverse design scenarios can be identified. The process of knowledge discovery and reuse is then domain-independent. In this paper, goals are defined as objectives with target values and the design problem is managed by minimizing the deviation between what is achievable and the goal target. Therefore, design improvement is obtained by exploring the interrelationships among the system components, and satisficing solutions are obtained by combining the Pre-emptive and Archimedean (weighted sums) strategies of reaching goals.

In Sect. 2, we propose a knowledge-driven method, the adaptive leveling-weighting-clustering (ALWC) algorithm, to manage knowledge capture and reuse in multi-goal problems. In Sect. 3, we use a thermal-system design test problem to demonstrate the utility of the ALWC method. In Sect. 4, the results and discussion are presented. Finally, in Sect. 5, the contributions in this paper are summarized and the scope of applicability of the ALWC method is described.

2 Proposed method: the adaptive leveling-weighting-clustering (ALWC) method

We propose the adaptive leveling-weighting-clustering algorithm (ALWC) to fill the research gaps. How do we fill the research gaps using the ALWC?

First, instead of searching for a near-Pareto frontier consisting of optimal solutions that are sensitive to model errors and variations that violate the second-order KKT conditions, using the ALWC, designers obtain satisficing solutions that are relatively insensitive to model errors and uncertainties. By applying the ALWC, designers obtain good enough solutions that satisfy the first-order KKT conditions, but may not satisfy the second-order KKT conditions (Guo 2021).

Second, in each iteration, decision support is provided based on the awareness of subsystems and reorganizing them by changing goal combinations.

Third, instead of relying on domain expertise, using the ALWC, designers can use calculations and analyses to manage the multiple components of a system.

We explain how we define and learn the interrelationships among goals in Sect. 2.1, then we introduce how to use the interrelationships to generate and improve goal combinations in Sect. 2.2, and then describe the algorithms in the ALWC in detail in the Appendix.

2.1 Clustering the goals based on their interrelationships

In managing a multi-goal design problem to explore effective combinations of the goals \(\fancyscript{z}(d)\), the interrelationships among the goals, for example, correlations or orthogonality, must be determined. Then, the scalarization function or priority of the goals is determined based on their interrelationships. A scenario, a specific combination of the goals in a multi-goal problem, is defined as a design scenario (DS), that is, an Archimedean approach using a weight vector, or a Pre-emptive approach to prioritize the goals, or a combination of the two strategies. In a design scenario, \({\mathrm{DS}}_{a}\), the merit function is denoted as \({\fancyscript{z}}_{{\mathrm{DS}}_{a}}(d)\), the corresponding satisficing solution is \({x}_{a}^{s}\), and the deviation of Goal k is \({d}_{ak}\). For a K-goal problem, if we solve it using \(A\) design scenarios, \({\left[{\mathrm{DS}}_{1},{ \mathrm{DS}}_{2}, \dots { \mathrm{DS}}_{A}\right]}^{T}\), where \(A\) is a positive integer, we get an \(A\times K\) matrix of deviations, \(\mathcal{D}\); Fig. 1. In the cDSP formulation in Fig. 1, \(\mathcal{P}\) is the set of parameters, t is the vector of the target of the goals, x is the vector of the variables, d is the set of deviations of the goals from their targets, \(\mathcal{F}\) is the feasible area, G(x) is the set of goals, and \({\fancyscript{z}}_{\mathrm{DS}}(d)\) is the merit function of the cDSP either using Archimedean or Pre-emptive, or mixed strategy to manage the goals. The relationships that we learn by analyzing Matrix \(\mathcal{D}\) are an indication of the conditional correlation or conditional orthogonality.Footnote 1 Within the feasible space \(\mathcal{F}\) and under \(A\) DSs, this is the correlation or the orthogonality of the K goals; see Fig. 2. Learning algorithms and cluster analysis methods are the used based on the problem requirements. Here, knowledge discovery and decision support related to the innovation, selection, and customization of learning algorithms and cluster analysis methods are not our focus; however, they can be addressed and improved using the ALWC method.

Fig. 1
figure 1

Using multiple design scenarios to obtain a deviation matrix

Fig. 2
figure 2

Cluster analysis using a deviation matrix

2.1.1 The adaptiveness

To make the learning algorithm generic, we assume no domain knowledge about the interrelationships among the goals. Here, in each iteration, only a limited number of design scenarios are generated, and we update and learn about the interrelationships through iteration. In this sense, our method is adaptive.

2.1.2 Learning the correlations

We learn the correlations from the deviation vector of the goals using angle-based distances. Suppose for a three-goal problem, we use two design scenarios DS1 and DS2 to learn about the goal correlation, and the scalarization of each design scenario is \({\fancyscript{z}}_{{DS}_{i}}\left(d\right), d={[{d}_{k}^{-}, {d}_{k}^{+}]}^{T},\) where \(i=1, 2, k=1, 2, 3.\) The satisficing solution and the deviations are represented in Eqs. 1 and 2

$${x}^{si}=\mathrm{arg}\left(\mathrm{min}\left({\fancyscript{z}}_{{DS}_{i}}\left(d\right)\right)\right),\quad \forall i=1, 2,$$
(1)
$${d}_{ik}=\left|{t}_{k}-{G}_{k}\left({x}^{si}\right)\right|,\quad \forall i=1, 2, \forall k=1, 2, 3.$$
(2)

In Fig. 3, we use a two-dimensional solution space to illustrate the three goals \({G}_{k}(x)\), \(\forall k=1, 2, 3,\) and two satisficing solutions \({x}^{si}, \forall i=1, 2\), for two design scenarios \(\mathrm{DS}i, \forall i=1, 2\). In Fig. 4, we show the deviation points in two-dimensional goal spaces. We normalize the deviation of all design scenarios, \({d}_{k}^{-}, {d}_{k}^{+}\), in the range of [0, 1], where 0 means achieving the target of a goal the and 1 means the worst case of achieving the target of the goal. The coordinate origin O is the utopia point, which means in this scenario, we achieve the target of a goal the best among all design scenarios, either over-achievement or under-achievement. On the contrary, Point I (1, 1) is the worst point, which means in this scenario, we achieve the target of the goal the worst among all design scenarios; in other words, either \({d}^{-}\) or \({d}^{+}\) is the largest in this scenario. When a changing from design scenario DS1 to DS2, the deviations of Goal 1 and Goal 3 change in different directions (Fig. 4a), whereas the deviations of Goal 1 and Goal 2 change in the same direction (Fig. 4b). Therefore Goal 1 and Goal 3 are less correlated, so they belong to two different clusters, while Goal 1 and Goal 2 have a higher correlation, so they belong to a single cluster. Suppose we use i and j to represent two goals – Goal i and Goal j, the acute angle between \(\overline{{D }_{1}^{ij}{D}_{2}^{ij}}\) and the diagonal \(\overline{OI }\) of the goal space, \({\alpha }_{ij}\), indicates the correlation between Goal i and Goal j, so we use an angle-based correlation analysis to understand the interrelationship between the two goals (Eq. 3). As more design scenarios are used, the correlation among goals becomes clearer. For the, A, number of design scenarios

Fig. 3
figure 3

The satisficing solutions to a three-goal cDSP under two design scenarios illustrated in a two-dimensional solution space

Fig. 4
figure 4

The satisficing solutions to a three-goal cDSP under two design scenarios illustrated in two-dimensional goal spaces

\({\alpha }_{ij}\left({\mathrm{DS}}_{\kappa },{\mathrm{DS}}_{\iota }\right)\) is the angle between \(\overline{{D }_{\kappa }^{ij}{D}_{\iota }^{ij}}\) the deviation coordinates of Goal i and Goal j for two design scenarios, \({\mathrm{DS}}_{\kappa }\) and \({\mathrm{DS}}_{\iota }\), average the angle-based correlations are shown in Eq. 4, where \({\mathrm{Cor}}_{A}^{2}\) is the square of the correlation of the design scenario matrix.

\({D}_{ij\_\mathrm{cor}}\) are elements of correlation matrix, which is a type of \({\mathbb{I}}_{\mathcal{D}}\) we use in this paper.

Therefore, we obtain a matrix using angle-based correlation with A design scenarios as in Eq. 5. As we update the design scenarios at each iteration, we update \({\mathbb{I}}_{\mathcal{D}}\). Using \({\mathbb{I}}_{\mathcal{D}}\) to cluster the goals, when the clustering results converge, the iteration is stopped

$${D}_{ij\_\mathrm{cor}}\left({\mathrm{DS}}_{1}, {\mathrm{DS}}_{2}\right)=\mathrm{arc}\left(\mathrm{sin}\left({\alpha }_{ij}\right)\right), \quad \forall i,j=0,\dots K,$$
(3)
$${D}_{ij\_\mathrm{cor}}\left({\mathrm{DS}}_{1}, {\dots ,\mathrm{DS}}_{A}\right)=\frac{1}{{\mathrm{Cor}}_{A}^{2}}{\sum }_{\kappa =1}^{A-1}{\sum }_{\iota =\kappa +1}^{A}\mathrm{arc}\left(\mathrm{sin}\left({\alpha }_{ij}\left({\mathrm{DS}}_{\kappa },{\mathrm{DS}}_{\iota }\right)\right)\right), \quad \forall i,j=0,\dots K,$$
(4)
$${\mathbb{I}}_{\mathcal{D}}\left(\mathrm{correlation}, {\left[{\mathrm{DS}}_{1},\dots {\mathrm{DS}}_{A}\right]}^{T}\right)=\left({D}_{ij\_\mathrm{cor}}\left({\mathrm{DS}}_{1}, {\dots ,\mathrm{DS}}_{A}\right)\right).$$
(5)

2.1.3 Learning about orthogonality

When learning about the correlation among the goals, we treat the deviation vector of the goals as statistical variables and analyze the correlations among these variables. When learning about the orthogonality among the goals, we obtain the relative position of goals in geometric space, under the constraints of the feasible region \(x\in \mathcal{F}\), using the dot-product of the deviation vector of any two goals, see Eq. 6. In Fig. 4a and b, we show the dot-product of the deviation vector of two goals, Goal i and Goal j, and for two design scenarios, DS1 and DS2. The dot-product is calculated as in Eq. 7 for the Euclidean distance, or 2-norm, that is when \(p=2\) in Eq. 7. If under multiple design scenarios, the dot-product is zero, the deviation vectors of the two are orthogonal, so we conclude that the two goals are orthogonal under these design scenarios—one way of representing this is that the projection of the two goals onto a two-dimension space is perpendicular; see Fig. 5a. If the dot-product is relatively large, the two goals are closer to being parallel than being orthogonal; see Fig. 5b. Then, \({D}_{ij\_\mathrm{ort}}\) are elements of the orthogonality matrix, which is another type of \({\mathbb{I}}_{\mathcal{D}}\) used in this paper; therefore, we obtain the matrix using orthogonality for A design scenarios, Eq. 8

Fig. 5
figure 5

The orthogonality between the deviation vectors of two goals for two design scenarios

$$\left\{<\mathrm{Goal} i, \mathrm{Goal} j>|x\in \mathcal{F}\right\}={\int }_{x\in \mathcal{F}}\overline{{G }_{i}\left(x\right)}{G}_{j}\left(x\right)dx\approx {D}_{i{j}_{\mathrm{Ort}}}\left({\mathrm{DS}}_{1}, {\dots ,\mathrm{DS}}_{A}\right)=\left[{d}_{1i},\dots {d}_{Ai}\right]\cdot {\left[{d}_{1j},\dots {d}_{Aj}\right]}^{T},$$
(6)
$${D}_{i{j}_{\mathrm{Ort}}}\left({\mathrm{DS}}_{1}, {\dots ,\mathrm{DS}}_{A}\right)={\left({\sum }_{\kappa =1}^{A}{{d}_{\kappa i}}^{p}\right)}^\frac{1}{p}{\left({\sum }_{\kappa =1}^{A}{{d}_{\kappa j}}^{p}\right)}^\frac{1}{p}\mathrm{cos}\left({\theta }_{ij}\left({\mathrm{DS}}_{1}, {\dots ,\mathrm{DS}}_{A}\right)\right),$$
(7)
$${\mathbb{I}}_{\mathcal{D}}\left(\mathrm{orthogonality}, {\left[{DS}_{1},\dots {DS}_{A}\right]}^{T}\right)=\left({D}_{ij\_\mathrm{ort}}\left({\mathrm{DS}}_{1}, {\dots ,\mathrm{DS}}_{A}\right)\right).$$
(8)

2.1.4 Cluster analysis based on the interrelationship matrix of the goals

In this paper, we use multiple clustering algorithms and use cross validation to ensure clustering the results rationally. There are criteria for designers to choose appropriate clustering algorithms, for example, the sensitivity of the clustering results to the sample size (the number of design scenarios), and the improvement of goal achievement and diversity of the solutions by leveling the goals using a clustering result. Next, we demonstrate the feasibility of further improving the selection of clustering algorithms.

2.2 A schematic of the ALWC method

The ALWC includes two loops; see Fig. 6. Leveling-Weighting-Clustering is the outer loop, and Weighting-Clustering is the inner loop. In the outer loop, leveling starts with the clustering result \(\{C, \mathrm{Cluster}\}\) from the previous iteration.Footnote 2 In the first iteration, leveling starts with the initialized single cluster and all goals are set to a single level. In later iterations, when there is more than one cluster from the previous clustering, each cluster is set to a level. After running all processes in this leveling setting, including the weighting and clustering in the inner loop, the levels are alternated, that is, the goals in each cluster are alternately set to a different level. When the inner loop is entered, the goals in each level are combined using weight vectors. After being solved using the ALP (see definition in Glossary), the deviation matrix, \(\mathcal{D},\) is obtained, and the interrelationship matrix, \({\mathbb{I}}_{\mathcal{D}},\) is obtained and then can be used for cluster analysis. Then, a new \(\{C, \mathrm{Cluster}\}\) is returned to update leveling for the next iteration. The inner loop stops generating more weight vectors when the diversity of \({\mathbb{I}}_{\mathcal{D}}\) shows little increase, that is \(\sigma \left({\mathbb{I}}_{\mathcal{D}}\right)\le \varepsilon ,\) where \(\varepsilon\) is a threshold heuristically determined as more weight vectors are used. The outer loop stops leveling the goals based on the latest clustering result \(\{C, \mathrm{Cluster}\}\) if it has not changed in the previous \(\eta\) iterations, where \(\eta\) is a positive integer determined heuristically. Convergence is obtained when the outer loop stops,

Fig. 6
figure 6

The flowchart of the adaptive leveling-weighting-clustering (ALWC) loop

Algorithms for Steps 2 Leveling, 3 Weighting, and 5 Clustering are given in detail in the Appendix. Steps 1 and 4 are existing methods, and for details of modeling and solution, users may refer to (Mistree et al. 1981, Mistree et al. 1993). The algorithms for calculations are in the Appendix.

We use a test problem, a thermal system design, to determine the effectiveness of the ALWC method; see Sects. 3 and 4.

3 A test problem in thermal system design

3.1 A steam-based Rankine cycle

To demonstrate the use of the ALWC method in managing a multi-goal, design problem, with an example in the design of a thermal system. The problem was first published in (Smith et al. 2015). We select this problem, because it is a multi-component, engineering-design problem with six goals. Like some other problems with nonlinear, nonconvex equations and goals with different units, when formulating the problem as an optimization model and solving it using optimization methods, we fail to identify any feasible solutions. Therefore, we demonstrate how to manage it using the cDSP as the formulation construct, the ALP as the solution algorithm (Mistree 1993), and the ALWC as the multi-goal exploration method.

There can be various applications for small scale “power” plant systems that run small generators to produce electricity or to directly use the power produced. For example, these systems may provide power to irrigation equipment, drive reverse osmosis systems for fresh water for underdeveloped areas, and generate electricity for use in small collectives.

Building a system around a steam-based Rankine cycle is a common approach for designing power plants for these small scale applications if there is an available heat source. The Rankine cycle is a theoretical representation of a heat engine that converts heat into mechanical work while undergoing phase change (Macquorn Rankine 1853; Wikipedia 2019); see Fig. 7. The major system components of the system are a turbine that produces power, a pump, a heat exchanger, and a condenser.

Fig. 7
figure 7

The thermal system

In this process, water is first compressed by a feed-water PUMP, and then boiled and superheated in the HEAT EXCHANGER, before being expanded through a TURBINE, which turns an electric generator, Fig. 7. The low-pressure steam is then condensed in a CONDENSER and fed back to the feed-water PUMP to be reused, Fig. 7. Thus, there are four components in the Rankine cycle thermal; it is natural for designers to treat systems independently, assuming tradeoff relationships among them solving the model or for post-solution analysis. However, relying on such domain knowledge may or may not effectively lead to robust design.

From the perspective of decision support and design improvement, such a thermal system presents complexity and dilemmas to be managed and resolved. We consider heat source issues (heat through the heat exchanger), turbine power out, and the choice of working fluids.

The performance is determined by the cycle’s maximum and minimum pressures and maximum temperature (PMAX, PMIN and TMAX). Energy is transferred to the closed loop Rankine cycle through a heat exchanger. The heat exchanger is assumed to be counter flow with the key characteristic of the maximum temperature of the heating flow (TMAXE).

The ideal Rankine cycle involves four processes, as shown graphically in the Temperature (T) versus Entropy (S) plot in Fig. 8. The Rankine cycle consists of 4 cycles, Fig. 9:

Fig. 8
figure 8

Rankine cycle (temperature vs entropy)

Fig. 9
figure 9

The box plots of the deviations of the six goals using the Pre-emptive strategy, the Archimedean strategy, and the ALWC

①-② adiabatic pumping of the saturated liquid from PMIN to PMAX.

②-④ isobaric heat addition in the heat exchanger to TMAX,

④-⑤ adiabatic expansion in the turbine from PMAX to PMIN producing power with the possibility of wet steam exiting the turbine, and

⑤-① isobaric heat loss in the condenser.

The isothermal segments represent moving from saturated liquid to saturated vapor in the case of ③ in the heater and the reverse in the condenser between ⑤-①. The key thermodynamic properties of the working fluid(s) are determined using REFPROP (Lemmon and Huber 2013). The basic features of the problem are: four decision variables, one linear constraint, nine nonlinear inequality constraints, and six nonlinear goals.

3.2 Formulating the Rankine cycle in the compromise decision support problem

The formulation of the compromise Decision Support Problem (cDSP) is as follows.

figure a
figure b
figure c

3.3 Limitations in the formulation and results in the original paper

There are limitations of the method used in Smith et al. (2015). Using the Pre-emptive approach, the six goals are placed in six priority levels. However, by prioritizing the goals differently, comparisons may show competing goals driving the solution in different directions. Using the Archimedean approach, the six goals are grouped at the same level and linearly combined using weights. There can be a mixture of the Pre-emptive and Archimedean approaches, which may organize the components based on their interrelationships, correlation or orthogonality. As more mixed strategy scenarios are explored, the understanding of the subsystems and the interrelationships among them can evolve. If designers change the model formulation and post-solution analysis, the solution space can be explored sufficiently, but in the original paper (Smith et al. 2015), the authors do not consider this. Therefore, we address the exploration of subsystems and reorganize them to improve the system performance of energy efficiency as well as to obtain knowledge on the nature of the system.

With the ALWC method, knowledge about the goals can be explored, captured, and reused in other problems. Here, we assume there is no knowledge of interrelationships of the goals or subsystems, or that the intuitive divisions of the system (four components—the turbine, condenser, pump, and heat exchanger) may be wrong. We use the ALWC method to learn about the subsystems and organization.

4 Results and discussion

4.1 Clustering result

By running the ALWC loop and applying different interrelationship methods and clustering methods, we obtain clustering results iteratively and list them in Table 2. For each interrelationship method, although only one clustering result converged, while running the ALWC loop, different clustering results are found. There are three clustering results which can be used to update leveling. They are summarized in Table 3. We assign some tentative meanings of the clusters attempting to interpret the cluster results intuitively. The intuitive interpretations may or may not be correct, so we shall not completely rely on them to evaluate and select one cluster result above the others. Our aims in processing the ALWC loop include identifying the various cluster results, obtaining more solutions that better complete the goals or improve the diversity of the tradeoff scenarios among the completion of goals, and discovering knowledge for (interrelationship measurement) method selection. Therefore, we use all cluster results and solutions in all iterations to enlarge the solution pool for decision support.

Table 2 The clustering results from the iterations
Table 3 The summary of the cluster results returned to update the leveling

4.2 Improvement in goal achievement during the design scenario expansion

For a multi-goal problem, because the tradeoff between two goals often affects other goals, there are many nondominated and weakly dominated solutions; therefore, it is ineffective to use "solution domination" to rank the solutions. We use statistics to determine whether the results have been improved and enriched by iterating. In Table 4, we show the mean, standard deviation, best (minimum), and worst (maximum) deviations of each of the six goals, for each clustering scenario, and highlight the best case among all clustering scenarios. To ensure the results from different clustering scenarios are comparable, we use the actual deviation values instead of normalized values. For the 1-level scenario, we use the results of the 71 weight vectors to obtain the statistics, because these 71 weight vectors include the 6 and 21 weight vectors in previous iterations of the inner loop. For all the other three clustering scenarios, as each of them has 22 design scenarios, we use them to calculate statistics. For example, for Goal 1, among the means of each of the four clustering scenarios, the best (smallest) value 0.04 occurs in the third clustering scenario. The numbers in Table 4 have been rounded, but the longer computed numbers have been used for the comparison and the best ones are highlighted.

Table 4 Statistics of the results

By leveling the goals using three clustering scenarios, the mean, standard deviation, and the worst-case results of the deviations of five goals excluding Goal 1 are improved. The worst case of the sum of all goals is improved. These observations indicate that by clustering and leveling the goals using the ALWC method, we identify better design scenarios for reducing the deviation (or improving the achievement) of most goals. In Fig. 9, we give box plots of the six goals under the Pre-emptive strategies (Fig. 9a), Archimedean or weighted sum strategy (Fig. 9b), and the ALWC (Fig. 9c). The vertical axes are the values of deviation variables. The six boxes in each graph are the box plots of the six deviations of all the scenarios used in each strategy. Shorter boxes close to the horizontal axes are preferred. We conclude that using ALWC, the deviations of Goals 1, 4, and 6 are substantially decreased.

Other statistics can be used to evaluate the iterative results from different perspectives. Designers may select or develop customized statistics based on the characteristics of specific problems.

4.3 Reducing the Euclidean distance to the Utopia point

For a multi-goal design problem, goals may represent the performance of various subsystems of the design. It is possible that the improvement of one goal results in a greater loss in another goal or several other goals; see Fig. 10, as Goal 3 is improved 20% at \({D}_{2}^{123}\) versus \({D}_{1}^{123}\), Goal 1 and 2 are worse by 80%. This may be desired in some situations, but more often than not, designers would rather avoid this, because overall performance may not be enhanced practically by improving a single subsystem. Therefore, we use the Euclidean distance to the Utopia point of the deviations, to evaluate the comprehensive performance of the achievement of all goals. Equation is the Euclidean distance to the Utopia point of the result from Design Scenario \(a\) for a K-goal problem. We statistically evaluate the evolution of the Euclidean distance and summarize it in Table 5. Using the clustering scenarios obtained during iteration, the mean, standard deviation, and the worst case of the Euclidean distance to the Utopia point are improved, but the best case is not improved. All the design scenarios and corresponding results from all iterations are added to the solution pool for designers to select. Designers may customize their post-solution analyses for further decision support on design scenario generation and selection

Fig. 10
figure 10

An example of improving Goal 3 by 20% while worsening Goal 1 and Goal 2 by 80%, respectively

Table 5 Statistics of the Euclidean distance to the Utopia point of the results under each clustering scenario. The Euclidian distance is the length of the line segment between the two points.
$${\left|O{D}_{a}\right|=\left(\sum_{k=1}^{K}{{d}_{ak}}^{2}\right)}^\frac{1}{2}.$$
(9)

4.4 Reducing computational complexity

Smith and coauthors explore the leveling of all six goals and simplify the scenarios using their expertise. The theoretical number leveling scenario is \(6!=720\), but they select 15 scenarios that they think are most representative (Smith et al. 2015). If designers use weight vectors to explore the combination of the goals, the number of weight vectors is based on how many pieces they divide the goal weight in the range of [0, 1]; see Eq. 11

$$H=\left(\begin{array}{c}6+p-1\\ p\end{array}\right),\mathrm{ for }p=\mathrm{2,3},\mathrm{4,5},6\dots , H=21, 56, 126, 252, 462\dots .$$
(10)

For a K-goal problem, we explore combinations of the goals (design scenarios) using a mixture of Pre-emptive and Archimedean schemes and enumerate all possible design scenarios. These scenarios are necessary due to the lack of domain knowledge about the interrelationships among the goals or due to missing specific design preferences about tradeoffs among the goals. We need to explore \(\Lambda\) design scenarios where \(\Lambda\) in the range defined in Eq. 12, \(\kappa\) is the number of levels of the K goals, \(\frac{K}{\kappa }\) is the average number of goals at each level on average (if it is not be an integer so it is rounded down), and \(p\) is the number of pieces that each goal’s weight is divided into when the weights are assigned. Hence, the computational complexity of enumerating the mixture of Pre-emptive and Archimedean design scenarios is shown in Eq. 13

$${\sum }_{\kappa =1}^{K}\kappa !\cdot \left(\begin{array}{c}\lfloor\frac{K}{\kappa }\rfloor+p-1\\ p\end{array}\right)\le\Lambda \le {\sum }_{\kappa =1}^{K}\kappa !\cdot \left(\begin{array}{c}K+p-1\\ p\end{array}\right),$$
(11)
$$O\left(K!\cdot {K}^{p-1}\right).$$
(12)

Using the ALWC, the goals are leveled based on clustering results and a specific number of levels are obtained. Using these levels, we do not need to enumerate all scenarios. For each clustering scenario, the number of leveling scenarios is reduced to \(3!=6\). Within each level, the goals are combined using weight vectors. The stopping criteria of weight vectors generation (Line 5 of Algorithm 2) help to prevent designers from using many unnecessary weight vectors. Using the angle-based correlation method to calculate an interrelationship matrix, we explore 142 design scenarios in three iterations and converge. Using the orthogonality method, we then explore 71 design scenarios in three iterations and converge.

For a K-goal problem, if we explore design scenarios using the ALWC, a number of clusters, \(\kappa\), can be identified, based on the goals’ interrelationships, and on average, there are \(\frac{K}{\kappa }\) goals in each cluster and we need \(\Lambda {^{\prime}}\) design scenarios; see the range of \(\Lambda {^{\prime}}\) in Eq. 14. The computational complexity of using ALWC is shown in Eq. 15. When \(>\kappa\) and \(\kappa >1\), \(\Lambda {^{\prime}}\) is smaller than \(\Lambda\) (Eq. 15), we do not need to exhaustively study the cases when \(\kappa\) is equal to all integers between 1 and K

$$\kappa !\cdot \left(\begin{array}{c}\lfloor\frac{K}{\kappa }\rfloor+p-1\\ p\end{array}\right)\le {\Lambda }^{\mathrm{^{\prime}}}\le \kappa !\cdot \left(\begin{array}{c}\lceil\frac{K}{\kappa }\rceil+p-1\\ p\end{array}\right),$$
(13)
$$O\left(\kappa !\cdot {\left(\frac{K}{\kappa }\right)}^{p-1}\right),$$
(14)
$${\Lambda }^{\mathrm{^{\prime}}}<\Lambda , \mathrm{if} K>\kappa \mathrm{and }\kappa >1.$$
(15)

4.5 Verification of the results

4.5.1 The results are verified using domain knowledge

In all three clustering scenarios, Goals 2 and 4 are always in one cluster, and Goals 3 and 5 are always in one cluster, whereas the clustering result of Goal 1 and Goal 6 change in various iterations. This implies that Goals 2 and 4 are strongly correlated or weakly orthogonal, as are Goals 3 and 5; the relationships between Goals 1 and 6 and the other goals are not significant. The clusters represent the subsystems, which verify the clustering result; see Table 6. Goals 2 and 4 are related to the efficiency of the Rankine cycle and the system efficiency indicator 1 is to increase the Rankine cycle efficiency. Goals 3 and 5 represent heat exchange efficiency. The temperature exchanger and heat transfer work are synergetic in the working system. Goal 1 is about the moisture in the turbine, and it sometimes forms a single cluster and sometimes is clustered with Goals 2 and 4. Our domain knowledge confirms that when the moisture in the turbine is low, the Rankine cycle is very efficient, but the turbine is a relatively isolated subsystem. Goal 6 is sometimes a single cluster and sometimes clustered with Goals 3 and 5. These goals all deal with the efficiency of the heat exchanger; Goals 3 and 5 are more about the temperature, whereas Goal 6 deals with the liquid flow. However, in one clustering scenario, Goals 1 and 6 are in one cluster. The scatter plots of any two goals help us to interpret this phenomenon, Fig. 11. The deviations of Goal 1 and Goal 6 are relatively small and close to the Utopia point O (Fig. 11e) in comparison with the other two-goal plots. Even after normalizing the deviations in the range [0, 1], Goals 1 and 6 are weakly correlated. However, when such a clustering scenario was used in the next iteration, the clustering results “returned to normal.” This indicates that iteratively clustering and updating is necessary, because it can expand the sample size and remove bias.

Table 6 Meaning of the three clusters
Fig. 11
figure 11

Scatter plots of any two goals using deviations of 1-level, 21 weight vectors—the vertical and horizontal axes of each sub-figure represent the deviation of two goals, respectively. For example, in Fig. 12a, the vertical axis is d2 and the horizontal axis is d1

Using domain knowledge, the clustering result has been verified for this thermal system design problem. For multi-goal problems in which domain knowledge is missing, the ALWC method can assist designers in identifying the interrelationships among the goals. With the ALWC, more design scenarios regarding the combination forms of goals are explored based on their interrelationships. These design scenarios and deviations of the goals are added to the solution pool for designers to select among to satisfy different requirements, improve the problem formulation, and enable further analysis to obtain a better understanding of the interrelationship among subsystems. The ALWC method can be used as a tool to partition a design problem, especially when information is incomplete.

5 Closing remarks

In this paper, we develop a domain-independent method to capture and reuse the knowledge of a multi-goal engineering-design problem to facilitate the exploration and selection of design scenarios. To accomplish this, the Adaptive Leveling-Weighting-Clustering (ALWC) for exploring multiple design scenarios is proposed. We use a thermal system test problem to illustrate the effectiveness of the ALWC method.

Using the ALWC method, with increasing weight vectors, the interrelationships among goals based on their deviations, or achievement rates evolve and converge. Based on their interrelationships, goals are grouped into clusters to represent different subsystems. The combinations of the goals are explored iteratively, using either the Pre-emptive or Archimedean strategy. This facilitates the assignment of each cluster a different level (leveling) and a combination of the goals in each level using weight vectors (weighting). Through iteration, more design scenarios are identified, and corresponding solutions are obtained for designers to choose the appropriate design scenario and then improve the design. As a tool to acquire insight when domain knowledge is lacking, the combination of the goals is explored, so that better solutions regarding the average deviations, standard deviations, worst case, and the Euclidean distance to the Utopia point are identified and computational complexity is reduced.

The algorithms embodied in the ALWC method can be extended, modified, or customized for specific requirements. When there is insufficient expertise to support decision-making, subsystem division and tradeoffs, and design improvement, the knowledge discovered using the ALWC method can be used to explore the ways that may contribute to design improvement. Our assumption in using the ALWC algorithm is that the domain expertise on the interrelationships among the goals is missing and the solutions can accurately reflect their interrelationships. If domain expertise is sufficient to enable decision-makers to determine subjective preferences, the ALWC may not deliver a better solution regarding computational complexity or satisfying multiple requirements.