A novel graph-theoretical clustering approach to find a reduced set with extreme solutions of Pareto optimal solutions for multi-objective optimization problems

Kahagalage, Sanath; Turan, Hasan Hüseyin; Jalalvand, Fatemeh; El Sawah, Sondoss

doi:10.1007/s10898-023-01275-y

A novel graph-theoretical clustering approach to find a reduced set with extreme solutions of Pareto optimal solutions for multi-objective optimization problems

Open access
Published: 07 March 2023

Volume 86, pages 467–494, (2023)
Cite this article

Download PDF

You have full access to this open access article

Journal of Global Optimization Aims and scope Submit manuscript

A novel graph-theoretical clustering approach to find a reduced set with extreme solutions of Pareto optimal solutions for multi-objective optimization problems

Download PDF

Sanath Kahagalage ORCID: orcid.org/0000-0003-3873-8660¹,
Hasan Hüseyin Turan¹,
Fatemeh Jalalvand² &
…
Sondoss El Sawah¹

1849 Accesses
5 Citations
Explore all metrics

Abstract

Multi-objective optimization problems and their solution algorithms are of great importance as single-objective optimization problems are not usually a true representation of many real-world problems. In general, multi-objective optimization problems result in a large set of Pareto optimal solutions. Each solution in this set is optimal with some trade-offs. Therefore, it is difficult for the decision-maker to select a solution, especially in the absence of subjective or judgmental information. Moreover, an analysis of all the solutions is computationally expensive and, hence, not practical. Thus, researchers have proposed several techniques such as clustering and ranking of Pareto optimal solutions to reduce the number of solutions. The ranking methods are often used to obtain a single solution, which is not a good representation of the entire Pareto set. This paper deviates from the common approach and proposes a novel graph-theoretical clustering method. The quality of the clustering based on the Silhouette score is used to determine the number of clusters. The connectivity in the objective space is used to find representative solutions for clusters. One step forward, we identify ‘extreme solutions’. Hence, the reduced set contains both extreme solutions and representative solutions. We demonstrate the performance of the proposed method by using different 3D and 8D benchmark Pareto fronts as well as Pareto fronts from a case study in Royal Australian Navy. Results revealed that the reduced set obtained from the proposed method outperforms that from the K-means clustering, which is the most popular traditional clustering approach in Pareto pruning.

A Novel Graph-Theoretical Approach of Selecting Representative Pareto Optimal Solutions for Multi-objective Optimization Problems

Finding Robust Pareto-optimal Solutions Using Geometric Angle-Based Pruning Algorithm

NSLS with the Clustering-Based Entropy Selection for Many-Objective Optimization Problems

Use our pre-submission checklist

Avoid common mistakes on your manuscript.

1 Introduction

Most real-world problems usually have multiple and conflicting objectives. Therefore, multi-objective optimization methods are of great importance. Multi-objective optimization methods provide a single solution, a few solutions, or Pareto optimal solutions (Pareto set) depending on the solution approach. However, Pareto optimal solutions give a complete picture to the decision-maker. Therefore, this research focuses on the methods, which produce Pareto optimal solutions.

A multi-objective decision-making process can be divided into three sub-steps, starting from the selection of a solution approach to the final decision-making. These steps include generation of the Pareto optimal solutions based on the utilized optimization algorithm, visualization of both Pareto solutions and corresponding decision variables and, lastly, analysis and decision-making as shown in Fig. 1. In the remainder of this section, we will mainly focus on the last step (i.e., analysis and decision-making) and discuss methods utilized in this step together with the advantages and disadvantages of each.

As all of the solutions in the Pareto set are optimal with some trade-offs and considered equally good [39], it is often challenging for the decision-makers to select the solution(s), especially in the absence of subjective or judgmental information [10]. Not to mention the time complexity of analyzing all the Pareto optimal solutions. Even in the presence of subjective or judgmental information, the decision depends on the experience of the decision-maker. Thus, methods to find a reduced (small) set that captures the diversity of the Pareto optimal solutions and hence, a reasonable representation for the entire Pareto set, are important [10, 29, 48]. Researchers have proposed several methods to group (cluster) Pareto set based on the similarity of the values of objective functions [54]. Then, one representative solution for each cluster is chosen to form the reduced set. However, the reduced set obtained from the traditional clustering methods may not cover the entire Pareto optimal solutions. This is due to the representative solutions may not find extreme solutions (refereeing to solutions with the global minimum or maximum of objective function values). Thus, a method to find a reduced set that truly represents the entire Pareto optimal solutions (including extreme solutions), is needed.

There are several approaches to find a reduced set for the Pareto set such as a-priori, interactive, and a-posteriori methods [31, 32]. A-priori methods often lead to a single solution and depend on the decision-makers preference as the decision-makers impose their preference before the optimization. In the interactive methods, a reduced set is computed at each iteration to ultimately obtain a small set of Pareto optimal solutions [48, 54]. In the post Pareto analysis, first, the entire Pareto set for the multi-objective problem is obtained and then the reduced set is computed [4, 10, 41, 49]. Among these methods, a-posteriori methods give an overall picture to the decision-makers [30]. Therefore, in this paper, we also focus on the post Pareto analysis as depicted in Fig. 1. Accurate decisions are needed in every decision context and the nature of the decision determines the level of accuracy required. To make a better decision, the decision-maker needs the full picture. Thus, it is important to obtain a reduced set with small cardinality and is a better representation for the entire Pareto set (i.e., a set with a minimum number of Pareto optimal solutions such that it captures the diversity of the entire Pareto set).

Among the approaches used to find a reduced set of Pareto optimal solutions for the multi-objective optimization problems, the most common approaches that researchers used are ranking based hierarchical approaches or clustering-based approaches such as K-means clustering [5, 10, 41]. K-means is the most popular traditional clustering approach in Pareto pruning [34]. In the ranking based hierarchical approach, a preference is given to one objective over another objective in the pairwise comparison, and a score (weight) is assigned [5, 37] (see also Step 3 in Fig. 1). Then, all the Pareto optimal solutions are ranked based on this score. However, the problem with this approach is that the preference depends on the application and the experience of the decision-maker. As all the Pareto fronts are considered equally good with some trade-off, the final solution obtained from the ranking may not represent the entire Pareto set. What is needed for multi-objective problems is a reduced set which truly represents the entire Pareto set [54]. To achieve this, several clustering algorithms have been employed. Moreover, some researchers have proposed Hypervolume indicator (HYP), which is a widely-used performance indicator for multi-objective evolutionary algorithms (MOEAs), for Pareto pruning [23, 34]. HYP is also used to measure the diversity and efficiency in the solutions in the reduced set [34, 38].

In clustering methods, a representative solution is chosen based on the solution at the cluster center or one closest to the hypothetical best solution (the ideal solution) in the respective cluster [10, 54] (see also Step 3 in Fig. 1). On one hand, the selection of an optimal number of clusters is ambiguous. On the other hand, the reduced set may not include extreme solutions. However, it is also important to analyze extreme solutions [35, 40]. The Silhouette score is proposed to overcome the ambiguity of determining the optimal number of clusters [9]. In this paper, we show how K-means clustering can be combined with the graph theory to capture a better-reduced set, even though we use K-means as a benchmark method for purely comparison purposes. That is, we also use results from K-means to show the importance of adding extreme solutions to the reduced set. This will enhance a reduced set when the reduced set is obtained via clustering. The Silhouette score, which indicates the quality of the clustering, is used to determine the optimal number of clusters within the given upper bound. Then, we use graph-theoretical properties based on the connectivity of the network (graph) that is created for the objective space to extract the representative solution for each cluster. Besides, this also allows us to find extreme solutions. Therefore, the proposed method gives a better-reduced set of Pareto optimal solutions to decision-makers to make their decision with ease. Moreover, the proposed method is independent of subjective decisions and can be used by a non-experienced decision-maker.

In the HYP-based pruning method, multiple solutions or a single solution is used to calculate the Hypervolume [16, 23, 34, 38]. The idea behind this method is to maximize the HYP, i.e., a set of solutions or a solution, which maximizes the Hypervolume, constitutes the pruned set (reduced set). Even though boundary solutions are included in the reduced set, the HYP-based pruning method does not automatically assign non-selected solutions to each of the representative solutions and requires an additional step if the decision-maker wanted to further investigate neighboring solutions of his/her selected solution [34]. This is not the case for clustering methods and the inherent grouping of non-selected solutions is a major advantage of the clustering method over the HYP-based method. As boundary solutions are included in the pruned set from the HYP-based pruning, it is attractive for the decision-maker who is interested in extreme solutions before finalizing the decision [34]. However, the HYP-based method can be very computationally expensive for multi-objectives problems and can be a burden as it requires determining the Hypervolume for every single solution or combination of solutions [55]. Therefore, to this end, this paper proposes a method that can effectively find a reduced set of Pareto sets, which includes extreme solutions as well.

Recently, both K-means clustering and the proposed graph-theoretical clustering, which is based on Gomory–Hu trees (GHT), have applied to predict the failure location on man-made and natural slopes as well as small scale data from DEM simulation [20, 42, 43, 52]. The results revealed that the proposed-graph theoretical clustering outperforms K-means clustering in the sense of grouping points with similar properties. On the other hand, K-means clustering alone may not find the extreme solutions. The GHT algorithm is commonly used in graph partitioning and graph clustering [14, 18], image segmentation [25], webpage segmentation [26], social network analysis [6], biological data analysis [33, 45]. Motivates from these, we propose this graph-theoretical clustering method to find a better-reduced set of Pareto optimal solutions (see also Step 3 in Fig. 1). One step forward to the traditional clustering methods, the proposed clustering method can identify extreme solutions.

First, we apply the proposed method to the benchmark Pareto optimal fronts for different optimization problems reported by [11, 12] to test the performance in general. These Pareto fronts are available at^{Footnote 1}. Next, we apply the proposed methods to the Pareto optimal solutions obtained from a realistic case study from the Royal Australian Navy to gain more decision-making insights into the case study in defence [47]. To the best of the authors’ knowledge, there is no existing post-Pareto analysis that gives the promising reduced set to decision-makers in the context of the military application as we will discuss in Sect. 2. Our main contribution is the proposed graph-theoretical clustering method which can yield to a better-reduced set. Moreover, the proposed method of finding representative solutions can be unitized to find extreme solutions. We combine this method of finding extreme solutions with a K-means clustering to find an improved reduced set. This is to show the importance of adding extreme solutions with the results from clustering approaches. Results from all the test cases revealed that the reduced set obtained from the proposed method better represents the entire set.

The contributions of this paper are mainly threefold.

1.
The paper proposes a novel graph-theoretical clustering method that uses Gomory–Hu trees, to find a reduced set of Pareto optimal solutions. This set includes both representative solutions found for each cluster and extreme solutions. Hence, it captures the diversity of the entire Pareto set. Thus, this set is a better representation of the entire Pareto set. Further, the proposed methods can be applied to any multi-objective problem.
2.
In this paper, we propose a novel method to find representative solutions from each cluster based on the connectivity of the graph. This method can also identify extreme solutions. Therefore, this method can be combined with traditional K-means clustering to obtain a better-reduced set.
3.
According to the best of the authors’ knowledge, this is the first attempt to use Gomory–Hu trees for the post-Pareto analysis to obtain a better-reduced set in the field of military. Note also that there is no existing literature for post-Pareto analysis in the military facility location problem, which integrates workforce planning and capacity allocation.

The rest of the paper is organized as follows. In Sect. 2, we present the current state-of-the-art knowledge of reducing Pareto optimal solutions to a small set which is then used to support decision-makers. In Sect. 3, we discuss the proposed method of finding a reduced set of the Pareto optimal solutions and other relevant background information. In Sect. 4, we show the performance of the proposed method with benchmark Pareto fronts. We present and discuss the case study, multi-objective model, solution approach and results in Sect. 5 to give some decision-making insights before the concluding remarks are given in Sect. 6.

2 Literature review

Researchers are continuously proposing new approaches to improve the quality (referring to being able to represent the entire Pareto set) of the reduced set. For example, SPEA2 is proposed by Zitzler et al. [53] to improve the reduced set obtained by SPEA. The reduced set obtained by the clustering method in SPEA may not keep the outer solutions even though it preserves the characteristics of nondominated set [53]. To overcome this, SPEA2 proposed in [53] uses new archive trancation method. In archiving, we commonly examine the problem of maintaining an approximation of the set of nondominated points visited during a multiobjective optimization [27]. However, this paper proposes a novel method to find a reduced set of Pareto optimal solutions (see flowchart in Fig. 3) based on Gomory–Hu trees, which always preserve the extreme solutions. Moreover, the proposed method finds the optimal number of clusters within the given upper bound as we will explain in Sect. 3. Finding a reduced set with small cardinality to represent the entire Pareto set for the multi-objective optimization problem is not new. In Table 1, we present a selective but, comprehensive literature covering the different methods of obtaining a reduced set and application domain. This section serves for the purpose of highlighting research gaps and the novelty of the proposed method.

Table 1 Academic literature on obtaining a reduced set of Pareto optimal solutions

A novel graph-theoretical clustering approach to find a reduced set with extreme solutions of Pareto optimal solutions for multi-objective optimization problems

Abstract

Similar content being viewed by others

A Novel Graph-Theoretical Approach of Selecting Representative Pareto Optimal Solutions for Multi-objective Optimization Problems

Finding Robust Pareto-optimal Solutions Using Geometric Angle-Based Pruning Algorithm

NSLS with the Clustering-Based Entropy Selection for Many-Objective Optimization Problems

1 Introduction

2 Literature review

3 Method

3.1 Construction of a weighted contact network (graph) \(G^*\) for Pareto optimal solutions

3.2 Gomory–Hu tree based clustering method

Definition 1

3.3 Proposed way of selecting representative solutions

4 Performance of the proposed method on benchmark data

4.1 Results from the proposed method

4.2 Results from the K-means clustering

4.3 Performance comparison

5 Application with a case study

5.1 The case study

5.1.1 Multi-objective model

5.1.2 Solution approach

5.1.3 Pareto optimal solutions

5.2 Results

5.2.1 Results from the proposed method

5.2.2 Results from K-means clustering

5.2.3 Performance comparison

6 Conclusions, limitations and future research

Data Availability

Notes

References

Acknowledgements

Funding

Author information

Authors and Affiliations

Corresponding author

Ethics declarations

Conflict of interest

Additional information

Publisher's Note

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Search

Navigation