Introduction

In the analysis of electrical distribution systems with reconfigurable structure and radial operation, for large networks the number of possible radial network configurations (calculated from the Kirchhoff’s matrix tree theorem) may be so high to make exhaustive search of all the possible radial configurations practically infeasible. As such, many deterministic and probability-based methods have been exploited to find out the configurations leading to pseudo-optimal solutions for a given objective function or for multi-objective functions [1,2,3]. In particular, multi-objective approaches have been applied to address distribution system reconfiguration by taking into account the trade-off among conflicting objectives. Starting from early contributions [4, 5], different combinations of objectives have been considered by various authors. The systems losses have been typically used in combination with other objectives such as load balancing, reliability indices, number of switching operations, branch current loading, emissions, and voltage deviations. Multi-objective reconfiguration is an effective analysis and decision-making tool for choosing the best configuration that can be operated by considering a given set of load and generation patterns.

The multiple objectives have been handled with different approaches, including:Footnote 1

  • the weighted sum of the individual objectives, using per-unitized individual objectives in order to deal with non-commensurable quantities [4]; limitations on the use of the weighted sum approach are indicated in [6];

  • the use of membership functions in the fuzzy domain, maximizing the fuzzy satisfaction of the optimization of the individual objectives [7];

  • Pareto front analysis, based on the notion of non-dominated solutions for which no other solution exists with better values for all the individual objectives [8];

  • Grey correlation analysis, with the identification of problem-independent superior and inferior solutions for each objective [9].

Among these approaches, the Pareto front analysis is particularly effective because it handles non-commensurable objectives without requiring additional hypotheses or weights. The Pareto front is formed by the non-dominated solutions related to the conflicting objectives considered in multi-objective optimization. The Pareto front approach is used in many applications of stochastic optimization in different domains [10]. With reference to distribution network reconfiguration, specific techniques operating in the Pareto-based framework include the \(\upvarepsilon \)-constrained method [5] and a number of heuristics that construct and update the best-known Pareto front in an iterative way. A number of these techniques are based on genetic algorithms (GA) and evolutionary computation, such as the widely adopted Non-dominated Sorting Genetic Algorithm II (NSGA II) [11] applied in some contributions [12,13,14], the non-dominated sorting evolutionary programming [15], and the micro-genetic algorithm [16]. Further approaches use an artificial immune systems hybridized with ant colony optimization [8] or with graph theory-based techniques [17], the Multi-Objective Binary Particle Swarm Optimisation (MOBPSO) [18], and a branch-and-bound technique driven by non-dominance concepts [19].

Comparisons among different techniques to address multi-objective network reconfiguration have been presented in [18]. In particular, a GA procedure based on NSGA-II concepts and a version of MOBPSO have been tested, the GA procedure showing better performance. From further research carried out by the authors, the MOBPSO has been largely improved by further customizing it with respect to the specific problem addressed, providing results (shown in this paper) significantly better than the ones of the methods previously used.

Furthermore, in order to compare the results obtained from different methods using Pareto front analysis, a proper metric based on the points of the best-known Pareto front has to be exploited. Some metrics have been defined in terms of the location of the Pareto front points, such as the quality factor used in [20], depending on the number and location of the points forming the Pareto front, and the Pareto front mismatch [12], a geometrical indicator depending on the location of the Pareto front points, based on the concepts of hyper-volume calculation [21]. In order to compute the hyper-volume, a reference point is needed. In [21] the coordinates of the reference point are the worst values of the objective obtained on each dimension for the solutions found so far (including the dominated points). In this paper, the coordinates of the reference point are the worst values obtained for each dimension from the points of the best-known Pareto front found so far (excluding the dominated points). The rationale of this choice is that in the problem under analysis the worst solutions from dominated points could be very far from the region containing the points forming the Pareto front. This would generate large hyper-volumes and lead to obtain similar values for the hyper-volume when the Pareto front changes. In the solution used, the region of analysis is limited to the one defined by the Pareto front points, emphasizing the differences among the solutions with different Pareto fronts.

On these bases, the specific contributions of this paper are:

  1. 1.

    to extend and generalize the definition of the Pareto front mismatch metric in order to apply it to real networks for which the complete Pareto front cannot be formed and only the best-known Pareto front is available;

  2. 2.

    to illustrate the rationale and the details of the improvements introduced in the customized MOBPSO version, in which the specific knowledge on the distribution network topology is used to modify the formulation of the solution procedure;

  3. 3.

    to show a set of significant and consistent results obtained on different distribution systems, including a test system and a real system.

The next sections of this paper are organized as follows. Section “Objective Functions and Constraints” summarizes the objective functions and constraints used in this paper. Section “Generalized Metrics to Compare the Results of Pareto Front-Based Optimization Methods” contains the formulation of the generalized metric used to assess the effectiveness of the Pareto front-based methods. Section “Binary PSO: Concepts and Notation” introduces the notation used for the binary PSO and illustrates the characteristics of the novel MOBPSO version used for multi-objective distribution network reconfiguration. Section “The Multi-Objective Binary PSO (MOBPSO)” contains the discussion of the results obtained in the case study applications. The last section contains the concluding remarks.

Objective Functions and Constraints

Let us consider a distribution network with N nodes, Y physical branches (including the ones that are kept open during the system operation) and S supply points. The number of branches that have to remain open for obtaining a radial network is \(\varLambda =Y-N+S\) [22]. The identification of the best set of \(\varLambda \) branches to open (i.e., the identification of the most suitable configuration to be operated) depends on the objective function defined by the user.

In this paper, the two typical conflicting objective functions energy losses and energy not supplied (ENS) are considered for multi-objective optimization. The conflicting nature of these two objectives strongly depends on the values of failure rates of the branches. For a given time interval with h = 1,..., H time steps of duration \(\tau \) each, the energy losses in the network are computed from the energy losses \(\tau R_{yh} I_{yh}^2\) at branch \(y = 1,{\ldots },Y\) as:

$$\begin{aligned} W_{losses}=\tau \sum \nolimits _{h=1}^{H} \sum \nolimits _{y=1}^{Y}R_{yh}I_{yh}^{2} \end{aligned}$$
(1)

The power losses \(R_{yh} I_{yh}^2\) are calculated by solving at each time step h a power flow based on the backward-forward sweep method [23], particularly suitable for radial distribution networks.

The ENS is a reliability indicator calculated on the basis of rated power of the network loads and duration of supply interruptions, by considering a multi-stage service restoration process after the identification of the fault [24]. By considering the set \({\varvec{\Theta }}_n\) containing all the faults \(\vartheta \in {\varvec{\Theta }}_n\) which affect a given node n, the ENS formulation for node n is:

$$\begin{aligned} ENS_n =C_n \mathop \sum \nolimits _{\vartheta \in {\varvec{\Theta }}_n} \lambda _\vartheta \mathop \sum \nolimits _{\xi =1}^{{\Xi }_\vartheta } \psi _{\vartheta n}^{(\xi )} \tau _\vartheta ^{(\xi )} \end{aligned}$$
(2)

where \({\Xi }_\vartheta \) indicates the number of restoration stages for fault \(\vartheta \), \(\tau _\vartheta ^{(\xi )}\) is the duration of the \(\xi \)th stage of the restoration procedure, \(\lambda _\vartheta \) is the failure rate, \(C_n\) is the reference power of node n, and \(\psi _{\vartheta n}^{( \xi )}\) is a Boolean variable indicating whether node n is supplied during the \(\xi ^{\mathrm{th}}\) stage of fault \(\vartheta \), namely, \(\psi _{\vartheta n}^{( \xi )} =0\) if node n is supplied, otherwise \(\psi _{\vartheta n}^{( \xi )} =1\). The overall reliability indicator for the entire network is calculated by summing up the contributions coming from all the nodes:

$$\begin{aligned} ENS=\mathop \sum \nolimits _{n=1}^N ENS_n \end{aligned}$$
(3)

The multi-objective optimization formulation is

$$\begin{aligned} \min \left\{ {W_{losses} ,ENS} \right\} \end{aligned}$$
(4)

subject to the constraints referring to the thermal limits of the network branches and the lower and upper limits of the node voltage magnitude (\(V_{min }= 0.9\hbox { pu}\) and \(V_{max}= 1.1\hbox { pu}\)). Moreover, the topology constraint is handled in the solution process in order to guarantee that any network configuration formed during the solution procedure is radial [12].

Generalized Metrics to Compare the Results of Pareto Front-Based Optimization Methods

Basic Concepts

The comparison among Pareto fronts requires to set up a reference Pareto front, such that the points located on this front cannot be dominated by any point of any other Pareto front used in the analysis. The definition of the reference Pareto front is straightforward when it is possible to determine the complete Pareto front. However, in distribution network analysis, this happens only for small-size networks and microgrids, for which it is possible to generate all the radial configurations and calculate the objective functions from exhaustive search in reasonable computational time. In these cases, the metric defined in [12] can be used.

For the general case in which the network size makes exhaustive search impracticable, the complete Pareto front is not available. Hence, the extension and generalization of the Pareto front mismatch metric provided in this paper is based on using as reference Pareto front the generalized best-known Pareto front obtained in an alternative way.

Fig. 1
figure 1

Areas used in the definition of the generalized best-known Pareto front mismatch metric. a reference area \(S_{ ref}\). b solution area \(S_s\). c difference area \(S_d\)

In particular, probability-based optimization methods are run several times with different seeds for random number extraction, from which a certain number of best-known Pareto fronts are found for all the compared methods. All the points of the best-known Pareto fronts obtained from the compared methods are then taken simultaneously. From these points, the generalized best-known Pareto front is constructed by excluding all the dominated solutions appearing in the best-known Pareto fronts of the individual methods once considering all the points together.

Once the reference Pareto front has been defined, it is possible to calculate some metrics for comparing the Pareto front solutions. Some examples are provided below.

Quality Factor

The quality factor \(Q_F\) metric [20] is based on the percentage of solution points found from the optimization method under analysis forming the best-known Pareto front. Let us define with \(n_F\) the total number of points of the reference Pareto front and with \(n_0\) the number of points of the Pareto front found by the solution method under analysis that are located on the reference Pareto front. Then, the quality factor is expressed in per cent as:

$$\begin{aligned} Q_F =100\frac{n_0 }{n_F } \end{aligned}$$
(5)

Generalized Pareto Front Mismatch

The Pareto front mismatch has been defined in [12] to introduce a more detailed metric based on geometrical considerations, taking into account not only the number of points located on the Pareto front (as in the quality factor) but also their location.

In [12] the reference Pareto front is the complete Pareto front obtained by performing exhaustive search of all the feasible network configurations. In this paper, the generalized best-known Pareto front becomes the reference Pareto front. In practice, the generalized best-known Pareto front is formed by taking the non-dominated solutions extracted from merging all the best-known Pareto fronts (the final ones obtained by repeating the optimization procedure for a given number of times with different initial seeds for random number extraction).

The pseudo-optimal solutions of the individual objective functions appearing on the reference Pareto front are used to define the upper limits for the other dimensions, hence the reference area \(S_{ref}\) is calculated as indicated in Fig. 1a. Then, taking the Pareto front under analysis, the corresponding solution area \(S_s\) (Fig. 1b) and the difference area \(S_d =S_{ref} -S_s\) (Fig. 1c) are determined. The Pareto front mismatch metric associates a performance value \(\gamma _s\) with a solution method, resulting in a given Pareto front:

$$\begin{aligned} \gamma _s =S_d /S_{ref} \end{aligned}$$
(6)

The metric (6) is used to compare different Pareto fronts, with better Pareto fronts corresponding to lower values of \(\gamma _s\). In our case, the metric (6) has been computed for all the best-known Pareto fronts obtained by running the procedures with different seeds for random number extraction.

Binary PSO: Concepts and Notation

Basic Formulation of the PSO and Related Notation

The PSO introduced in [25] is a fast meta-heuristic method based on tracking the behavior of each element (called particle) belonging to a swarm by computing the velocity of each particle.

In the optimization problem addressed in this paper, the swarm is composed of \(j = 1,{\ldots }, J\) particles, each of which represents a network configuration and contains \(b = 1,{\ldots }, B\) decision variables (binary values identifying the states of the branches, 1 = closed and 0 = open). The PSO is solved through an iterative process, with the generic iteration indicated with the letter k and the \(j = 1,{\ldots },J\) particles denoted as \(\mathbf{x}_j^{( k )} =\left\{ {x_{jb}^{(k)} } \right\} .\)

Each particle changes its trajectory by following some leaders that drive the direction of variation of all the particles. In the presence of local leaders, the variation depends on the behavior of particle j itself or of the nearest particles, with different possible neighborhood topologies [26, 27]. The global leader affects the direction of variation of all particles, taking into account the whole history of the entire swarm.

By considering the behavior of the leaders, at iteration k of the iterative process the change of configuration from \(\mathbf{x}_j^{\left( {k-1} \right) }\) to \(\mathbf{x}_j^{(k)}\) is expressed as

$$\begin{aligned} \mathbf{x}_j^{(k)} =\mathbf{x}_j^{\left( {k-1} \right) } +\mathbf{v}_j^{(k)} \end{aligned}$$
(7)

in which the velocity \(\mathbf{v}_j^{(k)}\) of particle j at iteration k is formulated as:

$$\begin{aligned} \mathbf{v}_j^{(k)}= & {} w^{(k)}{} \mathbf{v}_j^{\left( {k-1} \right) } +r^{{\prime }(k)}A_j^{(k)} \left( {\mathbf{p}_j^{\left( {k-1} \right) } -\mathbf{x}_j^{\left( {k-1} \right) } } \right) +\nonumber \\&+\,\,r^{{\prime }{\prime }(k)}B_j^{(k)} \left( {\tilde{\mathbf{x}}^{\left( {k-1} \right) } -\mathbf{x}_j^{\left( {k-1} \right) } } \right) \end{aligned}$$
(8)

where

  • \(\mathbf{p}_j^{\left( {k-1} \right) } \) represents the local (or personal) best, i.e., the configuration of the particle j with the best objective function found so far;

  • \(\tilde{\mathbf{x}}^{\left( {k-1} \right) }\) represents the global best, i.e., the configuration of the particle that provided the best objective function in the previous \(k-1\) iterations;

  • \(A_j^{(k)}\) and \(B_j^{(k)}\) are two learning factors that weigh the terms depending of the local and global best, respectively;

  • \(r^{{\prime }(k)}\) and \(r^{{\prime }{\prime }(k)}\) are random numbers belonging to the range [0,1], applied to the whole population;

  • \(w^{(k)}\) is the inertia weight.

Adaptive Stop Criterion

The iterations proceed until the stop criterion is satisfied. An effective and adaptive stop criterion is to conclude the iterations when only changes in the objective function lower than a user-defined threshold \(\sigma _s\) occur after a given number \(N_s\) of successive iterationsFootnote 2 [28].

Considerations on the Solutions Found

For any heuristic method driven by probabilistic choices, there is no proof that the global optimum can be reached in a finite time. In the practical application of the discrete/binary heuristic methods like MOBPSO, the possible solutions are a countable number of points, and there is no restriction to reaching any of these points, among which the global optimum, in a finite (but not predictable) time. The evidence of this fact may be also indicated with a proof by contradiction: if there would exist a solution point that cannot be reached during the evolution of the method, the method would not be correctly formulated. However, during the execution of these heuristic methods on large systems (for which exhaustive search cannot be run because of the excessive computation time needed) there is no certainty that the global optimum has been found.

Discrete and Binary PSO Representations for Single-Objective Optimization and Adaptability to the Distribution System Reconfiguration Problem

Discrete and Binary PSO Coding

The principles of PSO operating with discrete values have been indicated in [29]. Different types of representations can be used in discrete PSO, for example based on binary entries or on integer encoding strategies [30]. The treatment of the discrete values depends on two user-defined thresholds, denoted here as \(\chi \) and E. In practice, once the new velocity term is calculated as a real number from (8), the probability of particle evolution for each entry \(b = 1,{\ldots },B\) of each particle \(j = 1,{\ldots },J\) is calculated through the expression \(S\left( {v_{jb}^{(k)} } \right) =1/\left( {1+e^{-\chi }} \right) \). Then, if \(S\left( {v_{jb}^{(k)} } \right) \le E\) the entry \(x_{jb}^{(k)}\) is not changed, otherwise it is changed. The change depends on the integer or binary representations:

  1. (a)

    For the integer encoding strategies, rounding the velocity results to integer values is considered in [31] and the enhanced integer coded PSO method [32] has been used to minimize the total branch losses by maintaining the radial network structure and without violating the constraints. In these cases, the change can be implemented by searching for a feasible configuration represented by the integer coding used, for example adjacent to the configuration that has to be changed. In particular, in [32] the notion of local best is replaced with the so-called local optimal list (LOL), which is updated at each iteration if a good solution is found and provides a list of feasible configurations that can replace the configuration to be changed.

  2. (b)

    In the binary case, the change of the entry is from 0 to 1, or vice versa. Applying this type of change to the particles is appropriate for problems in which all the possible sequences of binary values are acceptable. However, the application of these principles to distribution system reconfiguration is not straightforward, as the mechanism of changing 0’s to 1’s or vice versa may create particles whose representations do not satisfy the radiality constraint [33]. This situation can be handled by generating in any case all the configurations, checking their feasibility and if infeasible generating a fictitious value of the objective function such that these configurations will be discarded in the successive part of the solution process [34]. However, in this way the generation of many infeasible solutions may occur, with a drastic increase in the solution time. Another possibility is to take the infeasible configurations and changing them in order to obtain feasible configurations [35]. In [36] a dedicated shift operator is applied to modify the chromosome structure. The Prüfer number encoding is used in [37] to generate only radial configurations for GA applications. The solution adopted in [38] and in this paper (based on binary coding) automatically generates only feasible network configurations, as described below.

Binary PSO Coding Preserving Radial Configurations

A dedicated version of the discrete PSO that benefits from the distribution network representation has been reported in [38] for the single-objective case. In this formulation, the update of the velocity vector (8) for particle j at iteration k is formulated as:

$$\begin{aligned} \mathbf{v}_j^{(k)} =w^{(k)}{} \mathbf{v}_j^{\left( {k-1} \right) } +r^{{\prime }(k)}A_j^{(k)} \mathbf{e}_j^{(k)} +r^{{\prime }{\prime }(k)}B_j^{(k)} \mathbf{g}_j^{(k)} \end{aligned}$$
(9)

where, by denoting with \( \bar{{ \& }}\) the logical operator NAND:

$$\begin{aligned} A_j^{(k)}= & {} \max \left\{ {0;\,f\left( {\mathbf{x}_j^{\left( {k-1} \right) } } \right) -f\left( {\mathbf{p}_j^{\left( {k-1} \right) } } \right) } \right\} \end{aligned}$$
(10)
$$\begin{aligned} B_j^{(k)}= & {} \max \left\{ {0;\,f\left( {\mathbf{x}_j^{\left( {k-1} \right) } } \right) -f\left( {\tilde{\mathbf{x}}^{\left( {k-1} \right) } } \right) } \right\} \end{aligned}$$
(11)
$$ \begin{aligned} \mathbf{e}_j^{(k)}= & {} \left( {\mathbf{x}_j^{\left( {k-1} \right) } \bar{{ \& }} \,\mathbf{p}_j^{\left( {k-1} \right) } } \right) \end{aligned}$$
(12)
$$ \begin{aligned} \mathbf{g}_j^{(k)}= & {} \left( {\mathbf{x}_j^{\left( {k-1} \right) } \bar{{ \& }}\, \tilde{\mathbf{x}}^{\left( {k-1} \right) } } \right) \end{aligned}$$
(13)

In practice, the values \(A_j^{(k)}\) and \(B_j^{(k)}\) indicated in (10) and (11) depend on the values of the objective function calculated in the current configuration and in the configurations corresponding to the local and global best. In addition, the velocity vector terms are not applied to each branch, but only to a subset of branches determined by using suitable logical operations.

The result of the NAND operation is modified according to the objective function values. For the addend of (9) depending on the local best, if the objective function \(f\left( {\mathbf{x}_j^{\left( {k-1} \right) } } \right) \) is lower (i.e., better) than the objective function of the local best particle \(f\left( {\mathbf{p}_j^{\left( {k-1} \right) } } \right) \), the non-null values coming from the NAND operation are set to zero to avoid making changes, except for the inertia weight, otherwise the difference \(f\left( {\mathbf{x}_j^{\left( {k-1} \right) } } \right) -f\left( {\mathbf{p}_j^{\left( {k-1} \right) } } \right) \) is applied to modify the velocity in a way depending on the random number \(r^{{\prime }(k)}\), as in the classical PSO. The same treatment is reserved to the addend of (9) depending on the global best.

The procedure reported in [38] continues by considering the velocity values related to the open branches of the configuration \(\mathbf{x}_j^{\left( {k-1} \right) }\). These values are normalized and are then used to build the cumulative distribution function of the normalized velocity, which is processed by a biased roulette wheel procedure [39] to choose the open branch to close. The loop resulting from closing that branch is detected and, by choosing randomly the branch to open in the loop, the radial configuration is restored.

Fig. 2
figure 2

Layout of the 14-node system

In order to provide the details of the binary PSO representation on an illustrative example, let us consider the 14-bus system [40] reported in Fig. 2, in which the nodes have been renumbered and the supply nodes have been merged into a single node (i.e., node 0) for purely topological reasoning. The network is composed of \(Y = 16\) branches, \(N= 14\) nodes and \(S = 1\) supply node, so that each radial configuration has \(\varLambda = 3\) open branches.

By considering particle j at iteration k, Table 1 reports for each branch the network configuration \(\mathbf{x}_j^{\left( {k-1} \right) }\), the local best \(\mathbf{p}_j^{\left( {k-1} \right) }\) and the global best \(\tilde{\mathbf{x}}^{\left( {k-1} \right) }\). The operator NAND permits to find the open branches present in both configurations analyzed, but it also sets to 1 the location of the open branches present only in one of the two configurations.

Table 1 Example of binary PSO operations on the 14-node network

The Multi-Objective Binary PSO (MOBPSO)

MOBPSO Formulation

In the multi-objective framework, a specific issue is the selection of the leaders (the local best and the global best) [41, 42]. Furthermore, a quality metric is needed to understand how much a solution is better than another one [43]. This section presents a novel multi-objective version of the method reported in [38], with binary representation of the particles for multi-objective cases, solution ranking through an appropriate quality metric, and specific solutions implemented for the choice of the local best and global best by taking into account the distribution network topology.

Fig. 3
figure 3

Flow-chart of the proposed MOBPSO

The flow-chart is reported in Fig. 3. The initial Pareto front is formed by identifying the non-dominated points from the solutions of the initial radial configurations. The proposed procedure contains the execution of the power flow calculations, obtaining all the voltages and currents in the system. For each solution of the power flow the constraints (upper limits of the branch currents, and upper and lower limits of the node voltages) are checked. Each solution containing one or more violations is discarded. Thereby, only feasible solutions are processed to form the Pareto fronts. The specific aspects are addressed below, by considering two alternative versions of the method:

\(\bullet \) Base version: it is the multi-objective application of the method composed of the equation set (913), with the application of the Multi-Objective Solution Ranking Method (MOSR) procedure (see Appendix) for the choice of the new population components, and the procedure to preserve the radial configuration implemented as reported in “Binary PSO Coding Preserving Radial Configurations” section. The choice of the branch to open in this case is completely random, i.e., the information coming from the velocity term is not used and all the branches forming the loop have the same probability to be open.

\(\bullet \) Customized version: in this case the choice of the closed branch to open for keeping the network radial is made by using the velocity term (see “How to Use the Velocity Term: the Customized Version” section for details).

Creation of the New Population by Applying the MOBPSO Routine: Base Version

A formulation equivalent to (9) in the multi-objective context is presented, for \(m = 1,{\ldots },M\) objectives.

For each particle \(j = 1,{\ldots },J\) and each iteration k the objective function values are indicated as \(f_m \left( {\mathbf{x}_j^{(k)} } \right) \in \mathfrak {R}^{J,1}\). By considering the configuration \(\mathbf{x}_j^{\left( {k-1} \right) }\), the local leader \(\mathbf{p}_j^{\left( {k-1} \right) }\), and the global leader \(\tilde{\mathbf{x}}_j^{\left( {k-1} \right) }\) (selected inside a Leaders Archive as explained in “Update of the Global Best” section; for this reason the global leader used may change for each particle, and the subscript j appears also in this case), it is possible to calculate the following quantities:

$$\begin{aligned}&\zeta _m \left( {\mathbf{x}_j^{\left( {k-1} \right) } ,\mathbf{p}_j^{\left( {k-1} \right) } } \right) \nonumber \\&\quad =\left\{ {{\begin{array}{ll} 1,\hbox {if }f_m \left( {\mathbf{x}_j^{\left( {k-1} \right) } } \right) >&{}f_m \left( {\mathbf{p}_j^{\left( {k-1} \right) } } \right) \\ 0,&{}\hbox {Otherwise} \\ \end{array} }} \right. \end{aligned}$$
(14)
$$\begin{aligned}&\tilde{\zeta }_m \left( {\mathbf{x}_j^{\left( {k-1} \right) }, \tilde{\mathbf{x}}_j^{\left( {k-1} \right) } } \right) \nonumber \\&\quad =\left\{ {{\begin{array}{ll} 1,\hbox {if }f_m \left( {\mathbf{x}_j^{\left( {k-1} \right) } } \right) >&{}f_m \left( {\tilde{\mathbf{x}}_j^{\left( {k-1} \right) } } \right) \\ 0,&{}\hbox {Otherwise} \\ \end{array} }} \right. \end{aligned}$$
(15)

The logical values are calculated by (14) and (15) for all the objective functions. By using the logical operation OR (indicated here with \(\cup \)), it is possible to “sum” the logical values provided by (14) and (15) and to select all the configurations presenting at least one objective higher than the objective of the leaders \(\mathbf{p}_j^{\left( {k-1} \right) }\) and \(\tilde{\mathbf{x}}_j^{\left( {k-1} \right) }\), respectively. Hence, the formulation of the terms \(A_j^{(k)}\) and \(B_j^{(k)}\) indicated in (9) is extended to the multi-objective case as follows:

$$\begin{aligned} A_j^{(k)}= & {} \mathop \bigcup \nolimits _{m=1}^M \zeta _m \left( {\mathbf{x}_j^{\left( {k-1} \right) } ,\mathbf{p}_j^{\left( {k-1} \right) } } \right) \end{aligned}$$
(16)
$$\begin{aligned} B_j^{(k)}= & {} \mathop \bigcup \nolimits _{m=1}^M \tilde{\zeta }_m \left( {\mathbf{x}_j^{\left( {k-1} \right) } , \tilde{\mathbf{x}}_j^{\left( {k-1} \right) } } \right) \end{aligned}$$
(17)

In practice, the single-objective maximization appearing in (10) and (11) is replaced by the OR operation giving result equal to unity if at least one solution is worse than the corresponding solution of the leader considered.

How to Use the Velocity Term: the Customized Version

With the coding used here, the velocity of each element of the particle can be only either 0 or 1. It is then not possible to apply it directly to change the network configuration. Following the approach used in [38] and [18], the velocity values referring to the open branches in \(\mathbf{x}_j^{\left( {k-1} \right) }\) are used as inputs to a biased roulette wheel mechanism, for choosing the open branch \(\ell ^{{\prime }}\) which has to be closed. Then, the set \(\tilde{\mathbf{L}}_j^{(k)}\epsilon \mathbb {N}^{\left( {Y,1} \right) }\) (having an entry equal to 1 in correspondence to the branches forming the loop) is created. The number of elements equal to 1 in the set \(\tilde{\mathbf{L}}_j^{(k)}\) is \( \tilde{L}_i\).

In order to restore the radial network operation, one of the branches belonging to the loop has to be chosen and open, according with the branch exchange principles [40]. In [18] the choice of this branch was random, i.e., the information about the velocity of the branches forming the loop was not used. In this paper, an improved technique based on the information about the velocity of the branches forming the loop is used.

For this purpose, a set \(\tilde{\mathbf{Z}}_j^{(k)}\) is introduced. Its entries \(\tilde{Z}_j\) represent the indices of the branches both belonging to the loop \(\tilde{\mathbf{L}}_j^{(k)}\) and being open branches in \(\mathbf{x}_j^{\left( {k-1} \right) }\), \(\mathbf{p}_j^{\left( {k-1} \right) }\) and \(\tilde{\mathbf{x}}^{\left( {k-1} \right) }\). The set \(\tilde{\mathbf{Z}}_j^{(k)}\) is calculated by applying the OR operation between the two Boolean vectors \(\mathbf{e}_j^{(k)}\) and \(\mathbf{g}_j^{(k)}\), (defined above), and then by applying the AND operation (indicated here with \(\bigcap )\) with the loop \(\tilde{\mathbf{L}}_j^{(k)}\):

$$\begin{aligned} \mathbf{u}_j^{(k)} =\left( {\mathbf{e}_j^{(k)} \cup \mathbf{g}_j^{(k)} } \right) \cap \tilde{\mathbf{L}}_{\varvec{j}}^{({\varvec{k}})} \end{aligned}$$
(18)

Then, the set \(\tilde{\mathbf{Z}}_j^{(k)}\) can be written as

$$\begin{aligned} \tilde{\mathbf{Z}}_j^{(k)} =\hbar \left( {\mathbf{u}_j^{(k)} } \right) \end{aligned}$$
(19)

where \(\hbar \left( {\mathbf{u}_j^{(k)} } \right) \) indicates the operator giving as output the positions of the elements set to 1 in the vector \(\mathbf{u}_j^{(k)}\).

By denoting with \(\oslash \) the empty set, in the customized version the formerly open branch \(\ell ^{{\prime }}\) (i.e., \(\ell ^{{\prime }}\in \tilde{\mathbf{Z}}_j^{(k)} )\) cannot be re-opened, i.e., the set \(\tilde{\mathbf{Z}}_j^{(k)}\) could be empty. If \(\tilde{\mathbf{Z}}_j^{(k)} =\oslash \) the entries of the auxiliary vector \(\mathbf{q}^{(k)}\) are directly obtained as \(q_l^{(k)} =1/\tilde{L}\), for \(l = 1,{\ldots },\tilde{L}\), that is, all the branches belonging to the loop have the same probability to be chosen. On the other hand, if the set \(\tilde{\mathbf{Z}}_j^{(k)}\) is not empty (i.e., \(\tilde{\mathbf{Z}}_j^{(k)} \ne \oslash )\) the determination of the probability to be open for each loop’s branch is more articulated. In particular, an auxiliary vector \(\mathbf{q}^{(k)}\) (with \( \tilde{L}\) elements) is introduced, and its entries \(q_l^{(k)}\) are defined in such a way that the branches coming from the leaders have a higher probability to be open. At the same time, an offset \(\delta > 0\) is introducedFootnote 3 in order to avoid null entries in the vector \(\mathbf{q}^{(k)}\). Hence, for \(l = 1,{\ldots }, \tilde{L}\):

$$\begin{aligned} q_l^{(k)} =\left\{ {{\begin{array}{lll} \delta +v_z^{(k)} ,&{}\quad \hbox {if }z\in \tilde{\mathbf{Z}}_j^{(k)} \\ \delta ,&{}\quad \hbox {Otherwise} \\ \end{array} }} \right. \end{aligned}$$
(20)

Then, the vector \(\mathbf{q}^{(k)}\) is sent to the biased roulette wheel routine to choose the closed branch to open.

Selection and Update of the Local Best and Global Best

Update of the Local Best

The update of the local best \(\mathbf{p}_j^{(k)}\), with \(j = 1,{\ldots },J\), is performed by comparing the new configuration \(\mathbf{x}_j^{(k)}\) with the corresponding previous local best \(\mathbf{p}_j^{\left( {k-1} \right) }\). By exploiting the MOSR ranking concepts illustrated in the Appendix, if the multi-objective function in the new configuration \(\mathbf{x}_j^{(k)}\) is better than the multi-objective function in \(\mathbf{p}_j^{\left( {k-1} \right) }\), the personal best is updated, i.e., \(\mathbf{p}_j^{(k)} =\mathbf{x}_j^{(k)}\), otherwise \(\mathbf{p}_j^{(k)} =\mathbf{p}_j^{\left( {k-1} \right) }\). In particular, in the application used in this paper, the comparison is carried out between two alternatives only. Thereby, the mathematical expressions becomes simpler and are expressed as follows:

$$\begin{aligned} \mathbf{D}=\left[ {{\begin{array}{l} {\mathbf{p}_j^{\left( {k-1} \right) } } \\ {\mathbf{x}_j^{(k)} } \\ \end{array} }} \right] =\left[ {{\begin{array}{ll} {p_1 }&{} {p_2 } \\ {x_1 }&{} {x_2 } \\ \end{array} }} \right] \end{aligned}$$
(21)

Thus, it is possible to calculate both the normalized decision matrix R and, after having defined the weights \(w_{1}\) and \(w_{2}\) for the two objectives, the weighted normalized matrix V becomes:

$$\begin{aligned} \mathbf{V}= & {} \left[ {{\begin{array}{ll} {v_{11} }&{} {v_{12} } \\ {v_{21} }&{} {v_{22} } \\ \end{array} }} \right] =\mathbf{R}\left[ {{\begin{array}{ll} {w_1 }&{} 0 \\ 0&{} {w_2 } \\ \end{array} }} \right] \nonumber \\= & {} \left[ {{\begin{array}{ll} {\frac{p_1 }{\sqrt{p_1^{2}+x_1^{2}}}}&{} {\frac{p_2 }{\sqrt{p_2^{2}+x_2^{2}}}} \\ {\frac{x_1 }{\sqrt{p_1^{2}+x_1^{2}}}}&{} {\frac{x_2 }{\sqrt{p_2^{2}+x_2^{2}}}} \\ \end{array} }} \right] \left[ {{\begin{array}{ll} {w_1 }&{} 0 \\ 0&{} {w_2 } \\ \end{array} }} \right] \nonumber \\= & {} \left[ {{\begin{array}{ll} {w_1 \frac{p_1 }{\sqrt{p_1^{2}+x_1^{2}}}}&{} {w_2 \frac{p_2 }{\sqrt{p_2^{2}+x_2^{2}}}} \\ {w_1 \frac{x_1 }{\sqrt{p_1^{2}+x_1^{2}}}}&{} {w_2 \frac{x_2 }{\sqrt{p_2^{2}+x_2^{2}}}} \\ \end{array} }} \right] \end{aligned}$$
(22)

From the entries of the matrix V, the ideal solution \(A^{+}\) and the negative-ideal solution \(A^{-}\) are calculated as:

$$\begin{aligned} A^{+}= & {} \left[ {\min \left\{ {v_{11} ,v_{21} } \right\} }\quad {\min \left\{ {v_{12} ,v_{22} } \right\} } \right] =\left[ {{\begin{array}{ll} {v_1^+ }&{} {v_2^+ } \\ \end{array} }} \right] \end{aligned}$$
(23)
$$\begin{aligned} A^{-}= & {} \left[ {\max \left\{ {v_{11} ,v_{21} } \right\} }\quad {\max \left\{ {v_{12} ,v_{22} } \right\} }\right] =\left[ {{\begin{array}{ll} {v_1^- }&{} {v_2^- } \\ \end{array} }} \right] \end{aligned}$$
(24)

Finally, the similarity indices of the two particles under analysis are obtained as:

$$\begin{aligned} C_1^+= & {} \frac{S_1^- }{S_1^+ +S_1^-}\nonumber \\= & {} \frac{\sqrt{\left( {v_{11} -v_1^- } \right) ^{2}+\left( {v_{12} -v_2^- } \right) ^{2}}}{\sqrt{\left( {v_{11} -v_1^+ } \right) ^{2}+\left( {v_{12} -v_2^+ } \right) ^{2}}+\sqrt{\left( {v_{11} -v_1^- } \right) ^{2}+\left( {v_{12} -v_2^- } \right) ^{2}}} \end{aligned}$$
(25)
$$\begin{aligned} C_2^+= & {} \frac{S_2^- }{S_2^+ +S_2^- }\nonumber \\= & {} \frac{\sqrt{\left( {v_{21} -v_1^- } \right) ^{2}+\left( {v_{22} -v_2^- } \right) ^{2}}}{\sqrt{\left( {v_{21} -v_1^+ } \right) ^{2}+\left( {v_{22} -v_2^+ } \right) ^{2}}+\sqrt{\left( {v_{21} -v_1^- } \right) ^{2}+\left( {v_{22} -v_2^- } \right) ^{2}}}\nonumber \\ \end{aligned}$$
(26)

If \(C_1^+ >C_2^+\) the local best does not change, i.e., \(\mathbf{p}_j^{(k)} =\mathbf{p}_j^{\left( {k-1} \right) }\); otherwise, \(\mathbf{p}_j^{(k)} =\mathbf{x}_j^{(k)}\).

Update of the Global Best

The selection of the global best does not happen by resorting to the MOSR procedure, to avoid the possibility of obtaining a global best in which the Pareto front points are concentrated too much on the “central” locations, because of the trend of the ranking criterion to fall on the central region of the Pareto front. A classical procedure is the creation of a Leaders Archive, i.e., a set of solutions corresponding to non-dominated points, from which the global best is chosen in a random way each time the velocity of a particle has to be updated.Footnote 4 In this way, it is possible to drive the evolution of the best-known Pareto front to include points located in different regions [44]. The Leaders Archive is formed by a predefined maximum number \(N_{A}\) of points. If there are more then \(N_{A}\) candidate points, a selection procedure has to be used. A classical procedure is based on the crowding distance (the average distance of a point with respect to the two—or more—neighboring points [45]), reducing the number of points in the regions in which the points are very close to each other. A recent version [46] constructs the Leaders Archive by calculating the hyper-volume [21] obtained by using the point under analysis and the reference point, and updates the archive up to \(N_{A}\) points.

In this paper, the update of the Leaders Archive occurs at the beginning of each iteration, by using the hyper-volume criterion [46]. Starting from the solution closest to the reference point for the hyper-volume calculation, each candidate point to be included in the Leaders Archive, sequentially taken from the adjacent solutions along the different dimensions, is associated with its incremental contribution to the global hyper-volume. If the number of candidate points is higher than \(N_{A}\), the points in excess providing the smallest incremental contribution to the hyper-volume are not included in the Leaders Archive. At iteration k, for each particle \(j = 1,{\ldots },J\) the global best \(\tilde{\mathbf{x}}_j^{(k)}\) chosen to contribute in the formation of the new configuration \(\mathbf{x}_j^{(k)}\) is randomly selected among the components of the Leaders Archive.

Stop Criterion

The adaptive stop criterion is the one indicated in “Adaptive Stop Criterion” section.

Case Study Applications

The multi-objective distribution system reconfiguration has been solved by using the customized NSGA-II method illustrated in [12] (denoted here as GA) and the two MOBPSO versions addressed in this paper. The optimization has been coded in Matlab\(^{{\textregistered }}\). The applications have been run on the 33-node network [47] modified as in [12] to add time-varying load and generation power patterns, and on a 90-node real medium voltage (MV) distribution network.

For the test network, the performance of the methods is quantified by using the Pareto front mismatch metric based on the complete Pareto front [12]. For the real MV network, the best-known Pareto front is built by using the results of the different optimization methods, and the performance is assessed by using the quality factor from [20] (see “Quality Factor” section) and the new generalized best-known Pareto front mismatch metric (see “Generalized Pareto Front Mismatch” section).

The input data used for the MOBPSO are \(J= 50\) particles, \(N_s = 15\) iterations and \(N_s^{({max})} =1000\) iterations for the stop criterion, and the inertia weight progressively changing from 0.9 to 0.4 during the iterations [27]. For the GA, the same parameters reported in [12] have been used.

Fig. 4
figure 4

Comparison among GA, MOBPSO from Mazza et al. [18] and MOBPSO base case. a Pareto fronts used to determine the Pareto front mismatch. b CDF of the Pareto front mismatch

33-Node Network

The 33-node network [47] is composed of \(N = 33\) nodes, \(Y = 37\) branches, and \(S = 1\) supply point, from which \(\varLambda = 5\) branches have to be open to obtain radial configurations. The base voltage is 12.66 kV and the base power is 10 MVA. The voltage of the slack node is 1.05 pu. The two objectives have been computed for all the 50,751 network configurations [22] and the complete Pareto front (with 20 points) has been obtained [12]. Figure 4a shows the complete Pareto front, and two examples of best-known Pareto front (coming from the application of GA and from the application of the version of MOPSO reported in [18])

For a more complete analysis, both the GA and the version of MOBPSO reported in [18] have been run for 200 times, with different seeds for random number extraction. The Leaders Archive for the selection of the global best has dimension \(N_{A}= 20\) (i.e., all the points can be randomly chosen as global best, being part of the complete Pareto front). The results shown in Fig. 4b highlight that the performance of the version of MOBPSO reported in [18] was worse than the one of the GA. Now, the results of the MOBPSO base version (in terms of reaching low values of the Pareto front mismatch, as shown from the cumulative distribution function, CDF) are already considerably better than the results of the application of the GA and of the MOBPSO from [18].

Fig. 5
figure 5

Comparisons among GA and the two MOBPSO versions. a Pareto fronts used to determine the Pareto front mismatch. b CDF of the Pareto front mismatch

Both MOBPSO versions (namely, base version and customized version) have been applied to the network. An example of best-known Pareto front for the two MOBPSO versions is shown in Fig. 5a, whereas the statistical representation (reported in Fig. 5b) shows the better performance of the MOBPSO customized version with respect to the GA and the MOBPSO base version. This means that the use of the information about the topology can effectively improve the results in the search of the complete Pareto Front.

Fig. 6
figure 6

Comparisons among the voltage profiles in the initial Pareto front and in the best-known Pareto front obtained with the MOBPSO customized version

The voltage profiles related to the 33-node network in four cases are shown in Fig. 6. Two cases refer to the minimum losses and minimum ENS in the initial Pareto front. The other two cases refer to the best-known Pareto front obtained by the MOBPSO customized version (Fig. 5a). Starting from the solutions belonging to the initial Pareto front, the solutions obtained in the MOBPSO customized version are characterized by remarkable improvement of the voltage profile.

Concerning the computational burden of the methods, Table 2 shows the mean value and the standard deviation of the computational times obtained by running 200 executions of the three methods on an Intel i7-4790 personal computer with CPU at 3.6 GHz and 16 GB RAM. The execution times of the MOBPSO versions are relatively and significantly less than the GA version. This is mainly due to different aspects:

  1. (i)

    The customized GA code contains a number of internal consistency checks to confirm that the mutation operator leads to radial configurations also by performing branch exchange when the open branch to close connects two nodes supplied by the same feeder [12]. These checks are repeated many times and increase the computation time considerably. All checks were successful, so in future version they could be skipped.

  2. (ii)

    The two MOBPSO versions guarantee the radiality of the new particles formed, because the closed branches to open are chosen by using the information related to \(\mathbf{x}_j^{(k-1)}\). Thereby no more checks are requested during the execution.

  3. (iii)

    The computational time of the customized version of MOBPSO is slightly longer and more variable than the one of the base version, due to the handling of the topology. In fact, the branch exchange mechanism takes into account both the loop \(\tilde{\mathbf{L}}_j^{(k)}\) and the open branches in \(\mathbf{x}_j^{(k-1)}\), \(\mathbf{p}_j^{(k-1)}\) and \(\tilde{\mathbf{x}}^{(k-1)}\). The radiality is always guaranteed and the probability to choose open branches belonging to the leaders is higher, but the time consumption slightly increases.

Table 2 Computational burden for the 33-node network
Fig. 7
figure 7

Scheme of the real urban MV distribution network. The dashed lines correspond to the open branches in a radial configuration. A generic radial configuration is shown

Real MV Urban Distribution Network

A further comparison among the methods has been carried out on the real MV network of Fig. 7 serving an urban area. The network is composed of \(N = 90\) nodes, \(Y = 98\) branches and \(S = 3\) supply nodes, leading to \(\varLambda = 11\) branches to open to obtain radial configurations. The network base voltage is 20 kV and the base power is 100 MVA. The supply point voltage is 1.05 pu.

The total number of radial configurations is as high as \(1.176\times 10^{10}\). Hence, it is not practicable to obtain the complete Pareto front for this network. The reference Pareto front needed to assess the performance of the methods is then constructed by combining the points of the best-known Pareto fronts obtained from the GA and the two MOBPSO versions, running each method 200 times to get a relatively higher number of best-known Pareto fronts to combine for extracting the non-dominated solutions. The Leaders Archive for the selection of the global best has dimension \(N_{A} = 20\). The resulting best-known Pareto front (with 67 points, Fig. 8a) is the reference Pareto front used for calculating the performance indicators. Furthermore, Fig. 8a, shows an example of best-known Pareto front obtained by applying GA and the two MOBPSO versions.

Fig. 8
figure 8

Comparisons among the GA and the MOBPSO versions for the real urban MV distribution network. a Pareto fronts used to determine the Pareto front mismatch b CDF of the generalized best-known Pareto front mismatch

Figure 8b shows that also in this case the results of the customized version are substantially better than the results of the GA application. The generalized best-known Pareto front mismatch reaches relatively low values in the best cases. This is also due to the fact that the reference Pareto front in this case is not the complete one (the values would be higher if the complete Pareto front would be available). However, for the comparison among the methods, the use of the new metric introduced is fully appropriate.

The quality factors computed for the GA and for the two MOBPSO versions are reported in Table 3. In this case, higher values correspond to better performance. The customized version of MOBPSO largely outperforms both GA and the MOBPSO base version.

Table 3 Quality factor

The value of the median is interesting: in fact, it highlights the most common quality factor value for both GA and base case of MOBPSO is zero, i.e., for most of the simulations the solutions found do not belong to the Pareto front. In the customized version of MOBPSO versions the median is always higher than zero.

Information on the computational burden is reported in Table 4. The order of magnitude of the computational burden for the two MOBPSO versions is comparable (mean values from about 6 minutes to about 8 minutes, obtained on an Intel i7-4790 personal computer with CPU at 3.6 GHz and 16 GB RAM). The customized version of MOBPSO has longer computational time than the base version, justified by the handling of the topology.

Table 4 Computational burden for the real urban MV network
Fig. 9
figure 9

Quality factor versus computational burden for the two MOBPSO versions. Higher values of the quality factor represent better solutions

A detailed representation of the relation between the performance indices and the computational burden is shown in Fig. 9 for the quality factor and in Fig. 10 for the generalized best-known Pareto front mismatch. The performance improvement for customized version is evident from the higher values of quality factor and the lower values of the generalized Pareto front mismatch, at the expense of an acceptable increase in the computational burden.

Fig. 10
figure 10

Generalized best-known Pareto front mismatch versus computational burden for the two MOBPSO versions. Lower values of the generalized best-known Pareto front mismatch represent better solutions

Fig. 11
figure 11

Voltage profiles at minimum losses in the initial Pareto front and in the MOBPSO customized version

Fig. 12
figure 12

Voltage profiles at minimum ENS in the initial Pareto front and in the MOBPSO customized version

Figures 11 and 12 show the voltage profiles related to the real network. In Fig. 11, the reconfiguration at minimum losses with the MOBPSO customized version leads to a significant improvement of the voltage profile for a large part of the network, while in the remaining part of the network (in which the open branches have not been changed with respect to the initial network) the node voltages remain the same. In Fig. 12 the optimal reconfiguration at minimum ENS carried out with the MOBPSO customized version introduces more general variations in the voltage profile. This is due to the fact that the reduction of losses is consistent with the improvement of the voltage profile, while the ENS improvement follows reliability aspects not necessarily linked with the variation of the network voltages.

Conclusions

In the research on heuristic methods, it has been often found that the customization of the algorithms to embed specific properties of the problem under analysis is successful with respect to the straightforward application of the standard version of the algorithms as general-purpose tools. Following this rationale, in this paper specific novel contributions have been presented on the performance assessment of multi-objective distribution system reconfiguration methods and on the definition of a customized MOBPSO version providing significantly better performance than previous versions of MOBPSO and GA. The generalized best-known Pareto front mismatch metric enables the user to quantify the performance of a given solution method based on the best-known Pareto front in comparison with other methods, in the application to distribution networks of any size. The MOBPSO performance has been largely improved by (i) applying internal ranking in the update of population and leaders and (ii) properly handling the elements involved in the topology variation by maintaining the radial structure. Two MOBPSO versions have been applied to test and real systems, with customized version regularly resulting as the best one. This result shows that the MOBPSO becomes more effective than the GA, raising the interest for using the customized version of MOBPSO in the light of the substantially better solutions provided. Future developments refer to the investigation of the use of further metrics in the comparison of heuristic methods and to the use of further conflicting objectives in the multi-objective reconfiguration problem.