1 Introduction

Environmental, Social and Governance (ESG) factors are becoming increasingly important in investment decision making, not only due to new regulatory frameworks, but also due to the change on the priorities of both, institutional and retail investors. Firms have started to respond to the change on investors’ priorities and they have started to redesign their business models. The COVID-19 pandemic has shown us that those business models which incorporate ESG factors are often better prepared for crisis management, being less exposed to the negative effects of technological or regulatory disruptions (Reteurs, 2020).

The availability of transparent and reliable ESG data allowing the assessment of the different investment alternatives is crucial. Investors, asset managers, financial institutions and other stakeholders are increasingly relying on the ESG ratings and rankings provided by ESG rating agencies.

ESG rating agencies try to measure how well a company has performed highlighting areas that need improvement. They are usually used by firms to identify opportunities in terms of weaknesses or strenghs. However, as stated by Deloitte (2022) “there is no one-fits-all methodology to analyse ESG data used by rating agencies”. In order to get an overall ESG score, each pillar is evaluated individually according to several criteria grouped into several categories. ESG data from the companies is the first required input in any ESG rating process. However, it is not the only one. Other important inputs are the relative importance of the ESG criteria and the comparison method used in the aggregation process. The individual scores are aggregated into an overall or global score that represents the ESG rating of the firm. This aggregation is usually carried out by using a weighted average and by usually considering equal weights. Larker et al. (2022) point out the fact that, surprisingly, several recent studies find low correlations across ESG ratings from main rating agencies. Berg et al. (2022) go one step further, pointing out that main reasons behind these low correlations are the selection of the decision criteria, the assessment method and the weights representing the relative importance of the criteria in the aggregation process.

In this work, we focus on the problem related to the criteria weighting, using a method based on the Technique for Order Preference by Similarity to Ideal Solution (TOPSIS), a distance-based Multiple Criteria Decision Making (MCDM) method, widely used to assist decision makers in analysis, comparisons, and rankings of decision alternatives (Hwang & Yoon, 1981).

In the context of the problem addressed in this paper, TOPSIS needs only two main inputs for its evaluation: weights on criteria and the ratings of the firms on the ESG criteria. Contrary to what happens with other methods, TOPSIS relies more on the data rather than on an exhaustive analysis of the decision makers’ preferences. This is probably one of the reasons why TOPSIS is such a widely used analysis, comparison and ranking method compared to other methods (Shih & Olson, 2022).

The implication on the results of different choices of weights is an issue that, within the framework of TOPSIS, has not been sufficiently discussed (Olson, 2004). As we will see in the next section, several methods have been applied in the TOPSIS framework to elicit weights objectively or subjective, depending on the context of the addressed problem. Nevertheless, in many real decision situations, the decision maker is not able or does not want to elicitate, objectively or subjectively, those weights. This is the situation addressed by UW-TOPSIS.

Un-weighted TOPSIS (UW-TOPSIS) proposed by Liern and Pérez-Gladish (2022) addresses this problem obtaining global ratings based on the best and worst possible situation of each firm without the elicitation of a vector of weights. However, UW-TOPSIS has a drawback: the decision maker does not have decisional information on the relative importance of each criterion in the determining of the obtained ranking. The contribution of this work lies in the identification and description of this problem and its managerial implications.

The rest of the paper is organized as follows. In the following section we carry out a review of the literature describing the main positive and negative features of TOPSIS, focusing on the problematic around the elicitation of weights. Next, in Sect. 3, we describe the UW-TOPSIS algorithm drawing attention to an important managerial limitation of the method: obtaining decisional weights. In this section we deepen on this limitation analizing its consequences from a managerial point of view. In Sect. 4, we propose a solution and in Sect. 5, to illustrate both, the applicability of the proposed approach and the managerial problematic around it and we present a real case study where we rate and rank a set of firms based on Environment, Social and Governance (ESG) and financial criteria. The last section presents the main conclusions of the work.

2 Literature revision

TOPSIS allows the selection of those alternatives that have the shortest distance from the positive ideal solution (PIS) and the farther distance from the negative-ideal solution (NIS). The positive ideal solution maximizes criteria of the type “the more, the better” and minimizes criteria of the type “the less, the better”, whereas the negative ideal solution maximizes “the less, the better” criteria and minimizes “the more, the better” criteria. Based on this and making full use of the attribute information, TOPSIS provides a cardinal ranking of the decision alternatives without requiring independency of the attribute preferences (Chen & Hwang, 1992; Yoon & Hwang, 1995). Due to its popularity, several developments, extensions and applications of the classical TOPSIS method have been proposed in the last decades (Behzadian et al., 2012). The different new approaches depend on the type of the data in the decision matrix, the data normalization process, the type of decision criteria weighting schemes and the selection of distance functions to measure the proximity to the PIS and NIS.

Among the reasons behind the high popularity of TOPSIS, researchers highlight that it is a logic technique able to easily express the rationale of human choice (Corrente & Tasiou, 2023; Huang & Li, 2012; Okul et al., 2014; Olson, 2004; Parkan & Wu, 1997; Roszkowska, 2011). It is also a simple computation process that provides, in its classical formulation a scalar value which quickly measures the relative performance of each alternative considering the best and worst alternatives, simultaneously (Huang & Li, 2012; Kaliszewski & Podkopaev, 2016; Kim et al., 1997; Olson, 2004; Parkan & Wu, 1997; Roszkowska, 2011; Zavadskas et al., 2016).

Several authors have compared the performance of TOPSIS with other techniques such as AHP (Saaty, 1980), ELECTRE (Benayoun et al., 1966) or PROMETHEE (Brans et al., 1984),. Zanakis et al. (1998) conclude that TOPSIS presents few rank reversals compared to ELECTRE, the Multiplicative Exponential Weighting (MEW), the Simple Additive Weighting (SAW), and four versions of AHP problems. Zavadskas et al. (2016) conclude that when compared to AHP, ELECTRE and PROMETHEE the performance of TOPSIS is slightly modified by the number of alternatives and “rank discrepancies are amplified to a lesser extent for increasing values of the number of alternatives and the number of criteria” (Zavadskas et al., 2016). Table 1 summarizes some of the main advantages of classical TOPSIS and its extensions.

Table 1 Main advantages TOPSIS.

However, TOPSIS presents some disadvantages. Table 2 displays some of the main criticisms to this method and its extensions, which are mostly related to the lack of a unified normalization process, the absence of a scheme for weight elicitation and the consequences related to the use of different distance functions. It can also present rank reversal problems. Different extensions of the classical TOPSIS approach have tried to overcome some of these limitations. The choice of the normalization process can alter the results in terms of the ranking. Normalization can give rise to a narrow difference among the data making difficult to reflect the true dominance of alternatives. Liern et al. (2020) delven into this question considering both, precise and imprecise data given by intervals and provide a solution based on the use of membership functions. Discussion around judgment rules for the decision on the components of relative closeness has been addressed by Behavioral TOPSIS (see Shih & Olson, 2022 for a very good description).

Table 2 Main disadvantages of TOPSIS.

Authors as Huang and Li (2012), Okul et al. (2014), Kaliszewski and Podkopaev (2016) and Corrente and Tasiou (2023) point out, as a disadvantage of TOPSIS, a lack of a framework for weights elicitation. In the classical TOPSIS approach preferential weights are usually introduced as inputs in the third step of the algorithm, after normalization of the decision matrix. The weights are generally directly given by the decision maker or the analyst in the beginning of the assessment process and, as in other compensatory MCDM models, they express the relative importance of each criterion reflecting decision maker’s relative preference (Yoon & Hwang, 1995). The choice of the weights representing the relative importance of the decision criteria, is an important step in the application of TOPSIS, because different vectors of weights could provide different results (Olson, 2004). However, as pointed out by Shih and Olson (2022), “the meanings of weights involve more than just the interpretation of importance. For instance, the weight could reflect the rate of substitution among criteria, scaling factors used to convert measures into commensurate overall value, discriminating power of the criteria on the alternatives, and vote values in binary choices” (Shih & Olson, 2022, p. 38).

In this work, we are interested in finding up the discriminating power of the criteria on the alternatives for a certain ranking. That is, instead of elicitating weights for the criteria as inputs (objectively or subjectively, see Shih & Olson, 2022 for a review of usual elicitation techniques), weights become an output of the model, informing us, once a ranking is obtained, about the contribution of each criterion in the composition of the ranking.

Liern and Pérez Gladish (2022) developed Unweighted TOPSIS (UW-TOPSIS) for those situations in which the decision maker is not able or does not want to elicitate precise weights (subjective weights) or does not want to rely on objective weights based on the study of data nature. In UW-TOPSIS, similarly to what happens in some efficiency models (see Kao & Liu, 2000), decision criteria weights are the decision variables in two optimization problems where the objective is to minimize and to maximize the relative proximity of the decision alternatives to the ideal solution. These optimization problems provide for each alternative a relative proximity interval. Each of the extremes of the intervals has an associated matrix of weights reflecting the relative importance of each decision criterion for each alternative. Thus, in the UW-TOPSIS framework, the criteria weights will express, for each alternative, how much each criteria has contributed to the position of that alternative in the ranking. Let us notice that these weights would be probably different for each alternative. This can constitute a limitation of the method, as it does not allow knowing the role that each criterion has played in the ranking, in global terms. In this work, we address and discuss this limitation proposing of a method able to provide these weights, which will be called decisional weights. These decisional weights will be now an output of the model instead of an input and they do not have a preferential meaning.

3 Un-weighted TOPSIS

In what follows we will present the steps of the algorithm proposed by Liern and Pérez-Gladish (2022), Unweighthed TOPSIS (UW-TOPSIS). Steps 1 and 2 remain the same than in the classical TOPSIS framework (see Fig. 1). However, the PIS and NIS solutions are determined now without taking into account the relative importance of the criteria. Weights are introduced as unknowns in step 4 when separation measures from the PIS and NIS are calculated. Their values are determined in step 5 solving two groups of mathematical programing problems which maximize and minimize the separation of each alternative to the PIS and NIS respectively, taking into account different constraints referred to the values of the weights.

Fig. 1
figure 1

Source: own elaboration

TOPSIS framework.

These constraints include the classical constraint in TOPSIS approaches which ensures all the weights are positive and sum up one and other constraints imposing lower and upper bounds on the weights. The resulting mathematical programming problems are, due to the nature of their objective, fractional mathematical programming problems. In what follows we describe the main steps of the UW-TOPSIS method in detail.

Inputs. Let us consider \(\left\{{A}_{i}, 1\le i\le n\right\}\) alternatives, \(\left\{{C}_{j}, 1\le j\le m\right\}\) criteria, d a distance function in \({\mathbb{R}}^{m}\) and \(\mu \) a normalization function.

Step 1. Determine the decision matrix \(\left[{x}_{ij}\right], 1\le i\le n, 1\le j\le m.\)

Step 2. Construct the normalized decision matrix \(\mu \left({x}_{ij}\right)={r}_{ij},\)

$$\left[{r}_{ij}\right], {r}_{ij}\in \left[\mathrm{0,1}\right], 1\le i\le n, 1\le j\le m.$$
(1)

Step 3. Determine the positive ideal \({A}^{+}=({r}_{1}^{+},\dots ,{r}_{m}^{+})\) and the negative ideal solutions \({A}^{-}=({r}_{1}^{-},\dots ,{r}_{m}^{-})\), being

$${r}_{j}^{+}=\left\{\begin{array}{cc}\underset{1\le i\le n}{\mathrm{max}} {r}_{ij}, & j\in J\\ \underset{1\le i\le n}{\mathrm{min}}{ r}_{ij},& j\in J{\prime}\end{array}\right. 1\le j\le m, {r}_{j}^{-}=\left\{\begin{array}{cc}\underset{1\le i\le n}{\mathrm{min}}{ r}_{ij}, & j\in J\\ \underset{1\le i\le n}{\mathrm{max}} {r}_{ij},& j\in J{\prime}\end{array}\right. 1\le j\le m,$$
(2)

where J is associated with “the more, the better” criteria and J’ is associated with “the less, the better” criteria.

Step 4. Let us consider

$$\Omega =\left\{w=\left({w}_{1},\dots , {w}_{m}\right)\in {\mathbb{R}}^{m}, { w}_{j}\in \left[\mathrm{0,1}\right], {\sum }_{j=1}^{m}{w}_{j}=1, { l}_{j}\le {w}_{j}\le {u}_{j}, 1\le j\le m \right\}\ne \varnothing ,$$
(3)

being \({l}_{j},{u}_{j}\ge 0\) lower and upper bounds for each criterion’s weight.

Given \({A}^{+}, {A}^{-}\), we define two separation functions,

$$ D_{i}^{ + } :\Omega \times {\mathbb{R}}^{m} \to \left[ {0,1} \right], D_{i}^{ - } :\Omega \times {\mathbb{R}}^{m} \to \left[ {0,1} \right], 1 \le i \le n, $$

Given by

$${D}_{i}^{+}\left(w\right)=d\left(\left({{w}_{1}r}_{i1},\dots , {{w}_{m}r}_{im}\right),\left({w}_{1}{r}_{1}^{+},\dots , {w}_{m}{r}_{m}^{+}\right)\right), 1\le i\le n,$$
(4)
$${D}_{i}^{-}\left(w\right)=d\left(\left({{w}_{1}r}_{i1},\dots , {{w}_{m}r}_{im}\right),\left({w}_{1}{r}_{1}^{-},\dots , {w}_{m}{r}_{m}^{-}\right)\right), 1\le i\le n.$$
(5)

Step 5. Calculate the function of relative proximity to the ideal solution, \(R_{i} :\Omega \to \left[ {0,1} \right], 1 \le i \le n,\) as

$${R}_{i}\left(w\right)=\frac{{D}_{i}^{-}\left(w\right)}{{D}_{i}^{+}\left(w\right)+{D}_{i}^{-}\left(w\right)}, 1\le i\le n.$$
(6)

Step 6. For each i, \(1\le i\le n\), we calculate the values \({R}_{i}^{min}(w), {R}_{i}^{max}(w)\) solving the two following mathematical programming problems where decision variables are the criteria weights:

$$ R_{i}^{\min } = \mathop {\min }\limits_{w} \left\{ {R_{i} \left( w \right), w \in \Omega } \right\}, \;R_{i}^{\max } = \mathop {\max }\limits_{w} \left\{ {R_{i} \left( w \right), w \in \Omega } \right\},\;1 \le i \le n. $$
(7)

Then, we construct n relative proximity intervals,

$${R}_{i}^{I}=\left[{R}_{i}^{\mathit{min}}, {R}_{i}^{max} \right], 1\le i\le n.$$
(8)

Step 7. We rank the intervals \({R}_{1}^{I}\), \({R}_{2}^{I}\), …, \({R}_{n}^{I}\).

Output. We rank the alternatives \(\left\{{A}_{i}, 1\le i\le n\right\}\) considering that \({A}_{i}\geq {A}_{j}\) if and only if \({R}_{i}^{I}\geq {R}_{j}^{I}\).

To order intervals \(\left\{{R}_{i}^{I}, 1\le i\le n\right\}\) it is usual to select a real number, \({R}_{i}^{*}, 1\le i\le n\), representing the interval and to use each \({R}_{i}^{*}\) as an auxiliar score in the UW-TOPSIS method. For instance, in Liern and Pérez-Gladish (2022, 2023), the authors choose

$$ R_{i}^{*} = \left( {1 - \alpha } \right)R_{i}^{\min } + \alpha R_{i}^{max} , 1 \le i \le n,\;\alpha \;{\text{a constant in}}\;\left[ {0,{ 1}} \right], $$
(9)

and they consider that \({R}_{i}^{I}\geq {R}_{k}^{I}\) if and only if \({R}_{i}^{*}\ge {R}_{k}^{*}.\)

As \({R}_{i}^{min}\) and \({R}_{i}^{max}\) represent the worst and best possible global scores for \({A}_{i}\), respectively, the choice of \(\alpha =0\) means that the decision maker considers the worst case scenario most likely for \({A}_{i}\), and \(\alpha =1\) means the decision maker considers the best scenario most likely for alternative \({A}_{i}\).

Proposition 1.

With previous notation, if in UW-TOPSIS we consider, d, the Manhattan distance in \({\mathbb{R}}^{m},\) \(d\left(x,y\right)=\sum_{j=1}^{m}\left|{x}_{j}-{y}_{j}\right|\), and normalization \({r}_{ij}=\mu \left({x}_{ij}\right)\) given by

$$\mu \left({x}_{ij}\right)= \left\{\begin{array}{cc}\frac{{x}_{ij}-\underset{i}{\mathrm{min}}{x}_{ij}}{\underset{i}{\mathrm{max}}{x}_{ij}-\underset{i}{\mathrm{min}}{x}_{ij}},& j\in J\\ \frac{\underset{i}{\mathrm{max}}{x}_{ij}- {x}_{ij}}{\underset{i}{\mathrm{max}}{x}_{ij}-\underset{i}{\mathrm{min}}{x}_{ij}},& j\in J{\prime}\end{array}\right.$$
(10)

Then,

$${R}_{i}^{min}=\mathrm{Min}\left\{\sum_{j=1}^{m}{{r}_{ij}w}_{j}, w\in\Omega \right\}, {R}_{i}^{max}=\mathrm{Max}\left\{\sum_{j=1}^{m}{{r}_{ij}w}_{j}, w\in\Omega \right\}, 1\le i\le n.$$
(11)

Proof: With normalization given in (10), all the criteria are of the type “the more, the better”. Considering that \({\sum }_{j=1}^{m}{w}_{j}=1, \) replacing in (4) and (5) and with the Manhatann distance, we obtain

$${D}_{i}^{+}\left(w\right)=\sum_{j=1}^{m}{w}_{j}\left(1-{r}_{ij}\right)=1-\sum_{j=1}^{m}{{r}_{ij}w}_{j}, {D}_{i}^{-}\left(w\right)= \sum_{j=1}^{m}{{r}_{ij}w}_{j }, 1\le i\le n.$$

Taking into account (6), \({R}_{i}\left(w\right)=\sum_{j=1}^{m}{{r}_{ij}w}_{j}, 1\le i\le n\).

Definition 1.

Given n alternatives, \(\left\{{A}_{i}, 1\le i\le n\right\},\) m criteria, a distance function d in \({\mathbb{R}}^{m}\) and \(\mu \) a normalization function, we say a ranking of alternatives

$${A}_{{j}_{1}}\geq {A}_{{j}_{2}}\geq {\dots \geq A}_{{j}_{n}},$$

is weigthed-generated, if a vector exists \(w\in\Omega \), such that with it we can obtain that ranking, that is \(S=\left\{w\in\Omega , {R}_{{j}_{1}}\left(w\right)\ge {R}_{{j}_{2}}\left(w\right)\ge {\dots \ge R}_{{j}_{n}}\left(w\right)\right\}\ne \varnothing .\) This vector, \(w\in\Omega \), will be called vector of decisional weights.

Obviously, any ranking obtained with the classical TOPSIS or with any extension considering precise weights an input in the algorithm, is weigthed-generated. However, as we will see later, it is possible to obtain rankings based on \({R}_{i}^{*}\) in UW-TOPSIS which are not weigthed-generated, that is, for which it is not possible to find a vector \(w\in\Omega \), which generates that ranking.

For \({A}_{i}\), we obtain the relative proximity values \({R}_{i}^{min}\) and \({R}_{i}^{max}\) with two vectors \({w}_{i}^{min}=({w}_{i1}^{min},{w}_{i2}^{min}, \dots ,{w}_{im}^{min})\in\Omega \) and \({w}_{i}^{max}=({w}_{i1}^{max},{w}_{i2}^{max}, \dots ,{w}_{im}^{max})\in\Omega \), \(1\le i\le n\), respectively. The UW-TOPSIS method, does not directly provide a set of weights associated to the value, \({R}_{i}^{*}\) given by (9). If the amplitude of the intervals of weights is not very big, \({l}_{j}\le {w}_{j}\le {u}_{j}, 1\le j\le m\), this will not represent an important problem, as for any intermediate relative proximity value we will have an approximte idea of the relative weights of each criterion. However, if the intervals have a big amplitude (for instance, \(0\le {w}_{j}\le 1, 1\le j\le m)\), and we are interested in \({R}_{i}^{*}\in ({R}_{i}^{min},{R}_{i}^{max}\)), we will have to solve a decisional problem. Figure 2 illustrates this situation. Let us observe that for \({R}^{min}\) and \({R}^{max}\) we obtain a matrix of weights in which each row containts the vector of the criteria weights of the decision alternative i. That is, for each decision alternative, we know the relative importance of each criterion determining the minimum and maximum relative proximity to the ideal solution. However, the relative importance of each decision criterion may be different for each alternative.

Fig. 2
figure 2

Source: own elaboration

Decisional weights.

If the decision maker is looking for decisional weights (see Fig. 2), that is, for the weights which contribute to make a particular decision about the relative proximity index expressing the relative importance of each criterion globally, the previously described situation could be even worse, as he/she will be looking for a vector of weights \(w=\left({w}_{1},\dots , {w}_{m}\right)\in\Omega \) able to generate all the relative proximity values \({R}^{*}=\left({R}_{1}^{*}, {R}_{2}^{*}, \dots , {R}_{n}^{*}\right)\) with a common set of weights for all the alternatives. Let us illustrate this situation with an example.

Example 1

Let us consider a decision problem where we need to rank 10 firms from the pharmaceutical industry based on 5 decision criterion (see Table 1). Three of these decision criteria are ESG (Environment, Social, Governance) criteria and the two are financial criteria (Return, Volatility). The forth first criteria are of the type “the more, the better” and the last one is of the type “the less, the better”. Data in Table 1 has been provided by Refinitiv (2021), which offers one of the most comprehensive ESG databases. Specifically, we adopt the individual environmental, social and governance ratings. These metrics range from 0 to 100, with a higher value indicating a better ESG performance of the firm. For the financial return we will use the year to data (YTD) return which is defined as the amount of profit (or loss) realized by an investment since the first trading day of the current calendar year. To calculate YTD, we subtract the starting year value from the current value and we divide the result by the starting-year value. Then we multiply by 100 to convert to a percentage. Volatility will be measure using the standard deviation.

The only information available regarding the relative weight of the decision criteria is that at least two criteria should be taken into account in the decision, that is \(0.05\le {w}_{j}\le 0.5, 1\le j\le 5.\) In this case, the set of feasible weights (see Step 4 in the UW-TOPSIS algorithm) is

$$\Omega =\left\{w=\left({w}_{1},\dots , {w}_{5}\right)\in {\mathbb{R}}^{5}, {0.05\le {w}_{j}\le 0.5, 1\le j\le 5,} {\sum }_{j=1}^{5}{w}_{j}=1\right\}$$

Applying UW-TOPSIS to the decision matrix in Table 3, with the Manhatann distance and normalization given in (10) (see Proposition 1), in Table 4 we display the results for \({R}_{i}^{*}\) obtained as \({R}_{i}^{*}=0.6{R}_{i}^{\mathit{min}}+0.4{ R}_{i}^{max}, 1\le i\le 10.\) The full reproducible code, developed by the authors, is available in the Appendix.

Table 3 Decision matrix.
Table 4 Relative proximity values.

In Table 5 we have displayed the weights with which each firm reaches the worst and best relative proximity value Ri.

Table 5 Matrices with weights associated to Rmin and Rmax.

It is clear that from the information displayed in Table 5 we cannot obtain a unique vector of weights \(w\in\Omega \) with which we could obtain the results in Table 4.

$${F}_{1}\geq {F}_{4}\geq {F}_{2}\geq {F}_{3}\geq {F}_{5}\geq {F}_{9}\geq {F}_{8}\geq {F}_{6}\geq {F}_{7}\geq {F}_{10}$$
(12)

Taking into account Definition 1, we wonder if S is an empty set, being S

$$S=\left\{\begin{array}{cc}w\in\Omega ,& { R}_{1}\left(w\right)\ge {R}_{4}\left(w\right)\ge {R}_{2}\left(w\right)\ge {R}_{3}\left(w\right)\ge {R}_{5}\left(w\right)\ge \\ & \ge {R}_{9}\left(w\right)\ge {R}_{8}\left(w\right)\ge {R}_{6}\left(w\right)\ge {R}_{7}\left(w\right)\ge {R}_{10}\end{array}\right\}$$
(13)

With the distance and normalization chosen for this example, we can apply Proposition 1, and thus \({R}_{i}\left(w\right)=\sum_{j=1}^{5}{{r}_{ij}w}_{j}, 1\le i\le 10\). Then, the inequalities given in (19) are linear and we can easily probe that \(S=\varnothing .\) That is, no vector of weights exist, \(w\in\Omega \), that generates que ranking of firms given in (12). In what follows we will propose a solution for this problem.

4 Obtaining decisional weights

Given a vector \({R}^{*}=({R}_{1}^{*}, {R}_{2}^{*}, \dots , {R}_{n}^{*})\) with \({R}_{i}^{*}\in \left[{R}_{i}^{\mathit{min}}, {R}_{i}^{max} \right], 1\le i\le n,\) we wonder if the system of equations

$$\frac{{D}_{i}^{-}\left(w\right)}{{D}_{i}^{+}\left(w\right)+{D}_{i}^{-}\left(w\right)}={R}_{i}^{*}, 1\le i\le n,$$
(14)

has solution for some \(w\in\Omega \).

Of course, if in (3) \({l}_{j}={u}_{j}, 1\le j\le m\), the system in (14) has a solution w that is the vector of weights used in a classical TOPSIS approach (see Fig. 1). However, if the existence of solution in (14) is not guaranteed, we will use an approximate solution. Specifically, if we consider the function \(R:\Omega \to {[\mathrm{0,1}]}^{n},\) given by

$$R\left(w\right)=\left({R}_{1}\left(w\right),{R}_{2}\left(w\right),\dots ,{R}_{n}\left(w\right)\right),$$
(15)

we need to find a vector \({w}^{*}\in\Omega \) such that \(R\left(w\right)\) is approximately equal to \({R}^{*}\). To find vector \({w}^{*}\), we can solve an optimization problem

$$\mathrm{Min}\left\{d\left(R(w),{R}^{*}\right), w\in\Omega \right\}.$$
(16)

The properties of \(\Omega \) facilitate the existence of solution in (16) (see (3)). For example, (16) can be easyly solved as follows

$$\mathrm{Min}\left\{\frac{1}{n}\sum_{i=1}^{n}{\left(\frac{{D}_{i}^{-}\left(w\right)}{{D}_{i}^{+}\left(w\right)+{D}_{i}^{-}\left(w\right)}-{R}_{i}^{*}\right)}^{2}, w\in\Omega \right\}.$$
(17)

That is, if we suppose that \({R}^{*}\) is the vector with the true values, in (17) we obtain the vector \({w}^{*}\in\Omega \) which minimizes the mean-squared error (MSE), assuming \({R(w}^{*})\) as an estimation of \({R}^{*}\).

The purpose of (17) is to be as close as possible to a set of given scores, \({R}_{i}^{*}\). Nevertheless, this does not guaraantee that the rankings given by \(\left\{{R}_{i}^{*}, 1\le j\le m\right\}\) and \(\left\{{{R}_{i}(w}^{*}), 1\le j\le m\right\}\) are the same. If, in addition to the proximity among the scores, we want to maintain the ranking \({R}_{{j}_{1}}\ge {R}_{{j}_{2}}\ge {\dots \ge R}_{{j}_{n}}\) given by \(\left\{{R}_{i}^{*}, 1\le j\le m\right\}\), as in Definition 1, we should optimize in \(S=\left\{w\in\Omega , {R}_{{j}_{1}}\left(w\right)\ge {R}_{{j}_{2}}\left(w\right)\ge {\dots \ge R}_{{j}_{n}}\left(w\right)\right\}\), that is

$$\mathrm{Min}\left\{d\left(R(w),{R}^{*}\right), w\in \mathrm{S}\right\}.$$
(18)

Solving (18) means finding a vector of weights that reconciles two uses of TOPSIS: a rating and a ranking generator. Nevertheless, as we will see in what follows, feasibility in (18) is not guaranteed (19) and this is one of the questions addressed in this work.

In what follows, we will summarize the previous reasoning as a subrutine that we will use in the overall scheme displayed in Fig. 3.

Fig. 3
figure 3

Source: own elaboration

Flowchart with the different options to obtain a ranking of the alternatives and to know the relative weights of the criteria.

4.1 Subroutine ApW

Using previously notation, we can express the previous reasoning in algorithmic form.

Inputs. Let us consider a vector \({R}^{*}=({R}_{1}^{*}, {R}_{2}^{*}, \dots , {R}_{n}^{*})\) with \({R}_{i}^{*}\in \left[{R}_{i}^{\mathit{min}}, {R}_{i}^{max} \right], 1\le i\le n,\) and an associated ranking of alternatives\({A}_{{j}_{1}}\geq {A}_{{j}_{2}}\geq {\dots \geq A}_{{j}_{n}}\).

Step 1. We construct the set \(S=\left\{w\in\Omega , {R}_{{j}_{1}}\left(w\right)\ge {R}_{{j}_{2}}\left(w\right)\ge {\dots \ge R}_{{j}_{n}}\left(w\right)\right\}\)

Step 2. We calculate

$${EMC}_{S}=\mathrm{Min}\left\{\frac{1}{n}\sum_{i=1}^{n}{\left({R}_{i}\left(w\right)-{R}_{i}^{*}\right)}^{2},w\in \mathrm{S}\right\}.$$
(19)
  • If \({EMC}_{S}\) is feasible, being its solution \({w}^{*}=\left({w}_{1}^{*}, {w}_{2}^{*}, \dots , {w}_{m}^{*}\right),\) go to Step 4.

  • If EMCS is infeasible, go to Step 3.

Step 3. We calculate

$${EMC}_{\Omega }=\mathrm{Min}\left\{\frac{1}{n}\sum_{i=1}^{n}{\left({R}_{i}\left(w\right)-{R}_{i}^{*}\right)}^{2},w\in\Omega \right\}.$$
(20)

and we obtain the optimum solution \({w}^{*}=\left({w}_{1}^{*}, {w}_{2}^{*}, \dots , {w}_{m}^{*}\right).\)

Step 4. Using vector \({w}^{*}=\left({w}_{1}^{*}, {w}_{2}^{*}, \dots , {w}_{m}^{*}\right)\) obtained in (19) or (20), we apply the classic TOPSIS approach to the original decision matrix \(\left[{x}_{ij}\right], 1\le i\le n, 1\le j\le m\), obtaining the relative proximities \({R}_{1}({w}^{*})\), \({R}_{2}({w}^{*})\), …, \({R}_{n}({w}^{*})\).

Output. We rank the alternatives \(\left\{{A}_{i}, 1\le i\le n\right\}\) considering \({A}_{i}\geq {A}_{j}\) if and only if \({R}_{i}({w}^{*})\geq {R}_{j}({w}^{*})\).

As we can observe in Fig. 3, the decision maker will only need to know, at the beginning of the process, the decision alternatives, the decision criteria and to decide about the normalization and distance functions (this decision will be determined by the type of decisional problem). If the criteria weights for the aggregating process can be elicitated by the decision maker (objectively or subjectively), we will apply the classical TOPSIS approach to rank the alternatives. However, in those situations in which the decision maker does not want or is not able to elicitate the criteria weights, we will apply UW-TOPSIS. UW-TOPSIS will consider the weights as unknowns in the optimization problems that search for the minimum and maximum relative proximity of each alternative to the ideal solution. Therefore, after the optimization process we will have, for each alternative, instead of a scalar representing the relative proximity, an interval. The extremes of the intervals will represent the worst and best situation for each alternatives in terms of the value of Ri and will have an associate vector of weights, expressing, for each alternative, the contribution of each criterion in its possition in the ranking. If the decision maker wants to rank the alternatives based on the obtained global scores or rates, Ri, he/she only needs to choose any of the multiple available methods to rank intervals on the real line. As commented before, in this work, we have chosen to work with the convex linear combination of the extremes of the intervals, letting the decision maker decide his/her risk aversion regarding the ranking scenario: if he/she feels pessimistic, he/she will choose the lower extreme of the interval and rank the alternatives accordingly. If he/she feels totally optimistic he/she will choose the upper extreme of the interval. For any other intermediate situation, he/she will choose an intermediate value for the coefficient in the linear combination, obtaining the associate ranking.

At this point of the process, is where the procedure propossed in this work starts. If the ranking is weighted-generated then we will have the decisional weights. If this is not the case, then we will need to find a vector of weights that better aproximate the value of Ri. Once these approximate decisional weights have been obtained they are included in a classical TOPSIS as inputs and the corresponding ranking is obtained.

To show the usefulness of the proposed solution, let us go back to the previous example (Example 1). If in this example, we apply the Subroutine ApW, we obtain the vector of decisional weights

$$ w^{*} = \left( {0.1046952, 0.1943971, 0.1768560, 0.2532284, 0.2708232} \right) $$

The results applying the classical TOPSIS with this vector of weights \({w}^{*}\) are shown in Table 6.

Table 6 Relative proximity values using classical TOPSIS with decisional weights \({w}^{*}\).

Figure 4 displays the relative proximity values \({R}_{i}^{*}\) (Table 2) and \({R}_{i}({w}^{*})\) (Table 6) facilitating comparison and the goodness of fit.

Fig. 4
figure 4

Source: own elaboration

Comparison of relative proximity values.

In the next section we illustrate the proposed model with a real case study extending Example 1.

Remark 2.

If we repeat the reasoning with a degree of optimism α = 0.6, that is \({R}_{i}^{*}=0.4{R}_{i}^{\mathit{min}}+0.6{ R}_{i}^{max}, 1\le i\le 10,\) the obtained ranking would be

$$\begin{array}{c}{R}_{1}^{*}=0.887\geq {R}_{3}^{*}=0.745\geq {R}_{2}^{*}=0.734\geq {R}_{4}^{*}=0.709\geq {R}_{5}^{*}=0.651\geq \\ {\geq R}_{9}^{*}=0.611\geq {R}_{6}^{*}=0.542\geq {R}_{7}^{*}=0.531\geq {R}_{8}^{*}=0.491\geq {R}_{10}^{*}=0.334\end{array}$$
(21)

The vector of weights obtained with our method is

$$ w^{*} = \left( {0.22923, 0.2184641, 0.1745323, 0.2011543, 0.1766192} \right) $$

with error EMC = 0.0011255381 (see (19)). Vector \({w}^{*}\) does not generates the ranking associated to (21), as it permutes firms F8 and F7, that is,

$${F}_{1}\geq {F}_{3}\geq {F}_{2}\geq {F}_{4}\geq {F}_{5}\geq {F}_{9}\geq {F}_{6}\geq {F}_{8}\geq {F}_{7}\geq {F}_{10}.$$

Nevertheless, with vector

$$ w^{*} = \left( {0.1988455, 0.2320587, 0.1923732, 0.1978837, 0.1788389} \right) $$

we can obtain the ranking associated to (21) with an error EMC = 0.001248, slightly higher than the one obtained with our method.

5 Real case study: ranking firms based on their environmental, social and governance performance

Refinitiv measures a company's relative Environmental, Social and Governance (ESG) performance, commitment and effectiveness across 10 main themes, based on publicly available and auditable data (see Fig. 5). To illustrate the proposed methodology with a real case, let us rank the firms included by Refinitiv in the Top 100 EMEA companies in terms of their Environmental, Social and Governance (ESG) Performance using data from august 2022.

Fig. 5
figure 5

Source: own elaboration based on Refinitiv

Refinitiv dimensions and main themes.

The firms relative ESG performance is expressed by means of ESG scores ranging from 0 to 100, reflecting the commitment and effectiveness of the firms in each dimension, based on company-reported data. Refinitive also provides an overall ESG combined (ESGC) score, which takes into account significant ESG controversies affecting the firms. As the importance of ESG factors is different across industries and countries, Refinitiv takes into account the sector of the firms in the case of the environmental and social dimensions and the country in the case of the governance dimension.

In this work we will use the Environmental, Social and Governance scores. In addition, we will consider two financial criteria: return and risk. For the financial return we will use the year to data (YTD) return which is defined as the amount of profit (or loss) realized by an investment since the first trading day of the current calendar year. To calculate YTD, we subtract the starting year value from the current value and we divide the result by the starting-year value. Then we multiply by 100 to convert to a percentage. Volatility will be measure using the standard deviation. All the data, ESG scores and financial data were provided by Refinitiv on august 2022. Table 7 displays the main statistics of the dataset.

Table 7 Descriptive statistics

We have considered 5 different scenarios for the weights. In all scenarios the upper bound is set at 0.5. With this we force the solution to take into account all the decision criteria. For the lower bounds we have consider different situations ranging from 0 to 0.1 (see Table 8). The obtained decisional weights are displayed in Table 8. Results are displayed by sectors. We have used the Global Industry Classification Standard (GICS) which is a common global classification standard used by Refinitiv. Last column shows the root-mean-squared errors (RMSE). As mentioned in the previous section, the full reproducible code, developed by the authors, is available in the Appendix.

Table 8 Results in different scenarios.

Let us observe scenario 2. In this scenario the weights have, in the optimization problem, a minimum bound equal to 0.025 and a maximum bound equal to 0.5. Figure 6 displays the optimum decisional criteria weights by sectors. As we can observe, the financial return has the same relative importance regardless the sector.

Fig. 6
figure 6

Source: own elaboration based on Refinitiv

Decisional weights by sectors in scenario 2.

There is a small discrepancy among sectors in the relative importance of the social criterion and the governance criterion. However, we can observe some important differences in the relative importance of the environmental decision criterion. The sector with the hightest environment relative importance is Communication Services with more than 40% of relative importance. On the other hand, for the Information sector, the relative importance of the environment criteria is less than 15%. The remaining sectors give this decision criteria a similar importance.

It is interesting to observe, how the Information sector is the one giving more importance to financial volatility and Communication Services sector is the sector for which financial volatility has less importance.

The obtained results provide information to the decision maker regarding those criteria which are not influencing the ranking of the firms given their sectors. In the previously commented example, the financial return of the firms is not determining their position in the ranking. The same is applicable to the social criterion. The focus in this case, would be on the environmental and governance criteria and on the financial volatility.

6 Conclusions

Investors, consumers and goverments are increasingly demanding reliable ESG rating and ranking of firms. These ratings can identified the weaknesses and strenghs in terms of ESG performance of the companies, therefore representing an opportunity to improve. Several rating agencies provide all around the world ESG global ratings. However, the methodology behind the rating and ranking processes followed by these agencies has been widely critizied. Among its main weaknesses we can find problems related to the reliability of the ESG data provided by the companies but also related to the decision criteria selection and to the determination of their relative importance in the aggregation process giving rise to the global ESG rating.

TOPSIS is a well-known and widely applied MCDM distance-based method which has a double possible use: the rating and ranking of a set of decision alternatives based on different decision criteria. Classical TOPSIS approaches usually require two main inputs: the individual scores of the alternatives with respect to the criteria and criteria weights. If the decision maker is able to elicitate, objectively or subjective, the criteria weights then, the classical TOPSIS approaches are useful. Otherwise, for those situations in which this is not possible we propose the use of UW-TOPSIS.

In UW-TOPSIS the decision criteria weights are considered variables in two optimization problems which determine and interval with the minimum and maximum relative proximity to the positive ideal solution. In this method, two matrices of weights are obtained associated to the extremes of the relative proximity intervals. However, if the decision maker looks for a unique vector of weights with the relative importance of the criteria (decisional weights), as in the classic TOPSIS, the UW-TOPSIS method is not always able to direcly provide it. In this paper, we discuss this situation and its managerial consequences proposing an approximate method that solves the problem of the determination of those decision weights.

The proposed approach has been applied to the ranking of a set of firms based on ESG and financial criteria. Using different lower and upper bounds we have generated several scenarios which have provided us with decisional vectors of weights expressing the relative importance of the decision criteria globally, that is, taking into account all the alternatives. Following the spirit of Refinitiv, we have calculated the vectors of weights taking into account the sector classification of the firms. The vectors with weights can be easily obtained taking also into account the country of the firms. The obtained results inform us about the real relevance of each decision criteria.

The full reproducible code, developed by the authors, available for all the readers, is displayed in the Appendix.