A novel entropy-based weighted attribute selection in enhanced multicriteria decision-making using fuzzy TOPSIS model for hesitant fuzzy rough environment

The existing approaches of multicriteria decision-making (MCDM) process might yield unreliable and questionable results. The notable challenges of MCDM approaches are rank reversal paradox and uncertainty. The prime inspiration for researchers is the MCDM for hesitant fuzzy sets (HFSs). In some scenarios, the decision-makers could not choose one from numerous values while expressing their preferences. HFS which is the extension of fuzzy sets (FS) is found to be helpful in solving such decision-making (DM) problems. The DM process is revolutionized with the commencement of powerful and efficient tools of data representation for expressing vagueness and uncertainty in data sets as FSs (both generalized and hesitant ones). This paper copes with one such novel approach that involves entropy-based attribute weighting, followed by an evaluation of approximate sets in the fuzzy rough framework. Correlation of the input alternatives in respect of evaluation criteria and the output class is evaluated. With the fuzzy technique for ordered preference by similarity to ideal solutions (FTOPSIS), the generated correlation matrix is utilized for calculating the degree of closeness (δ\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$ \delta $$\end{document}) of the output classes to the input alternatives. This paper made a novel contribution of performance indicator centered on FTOPSIS for the hesitant fuzzy rough domain. The proposed method’s efficiency is established through comprehensive and systematic experimentation on datasets utilized by researchers globally. The proposed algorithms prove its ability to handle datasets that involve human-like hesitant thinking in the MCDM system by contrasting with the existing ones.


Introduction
For decades, MCDM has remained as an inexorable topic of research. Optimum selection of alternatives considerably affects the DM of picking a suitable one from a provided set of conflicting criteria. The uncertainty and vagueness involved with the human DM process could be effectually modeled by FS theory. MCDM embraces attributes, decision methods, selection criteria, and even subjective estimation of experts [1]. Improvisation in classical FSs [2] was done for handling the uncertainties and vagueness. Extended versions of FSs embrace fuzzy rough sets (FRS) [3] which could handle the indiscernible datasets effectually in a fuzzy framework. Researchers made countless attempts for incorporating reallife complex scenarios that involve uncertainty into the datasets and solve it utilizing FTOPSIS [4][5][6][7]. It has now been meticulously adopted in several use cases on account of its simplicity, comprehensive mathematical concept along with computational efficiency. The extension of the classical TOP-SIS approach in regard of fuzzy logic, namely FTOPSIS, has also been effectively implemented in disparate applications like Networks, Supply Chain Management, Defense Industry, Construction, Healthcare, etc., FTOPSIS was employed in countless practical use cases, starting from choosing a suitable supplier for manufacturing through assessment of service 1 3 quality and ending at selection and ranking of the renewable energy (RE) sources, confirming that is widely implemented in innumerable practical issues. Additionally, the energy policies' selection and ranking of the RE sources are the eminent challenges tackled by FTOPSIS. Hence, there TOPSIS studies are becoming popular regarding the problems, which consider sustainable development, environment, and RE sources. Decision-makers present variable opinions for the alternatives, which brings uncertainty. HFS has an imperative role in modeling such uncertainty. This difference in opinions could be due to inadequate information or their different backgrounds. Researchers have widely explored HFSs in respect of the aggregation operators (AOs), various information measures as well as their application on DM [8]. Expert assessment of the attributes is done utilizing probable membership values that the attribute could possibly take. HFSs were proffered by [9] and were intensively utilized by researchers in respect of AOs for DM [10][11][12]. An outline of trends and tools associated with HFSs was studied by [13]. A fusion of the Rough Sets (RS) model and HFSs was explored by [14] by rendering an axiomatic and constructive mathematical framework. Probabilistic and Pawlak's models were propounded by [15]. Enhanced concept associated to approximate precision and roughness for hesitant fuzzy compatible rough space was examined by [16]. Dual HFSs and associated AOs were studied by [17]. Attribute reduction was intensively examined by [18]. The utilization of decision-theoretic RSs for the purpose of resolving DM problems in HFSs was carried out by [19]. However, it might be difficult or expensive to develop criteria set, wherein all criteria are independent in certain situations. In some real-life scenarios, on account of the higher uncertainty of the situation and the restricted cognition of human thinking, it is hard for decision-makers to make a choice in selecting merely one alternative as of a candidate alternative set or evaluation arguments set to show their preference. They might highly hesitate amongst several alternatives or evaluation arguments. In these similar scenarios, it is reasonable to formulate a new DM rule or build a tool that permits decisionmakers to express their judgments or preferences on several objects with individual degrees of hesitation. Consequently, it is requisite to comprehensively study the HFSs with interactive criteria and construct an MCDM approach by considering the interaction amongst criteria. This paper has brought about a pioneering work in the FRSs field as it bridges the gap from RSs to HFSs for attribute reduction. It can elevate the DM efficiency and lessen the decision pressure, because, here, the decision-makers are permitted to express their preference in form of entropy centered weighted attribute selection.
The forthcoming section handles preliminaries of hesitant FRSs, as well as RSs, and is followed by methodology and experimentation. A detailed explanation of proposed work and its implementation on two disparate cases of hesitant fuzzy data sets are done in subsequent sections.

Preliminaries
Here, the basic RS and FRS concepts are expounded in detail.
Definition 1 [20] Consider information system 'I', universes of discourse 'X', non-empty finite set 'A', and attribute value ' Y a ' where I = (X, A). for every a ∶ X → Y a for every a ∈ A . And, 'A' that is a decision system could be defined as A = (C ∪ D) , where C and D are a set of conditional and decision attributes, respectively. The core notion in RS theory exists in finding the lower approximations (LA) as well as upper approximations (UA) centered on IND (P)equivalence relation, where Definition 2 [20] If (x, y) ∈ IND(P) , then (x, y) is indiscernible by 'P' attribute. Consider an equivalence class generated as of IND (P) as [x] P . Here, the LA is P − X and UA is P X , and both are evaluated as The tuples P − X and P X are termed an RS: Definition 3 [20,21] The considered positive region comprises all objects which could be positively classified to the classes of U/Q. The determination of dependence between the attributes is proffered by Eq. (3).
By determining the change in the dependence, while a feature is added or removed, significance of the feature is evaluated by [20,22].
The issue of crisp LA and UA adversely influences the classification accuracy and is effectually handled by FRS explained in [3,23,24].

Definition 4
The definitions of Membership functions for fuzzy LA and fuzzy UA are proffered as Eq. (4) where F i -fuzzy equivalence classes belonging to U/P.
A fuzzy positive area is then evaluated using extension principle as: (2) (4) Likewise, a new Fuzzy dependence function could be evaluated as: RS theory as introduced by Pawlak regards the information subspaces in the sort of LA, UA, and boundary region, whereas the FRSs approximate the same subspaces as overlapping regions having certain membership values [25]. The FRSs' concept was extended by Zhang et al. [26] and Chen et al. [27] for the cases embracing DM uncertainty. Hesitant FRSs have been utilized effectually in the literary works for handling hesitant DM.

Hesitant fuzzy sets: basic concepts
Definition 5 Consider X as a reference set and, here, the HFSs A on the X set defined in respect of function h A (x) . While it is employed to X, it returns a sub set A as where h A (x) could be called hesitant fuzzy elements (HFE) [10,28] and it indicates the set of possible membership degrees of x ∈ X element to A. Definition 6 For a given HFE (h), the lower bound as well as upper bound as per [29] are,

Definition 7
The score function of the HFSs s(h A (x)) as per [29] is: However, the normalized score function could be proffered as: h − (x) = minh(x) h + (x) = maxh(x).
Definition 8 If X, Y are the '2' non-empty finite universes and as well R signifies "X to Y" hesitant fuzzy relationship, then (X, Y, R) is called Hesitant fuzzy rough approximations (HFRA) space. For any P ∈ HF(Y) , the LA and UA are indicated by R − (P) and R (P) respectively [26], where where Definition 9 As X stands as a finite universe of discourse, Torra et al. [9] offered the succeeding operations on hesitant FRSs. For any P, Q ∈ HF(X) , then for all x ∈ X:

The union of HFSs
The proposed correlation grounded on entropy-centric ordered weighted approach for HFRS is proffered as: where m (A, B) satisfies the below properties By utilizing Cauchy's Schwarz inequality, the above equation becomes: Therefore: When A = B, then: Definition 11 [30] Information entropy H(X) of knowledge X proffers the uncertainty measure about knowledge X and is evaluated as

Methodology
Here, a detailed and systematic description on the proposed mathematical design for DM in HFR framework is proffered. The novelty exists in rendering weighted entropy centered optimum attribute selection method for assessing correlation of the input alternatives with the output class in HFR domain. Entropy weight approach gauges value dispersion in DM and is the common weighting methodology. If the degree of dispersion is greater, then its degree of differentiations will be greater, and can derive more information. Moreover, the maximal weight must be provided to the index and vice versa. This entropy weighting approach always gives reliable and effective results. As per [9], the DM uncertainty could be best expounded with the employment of HFSs. The relevant attributes could be specified for further processing utilizing entropy centered evaluation of weights for the attributes. MCDM in HFS was extensively studied by [10,16]. Nevertheless, the performance indicators employed by Zhang et al. [29] render ambiguous outcomes on the dataset utilized in this work. Hence, these performance indicators are re-framed in the proposed model. As the fifth parameter, the FTOPSIS centered performance indicator is utilized to assess the alternatives appropriately. A detailed clarification of the approach is proffered below: Also consider R(x i , y j ) as the relational matrix which shows the fuzzy relation from X → Y where input is x i (x i ∈ X) and output is y j (y ∈ Y) 2. This step finds S n which is the normalized score matrix (NSM), where S indicates a score matrix as per Definition 3 3. As provided in Definition 8, the entropy-based determination of weights of attributes is where s ij signifies the NSM. Attribute weights are given is 4. Calculation of correlation coefficient for every alternative A i and the output y j is given in step 7. 5. Calculation of LA and UA spaces in respect of (X, Y, R) is symbolized as R − (P) and R (P) which are the '2' approximate hesitant FRS. 6. Computation of the performance indices ( PI i ) [29] is detailed below: The applied decision rules are: 1. If PI 1 ∩ PI 2 ∩ PI 3 ∩ PI 4 ≠ � , and then, the optimal output will be The renders a rational solution to the problem of ascertaining optimum attributes for a specific dataset. Find for every alternative as: This work proposes as the PI 5 , an additional performance indicator in fuzzy rough approach. The decision rules are also enhanced accordingly to have rules which assist in choosing input samples having maximal correlation with the class and regarding the output parameters. The rules are re-framed as: 1. If PI 1 ∩ PI 2 ∩ PI 3 ∩ PI 4 ∩ PI 5 ≠ � , then optimal output will be y k where k = PI 1 ∩ PI 2 ∩ PI 3 ∩ PI 4 ∩ PI 5 . 2. If PI 1 ∩ PI 2 ∩ PI 3 ∩ PI 4 ∩ PI 5 = � , then optimal output would be y k where k = (PI 1 ∩ PI 2 ∩ PI 3 ) ∪ (PI 4 ∪ PI 5 ). 3. If PI 1 ∩ PI 2 ∩ PI 3 = � , then optimal output will be y k with k = (PI 1 ∩ PI 2 ) ∪ (PI 4 ∪ PI 5 ). 4. If PI 1 ∩ PI 2 = � , then optimal output would be y k ; here, k = (PI 4 ∪ PI 5 ). 5. If (PI 4 ∪ PI 5 ) = � , then optimal output will be PI 5 .

Experimentation and implementation
Experimentation is made on two datasets. A medical diagnosis dataset which is utilized by [6,31,32] is proffered as Table 1.
Medical diagnosis dataset has patients A = {A 1 , A 2 , A 3 , A 4 } who show the symptoms are evinced as x = {x 1 , x 2 , x 3 , x 4 , x 5 } where x 1 indicates "temperature",x 2 stands for "headache",x 3 stands for "stomach pain",x 4 stands for "cough", and x 5 stands for "chest pain". The probable diseases are evinced as Y = {y 1 , y 2 , y 3 , y 4 } where y 1 stands for "Viral fever",y 2 stands for "Malaria",y 3 stands for "Typhoid", and y 4 stands for "Chest problem". Table 2 indicates the values that are possible as per the expert information. Grounded on the steps described in methodology, correlation matrix is proffered as Fig. 1 and is calculated. It is followed by the evaluation of LA and UA sets. The proposed performance indicator ( PI 5 ) is evaluated utilizing the FTOPSIS technique as elucidated in Step 7. Finally, the rules stated in the proposed work are applied for diagnosis of the disease. Table 3 evinces the calculations for ideal positive and negative solutions, and . Performance indicator PI 5 provides between the input samples and the outputs. Hence, for the medical diagnosis problem,y 1 exhibits greater to the input samples, i.e., patients. This result is completely consistent with the outcomes acquired utilizing the performance indicators proposed by [16] However, the below example clearly emphasizes the necessity of the proposed performance indicator i.e.PI 5 as PI 1 to PI 4 performance indicators produced ambiguous results. Consider the following HFS in X = {x 1 , x 2 , x 3 , x 4 , x 5 } which indicates the decision given by the risk evaluation committee. Let A = {A 1 , A 2 … A 10 } be the ten firms to be evaluated on the basis of criteria { x 1 :managers' work experience, x 2 :profitability, x 3 :operating capacity, x 4 : ability of paying debt, and x 5 : market competition}. The outcome is also provided as imprecise membership values as evaluated by the risk evaluation committee in the form of FS which is a special form of hesitant set [1]. The corresponding HFDM is proffered as Table 4. Y = {y 1 , y 2 , y 3 } where y 1 : corporate stability index, y 2 : survival index and y 3 :long-term economical growth. Let the correlation between the criteria x i and Y is provided by the risk evaluation committee as indicated in Tables 4 and 5.
The algorithm commences with the evaluation of score matrix for Table 6 as expounded in Definition 7. The evaluated score matrix is proffered in Table 6.
The NSM given in Table 7 facilitates the evaluation of entropy and weights (as in Definition 8) to have optimal attributes. The NSM given in Table 8 facilitates the evaluation of entropy and weights (as in Definition 8) for the computation of optimal attributes: The weight vector w j symbolizes the significance of the attributes. Therefore, further steps involve the computation of the weighted decision matrix proffered as Table 9 which    is attained by multiplying the elements of Table 8 with their respective column weights given by w j . Table 9 is same as Table 1, but the only difference is that the length of all sequences is made the same by extending the higher membership value for a specific sequence as stated by [1]. This updation in the HFDM is needed for the evaluation of the correlation matrix. Table 9 is then utilized to evaluate the correlation coefficient HFRS (A i , y i ) utilizing Definition 10. This further enables the calculation of m (A i , y i ) as evinced in Fig. 2 Figure 2 signifies the correlation of the ten firms A i which were to be evaluated centered on the criteria { x 1 :managers' work experience, x 2 :profitability, x 3 :operating capacity, x 4 : ability of paying debt, and x 5 :market competition with Y = {y 1 , y 2 , y 3 } , where { y 1 : corporate stability index, y 2 : survival index as well as y 3 : long-term economical growth. The output y 1 has a maximal degree of correlation (0.213) to the input samples A 8 , while the output y 2 has 0.205 (higher) to the input samples A 5 , and the output y 3 has 0.219 (higher) to the input samples A 1 and A 7 . As given in the methodology, the upper HFRA and lower HFRA are evaluated utilizing Definition 8. The outcomes are proffered as Tables 10 and 11 For calculating the performance indices, the equivalent score matrices of LA and UA hesitant FRSs are needed. These sets are evaluated and even tabulated as Table 12 and 13.
Calculation for PI 5 grounded on FTOPSIS approach is then carried out. The correlation matrix which is the input matrix for FTOPSIS is evaluated. Figure 2 details those correlation matrices between input samples and output samples. 0.08 0.14 0.11 0.05 0.14 A 10 0.14 0.09 0.11 0.21 0.12 Table 8 HFDM with repetition in required membership values  Table 9 Weighted hesitant fuzzy decision matrix The weights for y i for the further calculations are presumed to be 1 as all the outputs y i are equally significant. Figure 3 indicates the ideal positive solution and ideal negative solution as expounded in step 7. This follows the ED calculation which is evinced in Tables 14 and 15. Finally, Fig. 4 indicates calculation. Figure 4 evinces of the 'Y' output in respect of the input samples. The output y 1 has higher (0.58) to the input samples A i which means that the ten firms could provide a better corporate stability as contrasted to longterm economical growth and survival index. That means the output of the survival index ( y 2 ) as well as long-term economical growth ( y 3 ) gives of 0.44 and 0.51 to the   Table 16   Table 16 implies that the entire alternatives cannot be estimated utilizing an algorithm that is recommended in [16]. Column 6 has a letter I written for alternatives A 3 , A 4 , A 9 and A 10 .Those alternatives have PI 2 carrying two values. Zhang et al.'s algorithm [29] does not render a solution for these cases. Nevertheless, the proposed algorithm incorporated an additional performance indicator PI 5 that is centered on FTOPSIS, and this is capable of having a solution to the aforesaid ambiguity. The betwixt the input alternatives and output aids the fuzzy rough centered MCDM in making a suitable decision. Therefore, the proposed work exclusively renders correlations betwixt the input alternatives and the output in line with the evaluation criterion. It is concluded as of the aforesaid outcomes that the proposed DM can be properly employed to resolve the manifold and DM issues with completely unidentified attribute weights. The proposed work renders a helpful means for managing multicriteria fuzzy DM issues within attribute weights. An appropriate entropy weighting methodology derives the attribute weights as per alternative, and it picks the best alternative as per them.

Conclusion
The proposed work methodically modeled the MCDM for hesitant FRSs. An additional performance parameter "FTOPSIS centered " is also proposed here to resolve ambiguous cases effectively. And, this is confirmed via implementations on multiple datasets. Correlation matrix which shows the correlation of input alternatives with the output class grounded on a certain set of criteria eventually assists in computing the proposed FTOPSIS centered performance index. Entropy-centric weighing of the attribute aids in selecting the relevant as well as non-redundant attributes. Grounded on the volume of information, this entropy approach finds the index's weight for the attributes, which is the objective fixed weight methodology. The   . 4 Degree of closeness between input and output samples disorder degrees of the attributes and their utility in the system information are ascertained by Entropy. Finally, the evaluations of upper HFRA and lower approximate HFRA further facilitate the selection of optimum attributes. Thus, a generic approach which is hybrid entropy-centric optimal attribute selector, i.e., RSs and HFSs, shall effectually assist the researchers in vagueness and uncertain DM problems without an ambiguity. Utilizing this proposed entropy weight centric approach, the weights of the attributes are found and the appropriate attributes are selected which eradicates the disturbances (caused by man) and makes outcomes as per facts. The entropy weight together with FTOPSIS method is clear, simple, and reasonable when contrasted to fuzzy synthetic assessment and other evaluation approaches. Nevertheless, the entropy weighting approach merely regards the numerical discrimination degrees of the attribute index and disregards rank discrimination. These shortcomings signify that the entropy approach could not exactly reflect the significance of the index weight, thus causing distorted DM results. This problem can well be tackled in future. In addition, knowledge reduction is the notable content for the research of RS theory. Therefore, in the future, the proposed algorithm can be extended grounded on intervalvalued FRSs and type 2 FSs for knowledge reduction under complete information systems.
Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creat iveco mmons .org/licen ses/by/4.0/. A 1 y 1 y 1 y 1 y 3 y 1 y 1 y 1 A 2 y 1 y 2 y 1 y 3 y 1 y 1 y 1 A 3 y 1 y 1 and y 2 y 1 y 3 I y 1 y 1 A 4 y 1 y 1 and y 2 y 1 y 3 I y 1 y 1 A 5 y 1 y 2 y 1 y 3 y 1 y 1 y 1 A 6 y 1 y 1 y 1 y 3 y 1 y 1 y 1 A 7 y 1 y 1 y 1 y 3 y 1 y 1 y 1 A 8 y 1 y 2 y 1 y 3 y 1 y 1 y 1 A 9 y 1 y 1 and y 2 y 1 y 1 I y 1 y 1 A 10 y 1 and y 2 y 1 y 1 and y 2 y 3 I y 1 y 1