Introduction

The convergence of massive advancement in embedded computing, wireless communication and diverse sensor technology has fostered the emergence of WSN very swiftly. A WSN consists of enormous tiny devices called sensors to monitor the required field. A simple pictorial representation of WSN is shown in Fig. 1. There are numerous WSN applications, e.g. industrial monitoring, structural monitoring, climatic monitoring, defence, environmental monitoring, and health care [1, 2]. With the miniature size of Sensor Node (SN), there is a restriction of limited energy, storage, communication and computation. WSN has constraints in energy, computation and communication [3, 4]. Battery-operated SN depletes energy because of long-distance transmission to BS and redundant data processing. A single node failure may throw the network into an unreliable state. Thus, reducing energy consumption is a challenging issue that has attracted many researchers. Routing techniques which are capable of reducing energy consumption are highly desirable. Cluster-based routing has proved to be a promising approach [5,6,7]. Election of CH and formation of clusters are crucial parts in dragging out the lifespan of the network.

Fig. 1
figure 1

Representation of WSN

The incorporation of FL helps in efficiently handling the decision-making behaviour of human solving uncertainty. Since there are several overlapping parameters that affect energy consumption, thus, this uncertainty can be driven by FL. Furthermore, FL possesses the potential to deal with imprecision in data and conflicting situations using heuristic human reasoning without requiring a complex mathematical model [8]. Regardless of the evidential advantages of FL by its widespread successful deployment in diverse domains, there is a comparatively limited number of fuzzy-based routing algorithms than fuzzy-based clustering algorithm. Since most of the cluster-based routing protocols require only a simple decision-making process (i.e. single-hop transmission of data from CH to BS), and hence the use of FL is unnecessary. However, for energy-aware clustering and routing demand comprehensive decision-making, FL represents an effective approach [9].

A fuzzy-based system has four primary modules: Fuzzifier maps the crisp input value to fuzzy linguistic value along with the assignment of membership function (MF). Knowledge Base has a set of IF–THEN rules or conditions made by the user, which is considered by Inference Engine while making decisions and inferring or drawing conclusions. A fuzzy set is acquired by the Defuzzifier mapping it into a crisp output value.

Contribution of paper

This paper propounds E-FUCA, which is a distributed protocol for unequal clustering approach for protracting the stability span by balancing the load. The contributions made in this paper are as follows.

  • Maximal clustering protocols are probabilistic and elect CH based on larger residual energy, its aloofness from BS and density of node, which is insufficient for electing the suitable candidate for CH.

  • E-FUCA is an enhancement over the FUCA [10] protocol.

  • FUCA contemplates the remnant power, nearness to BS and node density for calculating rank and competition radius during CH election whereas, in E-FUCA, instead of node density, the average distance to communicating nodes is considered because node density is incapable of giving a complete insight of energy expenditure by CH. Still, average distance can provide a clear idea of communication cost to be carried by CH if selected.

  • In FUCA, during the formation of clusters, non-CH nodes make the greedy decision of choosing the nearest CH without any consideration of CH’s existing load. In contrast, in E-FUCA, non-CH nodes do not make a greedy decision but choose their CH intelligently based on its rank obtained during CH selection, closeness to that CH and number of nodes in its cluster radius obtained during CH selection.

  • FUCA considers BS near/corner of the target field, whereas E-FUCA considers BS’s location at the centre and at far off place.

  • For unequal clustering, we consider FL in the routing of information from CH to BS.

  • We have designed FIS for the selection of next-hop so that energy efficiency can further be enhanced.

  • For gauging the performance of E-FUCA, simulation experiments are performed and obtained results are contrasted with the state of the art approaches such as FUCA [10], LEACH [11] and URBD [12] protocol. Experimental results validate the prolonged stability period, larger average energy with load balancing.

  • The complexity of the proposed E-FUCA in terms of time and message is discussed and computed.

The rest of this paper is summarised as follows: discussion on pertinent work is done in “Pertinent work”. System and Energy model description is provided in “Wireless sensor network model”; the description of the proposed E-FUCA protocol is done in “The proposed approach: E-FUCA”. Simulation experiment and evaluation of performance is shown in “Simulation experiment and result analysis”. Lastly, a summary of the proposed E-FUCA protocol with concluding remarks is discussed in “Conclusion”.

Pertinent work

Energy efficiency is the demanding task of WSN, which can be provided by the clustering approaches. Some of the pertinent unequal clustering approaches are discussed in this section. The literature survey of any cluster-based proposed work is incomplete without the discussion of the LEACH [11] protocol. In the year 2000, Heinzelman et al. propound LEACH, which makes local decisions by adopting a probabilistic method for the selection of CH. For balancing the network load, CHs are rotated in each round as static CH prematurely expires in comparison to non-CH nodes in the network. The data aggregation is done at the CH level to minimise communication cost. Limitations of this protocol are that the CH selection is purely randomised, and crucial factors such as residual energy and aloofness from BS, which affect energy, are not put into consideration.

PRODUCE [13] protocol is proposed for eliminating the hot spot problem makes use of local probabilities for cluster formation of unequal size. CH nearer to BS focuses on inter-cluster communication, whereas CH at a distant place may focus on intra-cluster communication. It successfully balances the network load and prolongs the lifetime. EDUC [14] protocol, which is a distributed algorithm, evades the hot spot problem and energy dissipation in heterogeneous WSN. It involves the energy-driven rotation method of clusters. Every node in this protocol gets an opportunity to be CH in its lifetime. This method is not useful in multi-hop networks. LUCA [15] is based on probability to prevent the hotspot problem. The size of clusters varies with remoteness to BS. GPS is bundled with SN and is location-aware. A backoff timer is there with the randomised initial value. If an SN receives a message form CH, it joins it; else, it will proclaim its candidature. EADUC [16] protocol is designed to gather data periodically in WSN. The weight for CH candidature is based on remnant energy along with the degree of node and exhibits better performance in terms of lifetime. CHEF [17] is a fuzzy-based protocol wherein there are two inputs for FIS: local distance and remnant energy of node. For evaluating the fuzzified inputs and calculating the chance of a node to be chosen as coordinator of the cluster, there are nine fuzzy rules. CHUFL [18] is distributed protocol in which fuzzy inputs are remnant energy, reachability and distance to the BS. The non-CHs choose the nearest CHs to form clusters. A distributive clustering protocol, namely FBECS [19], is proposed, which assigns a pre-defined probability to SN based on distance from BS. It uses FL for the selection of adequate nodes for the role of CH.

An FL-based clustering algorithm is proposed in EAUCF [20], which uses remnant energy and remoteness to BS for electing CH. Nine IF–THEN fuzzy rules are used for selecting tentative CH. Competitive radius is calculated by each tentative CH its candidature. But this proposed work does not anticipate energy exhaustion due to large intra-communication resulting in fading the protocol performance. An improvement over EAUCF is FBUC [21], in which the tentative CHs are selected on a probabilistic method. Competition radius and Chance are computed during the CH election. For cluster formation, the non-CH nodes calculate the chance of each CH on the basis of density and distance to CH. The protocol achieves a better lifetime in comparison to LEACH and EAUCF. The proposed IFUC [22] protocol is capable of reducing energy consumption and lengthening network lifetime. For nominating CHs and computing the range of the cluster, FL is used. The factors considered are remnant energy, closeness to BS and density of nodes. SN with a greater chance is selected as the final CH. DUCF [23] protocol makes load balancing certain by cluster formation using FL. The inputs to the fuzzifier are remoteness to BS, node density and remnant energy. There are two output variables, namely size and chance. The size of the cluster is dependent on the chance obtained. Mamdani method is used for inference.

A distribution independent unequal clustering is propounded in MOFCA [24], which contemplates remnant energy, calculated density and remoteness to BS for clustering. For reducing the intra-cluster relay, the cluster radius is varied as per the remoteness to BS. It successfully addresses the hotspot and energy hole problem. A diverse approach for cluster formation is proposed in MCFL [25]. Those candidates who are most eligible are chosen as CH, and no re-clustering takes place for a few rounds so as to reduce the message exchange for forming clusters. Experimental results exhibit the good performance of the proposed work than its comparatives.

FUCA [10] is a probabilistic approach for unequal clustering. Input variables considered are closeness to BS, remnant energy and density. There are two output variables: rank and competition radius. Higher ranking nodes in the competition radius are elected as CH. It achieves better performance than its counterparts. URBD protocol [12] is another unequal clustering protocol that is based on the density of nodes for clustering. There are two phases in this protocol: CH selection and Member-Join. In this protocol, density and distance parameters are used in collaboration for cluster formation resulting in a longer lifetime.

Tian et al. proposed LEACHEN [26], which introduces a multi-hop clustering cum routing method, which takes into account the fuzzy output as well multipath tree for improving the efficiency of the network. For the routing of information in multipath mode, three input variables are considered, i.e. remaining energy, traffic load and minimum hops. However, in a real-world scenario, other factors should be taken into consideration. AlShawi et al. proposed a routing algorithm [27], which includes FL and an A* algorithm for lifetime enhancement. Remaining energy and Traffic load as the input parameter to the fuzzy system. Leabi and Abdalla [28] proposed a routing protocol using FL and an immune system that contemplates remnant energy and shortest hop for determining a route for communication to sink. This approach improves the efficiency of the network.

Jiang et al. proposed FLEOR [29] for optimised routing. FLEOR considers three factors in the fuzzy-based routing process. The inputs to the inference engine are the degree of closeness to sink (DCS), degree of closeness to the shortest path (DCSP) and degree of energy balance. NORIA [30] is a fuzzy-based routing protocol that considers a fuzzy rule set for parent election and role assignment in routing. The parameters fed to fuzzy systems are the number of hops to the BS and the remaining battery level. A fuzzy-based routing from node to sink is proposed in [28], which contemplates remaining energy and shortest hop as input variables to the fuzzy system for computing edge cost. The simulation results are compared with the Dijkstra routing technique. Haider and Yusuf proposed an energy-optimised approach based on FL [31]. They considered six input variables for the fuzzy system and computed the cost of the same. The simulation results exhibit a reliable and efficient approach, but if the size of the network grows, then the fuzzy system with six input variable will become more complex.

In the aforementioned approaches, a greedy decision is made by the non-CH nodes by choosing the nearest CH candidate for cluster formation, whereas some of these approaches use FL for calculating chance so that non-CH nodes may choose their respective CH based on the chance obtained. Most of these protocols do not consider routing along with clustering, which may limit the performance of the protocol. We have considered FL for all three cases, i.e. CH selection, cluster formation and routing for extending the lifetime of the protocol. Table 1 depicts the summary of the aforementioned approaches.

Table 1 Comparison of pertinent work

Wireless sensor network model

Maximising lifetime problem

Designing the architecture of WSN is a very challenging task as the SNs have limited power, computational capability and memory [3]. Energy consumption is the most significant among the three factors as the power source (battery) is irreplaceable. One of the promising solution to achieve energy efficiency is clustering [11]. In the cluster-based routing, deployed SNs are divided into clusters, and one of the SN plays the role of CH. If the CH is inefficient, then the protocol could not maximise energy efficiency [14]. FL has been used in the selection of efficient CHs [32], which has improved the lifetime of the WSN. Even if we select the efficient CHs, then also in most of the protocols, the data are transmitted directly to BS, which limits the performance of the protocol. For maximising lifetime, energy-efficient clustering and routing algorithm have to be in place.

System model

In the proposed protocol, the network has homogeneous nodes with battery level at par, i.e. all the SNs are having the same energy level when they are deployed. The SNs have dispersed arbitrarily over the target field. Once the network gets operational, BS and SNs are immobile, i.e. neither the BS nor the SNs will change their location. There is a continuous power supply to the BS. The radio in SNs is capable of directional communication to conserve energy. The battery of SN is irreplaceable/non-rechargeable as in typical deployment, and the SNs are left unattended once deployed. Once the SNs are deployed, each SN will broadcast a hello_message. The separation distance between the two SNs is computed by the Received Signal Strength Index (RSSI). With RSSI, the SNs can estimate the location of other nearby SNs. SN is presumed to be lifeless only if the battery supply is fully drained. There is no constraint for BS in terms of processing and storage.

Energy consumption model

In E-FUCA, the radio energy model used in FUCA [10] is adapted. The energy of the WSN may get drained in transmission, amplification, reception, sensing, aggregation.

The energy dissipated for transmitting (ETx) and receiving (ERx) s bits over distance d is given by the following equations:

$$ E_{Tx} (s,d)\, = \left\{ {\begin{array}{*{20}c} {sE_{{{\text{elec}}}} + s\varepsilon_{fs} d^{2} ,} & {d < d_{o} } \\ {sE_{{{\text{elec}}}} + s\varepsilon_{mp} d^{4} ,} & {d \ge d_{o} } \\ \end{array} } \right., $$
(1)
$$ E_{Rx} \,(s) = E_{{Rx - {\text{elec}}}} (s) = s.E_{{{\text{elec}}}} , $$
(2)

where Eelec is the energy dissipated in electronic circuitry, do is a threshold that determines either free space (\(\varepsilon_{fs}\)) or multipath (\(\varepsilon_{mp}\)) model adopted and it can be calculated by the following equation:

$$ d_{0} = \sqrt {\frac{{\varepsilon_{fs} }}{{\varepsilon_{mp} }}} . $$
(3)

In amplification of the signal, energy (Eamp) dissipated is calculated by the following equation:

$$ E_{{{\text{amp}}}} \, = \left\{ \begin{gathered} \varepsilon_{fs} d^{2} ,{\kern 1pt} {\kern 1pt} {\kern 1pt} {\kern 1pt} {\kern 1pt} {\kern 1pt} if{\kern 1pt} {\kern 1pt} {\kern 1pt} {\kern 1pt} {\kern 1pt} {\kern 1pt} d < d_{o} \hfill \\ \varepsilon_{mp} d^{4} ,{\kern 1pt} {\kern 1pt} if{\kern 1pt} {\kern 1pt} {\kern 1pt} {\kern 1pt} {\kern 1pt} {\kern 1pt} {\kern 1pt} {\kern 1pt} {\kern 1pt} {\kern 1pt} d \ge d_{o} \hfill \\ \end{gathered} \right.\,. $$
(4)

For a CH, the amount of energy (ECH) exhausted in a round is computed by the following equation:

$$ E_{CH} \, = ns(E_{elec} + \varepsilon_{fs} d_{CM}^{{}} + E_{DA} ), $$
(5)

where dCM is the distance to cluster members and EDA is the energy exhausted in data aggregation.

For a non-CH node, the energy (EnCH) dissipated is computed by the following equation, in which dCH is the distance from its CH:

$$ E_{nCH} \, = s(E_{{{\text{elec}}}} + \varepsilon_{fs} d_{CH}^{{}} ) $$
(6)

Decision variables

Residual energy is considered because a lower energy node is not suitable for CH candidature as it is a resource-intensive task. Residual energy can be calculated by

$$ R\_ E\left( {{\text{Node}}\left( i \right)} \right) = \overline{E} - \overline{e}, $$
(7)

where Ē is the initial energy level during deployment, and ē is the energy dissipated till now.

Closeness to BS is vital for consideration as CH candidates need to forward the accumulated data. If this distance is too long, then the node will dissipate more energy. The competition radius is inversely proportional to this distance as a closer node will have a larger radius as compared to the node at far off place from BS. The closeness to BS can be calculated as

$$ \delta_{BS} \left( {{\text{node}}\left( i \right)} \right) = \sqrt {\left( {{\text{node}}\left( i \right).x - BS.x} \right)^{2} + \left( {{\rm{node}}\left( i \right).y - BS.y} \right)^{2} } . $$
(8)

Average distance is crucial in calculating rank because the intra-cluster communication cost is dependent on the separation distance. The average distance from a node (i) can be computed as

$$ {\text{Avg}}\_{\rm{dis}}\left( {{\rm{node}}\left( i \right)} \right) = \frac{1}{m + 1}\left\{ {\sum\limits_{k = 1}^{m} {d_{k} } + \delta_{BS} } \right\}, $$
(9)

where dk is the distance to communicating nodes, δBS is the remoteness to BS.

Rank determines the candidature weight of an SN to become CH. The higher the rank of SN, the higher will be the probability of the SN to be selected as CH. The rank of each SN can be computed using FL, as shown in Fig. 2.

Fig. 2
figure 2

FIS designed for CH selection in E-FUCA

The closeness to CH is considered because, to reduce the intra-cluster communication cost, cluster members should be closer to CH. It can be calculated by

$$ CN\_CH\left( {{\text{node}}\left( i \right)} \right) = \sqrt {\left( {{\text{node}}\left( i \right).x - CH_{i} .x} \right)^{2} + \left( {{\text{node}}\left( i \right).y - CH_{i} .y} \right)^{2} } , $$
(10)

where node(i).x, node(i).y are x and y coordinates of the node and CHi.x, CHi.y are the coordinates of CH under consideration.

The number of nodes in a cluster radius is useful to determine if the cluster is overcrowded, then it will increase the burden on CH as it has to expend more power in receiving data from a large number of SNs. It can be calculated by

$$ ND\left( {{\text{node}}\left( i \right)} \right) = \mathop \sum \limits_{j = 1}^{n} {\text{node}}\left( j \right) \,{\rm{s}}{\text{.t}}{.}\,d\left( {{\rm{node}}\left( i \right),{\text{ node}}\left( j \right)} \right) < = \, K^{\prime}_{i} , $$
(11)

where d() represents the distance between two nodes, and i is the cluster radius of a node (i).

The distance reduced to BS is considered because we need to ensure that there is a significant reduction of distance after each hop. It can be calculated as

$$ \dot{D}\left( {{\text{node}}\left( i \right)} \right) = \delta_{BS} \left( {{\text{node}}\left( i \right)} \right) - d\left( {{\rm{node}}\left( i \right),{\rm{node}}\left( j \right)} \right), $$
(12)

where \(\delta_{BS}\) is the distance to BS, and d() represents the distance between two nodes.

The proposed approach: E-FUCA

The proposed protocol “E-FUCA” is designed to enhance the stability period to make the network more reliable as well as achieving a load-balanced network. E-FUCA is an improvement over the FUCA [10] protocol. The improvements can be enumerated in the following ways: first, In E-FUCA, for computing the rank of a node for CH candidature, the average distance to communicating nodes is calculated as one of the parameters together with remnant energy and aloofness to BS, unlike FUCA which considers node density, residual energy and aloofness from BS. Merely calculating the node density does not fulfil the requirement as the communication cost cannot be calculated only on the basis of node density. Some SNs may be nearer, and some SNs may be at far off place. Thus, to determine the nearest approximation of communication cost, the average distance may serve the purpose instead of node density. Second, in FUCA, during the cluster formation, non-CH nodes select the closest CH without determining the overall load on the CH candidate. In our protocol, the non-CH node will calculate the CH chance to determine which cluster must be joined. This CH chance is calculated on the basis of three parameters; the rank of CH, closeness to that CH and number of nodes in CH competition radius. This chance will help in minimising the extraneous energy dissipation in intra-cluster communication. Third, in FUCA, there is no focus on the routing of data, but in the proposed E-FUCA protocol, the fuzzy-based routing algorithm is designed to further prolong the network's lifetime. The working of the designed protocol is partitioned into rounds. In each round, there are three stages, selection of CH, Cluster formation and Data dissemination stage.

Selection of CH

In this phase, the selection of the CHs is decided on the basis of their characteristics. At the initiation of a round, a random number is generated by every node for becoming tentative CH. The threshold probability (TProb) is compared with the generated number. If the number is less than TProb, then the node becomes a tentative CH. Once the tentative CHs are determined, these nodes calculate their rank using designed FIS, as shown in Fig. 2.

The calculation of rank and competition radius is done using three input variables: remnant energy, the average distance to communicating nodes and closeness to BS.

In FUCA, node density is considered. Node density cannot determine exactly the energy consumption by the CH node for intra-communication. This can be illustrated in Example 1.

Example 1

Suppose there are two nodes N1 and N2, competing for CH candidature. Their current energy level is 0.3 J, and closeness to BS is 150 m with equal node density as 10 (i.e. there are ten neighbouring nodes). FUCA protocol will generate equal rank for both the nodes N1 and N2 as all the values passed on to the FIS are the same because it does not consider the distance to the neighbouring nodes. In the case of the E-FUCA protocol, for N1 and N2, it will compute the average distance to all the communicating nodes. Thus, the rank generated for both the nodes N1 and N2 will be different, which will give a better perspective for CH candidature.

There are two output variables: rank and competition radius. Rank determines the candidature weight of an SN. The higher the rank of SN, the higher will be the probability of the SN to be selected as CH. Competition radius determines the radio range of a node within which it can communicate. It may vary according to the rank obtained by SN as a low energy node ought not to communicate to a longer radio range as it will lead to quicker energy dissipation in intra-cluster communication. The MF plots for input and output variables are shown in Figs. 3 and 4.

Fig. 3
figure 3

MF for input variables in CH selection

Fig. 4
figure 4

MF for output variables in CH selection

We have used Trapezoidal and Triangular MF for boundary and intermediate variables, respectively, because they provide faster calculation and are simpler to implement. Each MF has to satisfy one condition that its degree of membership should range from 0 to 1. There are other MFs that can also be used like Sigmoid, Bell, Gaussian etc. but proposed E-FUCA depicted better results with Triangular and Trapezoidal MF. The linguistic variables which are used are shown in Table 2. The input variables are fed to the designed fuzzy inference system, and IF–THEN rules are applied to calculate rank and competition radius, which are described in Table 3. Here, the Mamdani Inference method [33] is applied, which is most commonly used [19, 34] because of its simplicity and characteristics.

Table 2 Linguistic variables for input and output in CH selection
Table 3 Fuzzy rules for rank and competition radius

For defuzzification, the centre of area method is used to obtain crisp value from output linguistics variables. After the calculation of rank and competition radius, tentative CH nodes broadcast their candidature within the competition radius, and a higher ranking node is selected as CH. The selection procedure of CH is described in Algorithm 1.

figure a

Cluster formation

After the completion of the CH selection procedure, all the nodes which are not selected for the CH role need to make the decision to join the appropriate cluster. In most of the protocols, the non-CH nodes make a greedy decision of joining the nearest CH without determining the load on that CH. In this proposed protocol, the overall load of the CH candidate is already determined by its rank, which is computed during the CH election on the basis of its closeness to BS, its current energy level and average distance to nearby nodes. The decision of choosing the CH by the non-CH node is supported by the designed FIS as shown in Fig. 5, and the MF functions used for the input variables and output variables are presented in Fig. 6 and Fig. 7.

Fig. 5
figure 5

FIS designed for cluster formation in E-FUCA

Fig. 6
figure 6

MF for input variables in cluster formation

Fig. 7
figure 7

MF for output variables in cluster formation

Non-CH nodes calculate the chance of each CH on the basis of IF–THEN rules applied to the inputs: CH_Rank, number of nodes in the competition radius of CH and distance to that CH. Explanation to support this intelligent decision is described in Example 2.

Example 2

Suppose there are two CH nodes C1 and C2. A non-CH node (N1) needs to choose a CH between C1 and C2. Suppose rank of C1 = 2 and rank of C2 = 98. Distance from N1 to C1 is 14 m, and C2 is 16 m. According to FUCA protocol, N1 will take a greedy decision and directly choose C1 as its CH without considering its low rank, which could be due to low energy, a large number of neighbouring nodes and a large distance to BS. If all the nodes make greedy decisions like this, then it could result in more power dissipation as CH responsibility is a resource-intensive task. In the case of the E-FUCA protocol, node N1 will take this decision intelligently by considering the rank of CH, the number of nodes in the competition radius of CH and closeness to CH before choosing its CH. Finally, it will choose C2 as its CH, although C1 is closer to N1. This will result in reducing the load on low-rank CH nodes and balancing the energy dissipation by the CH nodes, thereby contributing an extension of the lifetime of the network.

The linguistic variables used in input and output variables are depicted in Table 4.

Table 4 Linguistic variables for input and output in cluster formation

The IF–THEN rules applied for determining the chance of CHs are described in Table 5. After the calculation of the chance of each CH node, the non-CH node joins the CH, which is having the highest chance value by transmitting a join request (JOIN_REQ) message. The CH node accepts the request received from all non-CH nodes and forms the cluster. The cluster formation procedure is explained in Algorithm 2.

Table 5 Fuzzy rules for computing chance of CH
figure b

Data dissemination

Once the clustering process gets completed, the data dissemination stage begins. SNs sense the target area and generate the data on a periodic basis. SNs forward the collected data to their respective CH as per the TDMA slot for preventing loss of data in a collision. Once the CHs collect data from all their cluster members, it compresses the data prior to forwarding it to the BS. In most of the protocols, CHs forward data directly to BS, which depletes a large amount of energy of the CHs. For conserving the energy in forwarding the data from CH to BS, CH will make a decision using a designed FIS, which takes three inputs, namely, next-hop rank, nearness to next-hop and distance reduced to BS as shown in Fig. 8 for computing the cost of next-hop. The next-hop can be one of the chosen CHs or BS. The LV for input and output variables are shown in Table 6. The MF for input and output variables is depicted in Figs. 9 and 10. The CH calculates the eligibility of every other next-hop CH nodes, which are in the direction of BS, using the IF–THEN rules designed for mapping inputs to output which are shown in Table 7.

Fig. 8
figure 8

FIS designed for routing in proposed E-FUCA protocol

Table 6 Linguistic variables for input and output in routing
Fig. 9
figure 9

MF for input variables in routing

Fig. 10
figure 10

MF for output variables in routing

Table 7 Fuzzy rules for computing cost of next-hop

With an objective to minimise the distance as well as preventing the intermediate CH nodes from overburden during data forwarding, the CH selects the next-hop CH node having maximum eligibility. Once the best next-hop CH node is selected, the current CH checks its remoteness to BS as well as the distance to the next-hop. If the distance to BS is shorter, then it will forward the data to the BS; else, it will forward the data to the next-hop. The process of forwarding the data is elaborated in Algorithm 3.

figure c

In this manner, all the CHs forward data to BS for further processing and completes one round of proposed work. For a better understanding of the complete flow of the proposed work, we have drawn a flow chart, as shown in Fig. 11, describing the steps involved in clustering and routing of proposed work.

Fig. 11
figure 11

Flow chart of proposed E-FUCA protocol

Simulation experiment and result analysis

For evaluation of the proposed E-FUCA protocol, simulation experiments are performed extensively for E-FUCA, DEFL [8], URBD [12], FUCA [10] and LEACH [11] under four scenarios in MATLAB, and experimental results are obtained. In scenario-1, the field size is chosen as 200 × 200 m2 with 100 SNs having 1 J of initial energy and the position of BS is kept at a distant position from the field, i.e. (100, 300). In scenario-2, the field size is similar to scenario-1, and the BS is kept at the centre of the field, i.e. (100, 100). There are 200 SNs with an initial energy of 0.5 J. In scenario-3, the field size is 300 × 300 with 300 nodes with 0.5 J. The position of BS is kept at the bottom centre, i.e. (150, 0). In scenario-4, the field size is 500 × 500 with BS located at (0, 500), i.e. at the top-left position of the field. 500 SNs are deployed with 0.5 J of energy. All four scenarios are shown in Fig. 12. The reason behind choosing these four scenarios is that the proposed protocol can be applied to any type of application wherein the position of BS either can be at the centre of the field or beyond the boundaries of the target area at a remote place. The experimental values considered for different parameters are stated in Table 8. For the evaluation and comparison of the E-FUCA with FUCA, LEACH, URBD and DEFL, the performance metrics chosen are Stability period, Total Average Energy, Total Alive nodes, Quarter Node Death (QND) and Half Node Death (HND).

Fig. 12
figure 12

Network scenarios for E-FUCA

Table 8 Description of parameters used for simulation

Since the objective of WSN is to collect surrounding information, it is necessary that all the SN deployed should be alive so that cent per cent coverage is guaranteed. Reliability, in terms of coverage, is directly proportional to the stability period [35, 36]. Figure 13 exhibits the performance of E-FUCA, FUCA, LEACH, URBD and DEFL protocols in terms of Stability period for four scenarios.

Fig. 13
figure 13

Stability period

The stability period determines the round in which the death of the first node occurred in the network [36]. The larger the stability period, the more the protocol will be reliable because of the complete coverage. We can see that for Scenario-1, the stability period of the E-FUCA protocol is 147.47%, 87.9%, 70.24% and 26.10% better than LEACH, FUCA, URBD and DEFL protocols, respectively. Similarly, for Scenario-2, it is 157.89%, 99.8%, 84.21% and 42.03% enhanced as compared to LEACH, FUCA, URBD and DEFL, respectively. The stability period of E-FUCA over LEACH, FUCA, URBD and DEFL is protracted by 282.50%, 130.94%, 59.38% and 47.12% for scenario-3 and 983.33%, 490.91%, 136.36% and 85.71% for scenario-4, respectively. The proposed E-FUCA has performed tremendously well in terms of stability period because not only the best candidate is chosen for the CH role, but also non-CH nodes take the intelligent decision of selecting the appropriate CH.

In Fig. 14, a graph for QND is plotted for four scenarios. In this graph, an assessment of the performance of the proposed E-FUCA in terms of the first quarter of nodes death can be seen. For scenario-1, E-FUCA has performed 84.33%, 72.15%, 29.38% and 21.19% better than LEACH, FUCA, URBD and DEFL protocols, respectively, and for Scenario-2, it is 79. 96%, 42.92%, 24.28% and 15.10%. Likewise, in scenario-3, E-FUCA has shown improvement of 158.57%, 123.46%, 57.39% and 39.23% over LEACH, FUCA, URBD and DEFL protocols, respectively. Significant enhancement in QND can be seen for scenario-4, where E-FUCA boosted QND by 212.73%, 177.42%, 82.01% and 77.32% over LEACH, FUCA, URBD and DEFL, respectively.

Fig. 14
figure 14

QND for four scenarios

Figure 15 depicts the performance of E-FUCA, LEACH, FUCA, URBD and DEFL protocols in terms of HND. We have contemplated HND only because once half of the nodes are dead; then the complete coverage cannot be guaranteed in most of the cases. In scenario-1, the HND of the proposed E-FUCA protocol is enhanced by 31.81%, 26.03%, 16.95% and 13.11% over LEACH, FUCA, URBD and DEFL protocols, and for Scenario-2, it is extended by 52.21%, 43.20%, 18.66% and 12.27% more than LEACH, FUCA, URBD and DEFL protocols, respectively. In the case of scenario-3, E-FUCA increased HND by 102.56%, 68.09%, 33.90% and 16% over LEACH, FUCA, URBD and DEFL protocols. Similarly, for scenario-4, it is incremented by 176.92%, 144.07%, 35.42% and 22.73% over LEACH, FUCA, URBD and DEFL protocols, respectively.

Fig. 15
figure 15

HND for four scenarios

Figure 16 presents the total average energy of the network for four scenarios. We can observe that the total average energy of the E-FUCA is dissipating at a very slow rate as compared to LEACH and FUCA. It can be clearly observed that the LEACH protocol poorly performed as compared to FUCA, URBD, DEFL and E-FUCA because it does not consider the crucial parameters during CH selection that affect the energy of the network. FUCA protocol has performed better than LEACH but poor in comparison to E-FUCA because it adapts a greedy approach in cluster formation as non-CH nodes choose the closest CH irrespective of considering its existing load. URBD has better performance than FUCA and LEACH because it considers density and distance in cluster formation but has poor performance than E-FUCA because E-FUCA considers average distance instead of node density.

Fig. 16
figure 16

The total energy of the network for four scenarios

In Fig. 17, total alive nodes for different round slices are presented for the four scenarios considered. For scenario-1, we can observe that all nodes are alive in E-FUCA protocol approximately up to 950 rounds, whereas in LEACH and FUCA protocol, the count of the alive node is merely 55%, and for URBD and DEFL protocols, almost 30% of nodes are dead. It can be clearly observed in scenario-2 that E-FUCA performs extremely better than its comparatives. Up to 1500 rounds in E-FUCA protocol, all the nodes are alive, whereas, in the case of LEACH and FUCA, less than 50 per cent of the nodes are alive in the network. In the URBD and DEFL protocols, almost a quarter of nodes are dead in the network, which is poorer as compared to E-FUCA. In scenario-3, almost all the nodes are alive up to 800 rounds in E-FUCA protocol, whereas in case LEACH and FUCA, the network has expired. If we talk about the URBD protocol, less than a quarter of nodes are alive, but in the case of DEFL, one-third of nodes are alive in the network. In scenario-4, at 300 round, LEACH and FUCA lost more than three-fourth of deployed nodes, URBD lost more than three-fifth nodes, and DEFL lost a quarter of nodes, whereas the proposed E-FUCA protocol lost only one-tenth nodes. E-FUCA has shown better performance because it considers influential parameters during the CH election. In addition, at the time of cluster formation, non-CH nodes make an intelligent decision of choosing their CH by determining its existing load.

Fig. 17
figure 17

Total alive nodes for four scenarios

Complexity analysis of E-FUCA

Time complexity

There are total n nodes deployed in the network. For the CH selection, each node will compute its rank and competition radius independently. In the worst case, an SN will make (n − 1) number of comparisons of rank for getting itself elected as CH, as shown in Algorithm 1. Therefore, for n nodes, a total n (n − 1) number of comparisons occur for CH selection. For the formation of the cluster, every non-CH node will calculate the chance of each node in the CH_NODE list. Thus, in the worst case, there will be (n − 1) comparisons. If there are k CHs, then in the case of routing, there will be k comparisons. Therefore, the complexity of the E-FUCA Protocol in terms of BIG-OH will be O(n2).

Message complexity

At the commencement of each round, all the SN generate an RN and if that RN < Tprob, then that SN broadcasts a message (CH_MSG). Let the number of CH be k for each round. Therefore, the total CH_MSG messages will be k. The non-CH nodes will transmit a message (JOIN_REQ) to CH, which will be (n-k). TDMA schedule will be broadcast to cluster members who will be equal to k. Thus, the total number of messages exchanged for a selection of CH and the formation of clusters in a round will be k + (n − k) + k = n + k. In the case of routing, the total messages forwarded will be k. Thus, the message complexity of the proposed protocol will be O(n).

Conclusion

While designing WSN, the proliferation of energy efficiency is a key concern. Distributing the load among all nodes at par may result in a better stability period. E-FUCA is designed to enhance the performance of FUCA protocol by considering remnant energy, closeness to BS and average distance to nearby nodes instead of node density during CH election. In addition, in the E-FUCA protocol, non-CH nodes intelligently determine the prevailing load of CH before making a decision of selecting its CH. Energy-efficient Fuzzy-based next-hop selection is proposed for protracting network lifetime. The experimental evaluation of the propound work is carried out for four different cases wherein the BS position is kept at various places in the area of interest, meeting the requirement of all kinds of applications. The simulation results proclaim remarkable performance of E-FUCA over LEACH, FUCA, URBD and DEFL in all four scenarios in context to stability period, QND, HND, total average energy and total alive nodes.