Skip to main content

Advertisement

Log in

A Learning Automata Based Stable and Energy-Efficient Routing Algorithm for Discrete Energy Harvesting Mobile Wireless Sensor Network

  • Published:
Wireless Personal Communications Aims and scope Submit manuscript

Abstract

Wireless sensor networks (WSN) have been widely used in urban network system and networked monitoring system, which provide easy connectivity and high physical data rate. Considering the battery-limited property of sensor nodes, recently, energy harvesting (EH) technology is introduced into WSN, which can alleviate traditional WSN problems (energy consumption, energy equilibrium, transmission efficiency, etc.). Current EH-WSN routing algorithms generally use the continuous energy harvesting mode, therefore, how to design an efficient routing algorithm for discrete energy harvesting mode and ensure the overall energy balance and conservation is still a great challenge and needs to be solved. Especially, under the mobile environment, the impact of route stability needs to be considered, which makes the design more complicated. To address the above problems, we propose a learning automata (LA) theory based stable and energy-efficient routing algorithm for discrete EH-mobile WSN (DEH-LA-SERA, for short). Firstly, we construct a multi-factors measurement model for sensor nodes, which contains node stability model, energy ratio function, expected harvesting energy model (using Markov decision process method) and direction judgement model. On this basis, we derive the node weighted value, i.e., selecting probability, which can be used to determine whether a node can be chosen as relay node. Secondly, with the help of LA theory, we construct a feedback mechanism to adjust the optimal path. With this solution, we can ensure the overall energy balance and conservation while holding the stability of selected path. As demonstrated in simulation experiments, our algorithm, DEH-LA-SERA, achieved the best performance in route survival time, energy consumption, energy balance and acceptable performance in end-to-end delay and packets delivery ratio.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig. 3
Fig. 4
Fig. 5
Fig. 6
Fig. 7
Fig. 8
Fig. 9

Similar content being viewed by others

Notes

  1. The expression of feedback for LA theory has different derivative forms.

  2. If the transmission area of node is a hexagonal shape [33], the transmission range R of node is bounded by the radius \(R_h\) of this hexagonal shape, i.e., \(R\approx 0.91 R_h\).

  3. The vertical distance means the length from a node to the line segment connecting source and destination.

  4. The probability of this extreme case occurrence is related to the degree of network sparsity and node distribution.

  5. Overhearing technique [35, 36] has been greatly improved which only needs a small amount of energy consumption.

  6. Noting that different mobility mode needs different mathematics modeling and different parameters setting.

  7. The energy setting has many different patterns, here we only give one pattern, which is available for NS-3.

  8. Based on our data transmission velocity setting, the forwarding delay is much less than the transmission delay.

  9. In [9], frequency is used to measure the node stability.

  10. It needs to note that the frequency is also affected by the transmission range, direction change and network density.

References

  1. Akyildiz, I. F., Su, W., Sankarasubramaniam, Y., & Cayirci, E. (2002). Wireless sensor networks: A survey. Computer Networks, 38(4), 393–422.

    Article  Google Scholar 

  2. Al-Karaki, J. N., & Kamal, A. E. (2004). Routing techniques in wireless sensor networks: A survey (pp. 6–28). Piscataway: IEEE Press.

    Google Scholar 

  3. Shaikh, F. K., & Zeadally, S. (2016). Energy harvesting in wireless sensor networks: A comprehensive review. Renewable and Sustainable Energy Reviews, 55, 1041–1054.

    Article  Google Scholar 

  4. Sarma, H. K. D., Mall, R., & Kar, A. (2016). \(\text{ E }^2 \text{ R }^2\): Energy-efficient and reliable routing for mobile wireless sensor networks. IEEE Systems Journal, 10(2), 604–616.

    Article  Google Scholar 

  5. Sarma, H. K. D., Kar, A., & Mall, R. (2016). A hierarchical and role based secure routing protocol for mobile wireless sensor networks. Wireless Personal Communications, 90(3), 1067–1103.

    Article  Google Scholar 

  6. Tamandani, Y. K., & Bokhari, M. U. (2015). SEPFL routing protocol based on fuzzy logic control to extend the lifetime and throughput of the wireless sensor network. Wireless Networks, 22(2), 1–7.

    Google Scholar 

  7. Ye, R., Boukerche, A., Wang, H., Zhou, X., & Yan, B. (2017). \(\text{ E }^3\)TX: an energy-efficient expected transmission count routing decision strategy for wireless sensor networks. Wireless Networks, 3, 1–14.

    Article  Google Scholar 

  8. Li, F., & Wang, L. (2018). Energy-aware routing algorithm for wireless sensor networks with optimal relay detecting. Wireless Personal Communications, 98(2), 1701–1717.

    Article  MathSciNet  Google Scholar 

  9. Mottaghinia, Z., & Ghaffari, A. (2018). Fuzzy logic based distance and energy-aware routing protocol in delay-tolerant mobile sensor networks. Wireless Personal Communications, 3, 1–20.

    Google Scholar 

  10. Khasawneh, A., Latiff, M. S. B. A., Kaiwartya, O., & Chizari, H. (2017). A reliable energy-efficient pressure-based routing protocol for underwater wireless sensor network. Wireless Networks, 24(6), 2061–2075.

    Article  Google Scholar 

  11. Kansal, A., & Srivastava, M. B. (2003). An environmental energy harvesting framework for sensor networks. In Proceedings of international symposium on low power electronics and design (pp. 481–486), Seoul, South Korea. IEEE.

  12. Kansal, A., Hsu, J., Zahedi, S., & Srivastava, M. B. (2007). Power management in energy harvesting sensor networks. ACM Transactions on Embedded Computing Systems, 6(4), 1–35.

    Article  Google Scholar 

  13. Tan, Y. K., & Panda, S. K. (2010). Optimized wind energy harvesting system using resistance emulator and active rectifier for wireless sensor nodes. IEEE Transactions on Power Electronics, 26(1), 38–50.

    Google Scholar 

  14. Kimball, J. W., Kuhn, B. T., & Balog, R. S. (2009). A system design approach for unattended solar energy harvesting supply. IEEE Transactions on Power Electronics, 24(4), 952–962.

    Article  Google Scholar 

  15. Muhammad, U. B., Ezugwu, A. E., Ofem, P. O., Rajamäki, J., & Aderemi, A. O. (2017). Energy neutral protocol based on hierarchical routing techniques for energy harvesting wireless sensor network. Proceedings of American Institute of Physics Conference Series, 1836(1), 921–960.

    Google Scholar 

  16. Tang, W., Zhang, K., & Jiang, D. (2018). Physarum-inspired routing protocol for energy harvesting wireless sensor networks. Telecommunication Systems, 32, 1–18.

    Google Scholar 

  17. Liu, Z., Yang, X., Zhao, P., & Yu, W. (2016). On energy-balanced backpressure routing mechanisms for stochastic energy harvesting wireless sensor networks. International Journal of Distributed Sensor Networks, 8(12), 1–11.

    Google Scholar 

  18. Lu, T., Liu, G., & Chang, S. (2018). Energy-efficient data sensing and routing in unreliable energy-harvesting wireless sensor network. Wireless Networks, 24(2), 611–625.

    Article  Google Scholar 

  19. Hieu, T. D., Dung, l T., & Kim, B. S. (2016). Stability-aware geographic routing in energy harvesting wireless sensor networks. Sensors, 16(5), 1–15.

    Article  Google Scholar 

  20. Sun, G., Shang, X., & Zuo, Y. (2018). La-CTP: Loop-aware routing for energy-harvesting wireless sensor networks. Sensors, 18(2), 434–453.

    Article  Google Scholar 

  21. Tang, J., Liu, A., Zhang, J., et al. (2018). A trust-based secure routing scheme using the traceback approach for energy-harvesting wireless sensor networks. Sensors, 18(3), 751–793.

    Article  Google Scholar 

  22. Chin, K. W., Wang, L., & Soh, S. (2016). Joint routing and links scheduling in two-tier multi-hop RF-energy harvesting networks. IEEE Communications Letters, 20(9), 1864–1867.

    Article  Google Scholar 

  23. Ashraphijuo, M., Aggarwal, V., & Wang, X. (2015). On the capacity of energy harvesting communication link. IEEE Journal on Selected Areas in Communications, 33(12), 2671–2686.

    Article  Google Scholar 

  24. Trillingsgaard, K. F., & Popovski, P. (2014). Communication strategies for two models of discrete energy harvesting In Proceedings of IEEE international conference on communications (pp. 2081–2086), Sydney, NSW, Australia. IEEE.

  25. Narendra, K. S., & Thathachar, M. A. L. (2012). Learning automata: An introduction. USA:DBLP.

  26. Thathachar, M. A. L., & Sastry, P. S. (1987). A hierarchical system of learning automata that can learn the globally optimal path. New York: Elsevier Science Inc.

    Book  MATH  Google Scholar 

  27. Beigy, H., & Meybodi, M. R. (2011). Utilizing distributed learning automata to solve stochastic shortest path problems. International Journal of Uncertainty, Fuzziness and Knowledge-Based Systems, 14(05), 591–615.

    Article  MathSciNet  MATH  Google Scholar 

  28. Kim, J., & Lee, J. W. (2017). Energy adaptive MAC for wireless sensor networks with RF energy transfer: Algorithm, analysis, and implementation (pp. 1–15). Alphen aan den Rijn: Kluwer Academic Publishers.

    Google Scholar 

  29. Guo, S., Shi, Y., Yang, Y., et al. (2017). Energy efficiency maximization in mobile wireless energy harvesting sensor networks. IEEE Transactions on Mobile Computing, PP(99), 1–1.

    Google Scholar 

  30. Huang, L. (2017). Optimal sleep-wake scheduling for energy harvesting smart mobile devices. IEEE Transactions on Mobile Computing, 14(2), 1394–1407.

    Article  MathSciNet  Google Scholar 

  31. Zhang, H., Huang, S., Jiang, C., et al. (2017). Energy efficient user association and power allocation in millimeter wave based ultra dense networks with energy harvesting base stations. IEEE Journal on Selected Areas in Communications, PP(99), 1–1.

    Google Scholar 

  32. West, D. B. (2005). Introduction to graph theory (2nd ed., p. 260). New York: McGraw-Hill Higher Education.

    Google Scholar 

  33. Zonoozi, M. M., & Dassanayake, P. (1997). User mobility modeling and characterization of mobility patterns. IEEE Journal on Selected Areas in Communications, 15(7), 1239–1252.

    Article  Google Scholar 

  34. Mcdonald, A. B., & Znati, T. (1999). A path availability model for wireless ad-hoc networks. In Proceedings of wireless communications and networking conference (pp. 35–40). New Orleans, LA, USA. IEEE.

  35. Biswas, S., & Datta, S. (2004). Reducing overhearing energy in 802.11 networks by low-power interface idling. In Proceedings of IEEE international conference on performance (pp. 695-700), Phoenix, AZ, USA. IEEE.

  36. Le, H. C., Guyennet, H., & Felea, V. (2007). OBMAC: An overhearing based MAC protocol for wireless sensor networks. In Proceedings of international conference on sensor technologies and applications (pp. 547–553), Valencia, Spain. IEEE.

  37. Riley, G. F., & Henderson, T. R. (2010). The ns-3 network simulator. In Modeling and tools for network simulation (pp. 15–34).

  38. Kushner, H. J. (1984). Approximation and weak convergence methods for random processes, with applications to stochastic systems theory. Cambridge: MIT Press.

    MATH  Google Scholar 

Download references

Acknowledgements

The work is supported by the National Natural Science Foundation of China (No.61772386).

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Hu-yin Zhang.

Additional information

Publisher’s Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Appendix: Convergence Analysis for Using LA Theory

Appendix: Convergence Analysis for Using LA Theory

In appendix, we analyse the convergence results of our LA based routing algorithm in detail. To find the conclusion, we first derive the drift function of our LA based routing algorithm; then, we use the the interpolation method to represent the action probability as a interpolation sequence. With this, we can approximate our algorithm (the normalized selecting probability, in this analysis, it can be also called as action probability) to an ordinary differential equation (ODE). Finally, we can find that the this ODE converges to the solution of the optimization problem.

In our paper, each node has a learning automaton, the candidate next hop nodes can be regarded as the the chosen actions of LA. Let \(\{N_i\}\) represents the set of node i’s candidate next hop nodes, there is a tight correspondence between the selecting probability of candidate next hop nodes and the chosen action probability p. We can use the following expression to represent it

$$\begin{aligned} p_j=\frac{Nw_{j}}{\sum ^{N_i}_{j=1}Nw_j} \end{aligned}$$
(32)

Thus, we can use the theory of stochastic process to represent the genericity one step update formula of the chosen action probability (it is called the drift function, which means the increment in the conditional expectation,)

$$\begin{aligned} p'_{j}(t+1)=E[p_{j}(t+1)|p_{j}(t)]-p_{j}(t) |j\in N_i \end{aligned}$$
(33)

Owing to the fact that p(t) is a Markov process and the dynamics of it depend on the update factor (\(\varLambda\)), \(\beta (t)\) directly depends on p(t) (not on the iteration time). Therefore, \(p'_{t}\) can be derived by the function of \(p_{t}\). Combining the formula (25)–(26), \(p'_{j}(t)\) for our algorithm can be obtained as following

$$\begin{aligned} \begin{aligned} p_{j}'(t)&= \varLambda \cdot p_{j}(t)\left[ 1-p_{j}(t)E[\beta _{j}(t)]- \varLambda \cdot \sum _{q\ne j}p_{q}(t)p_{j}(t)E[\beta _{q}(t)]\right] \\&= \varLambda \cdot p_{j}(t)\sum _{q\ne j}p_{q}(t)[E[\beta _{ij}(t)]-E[\beta _{iq}(t)]]\quad |j,q\in N_i \end{aligned} \end{aligned}$$
(34)

q represents the ID of other candidate actions for node i, which are not selected. Noting that \(\beta\) has been defined in Sect. 2.

Defining a function f(.), where

$$\begin{aligned} \begin{aligned}&{\text {maxmize:}}\,f(p)=E[\beta |p] \\&{\text {subject to:}}\,p_{j}\ge 0; i\in [1,]N; j\in N_i\\&\sum ^{N_i}_{j=1}p_{j}=1 \end{aligned} \end{aligned}$$
(35)

We can get the conclusion as following

$$\begin{aligned} f(p)=\sum _{q\ne j}p_q(t)E[\beta (t)] \end{aligned}$$
(36)

Thus, we can reach the following conclusion by differentiating both sides of the formula (36)

$$\begin{aligned} \frac{\vartheta f}{\vartheta p_j}=E[\beta _j(t)] \end{aligned}$$
(37)

Thus, the drift function \(p_j(t)\) for our LA based routing algorithm can be represented as [substituting formula (37) in formula (34).]

$$\begin{aligned} p_j'(t) = \varLambda \cdot p_{j}(t)\cdot \sum _{q\ne j}p_{q} \left[ \frac{\vartheta f}{\vartheta p_j}-\frac{\vartheta f}{\vartheta p_q}\right] | j,q\in N_i \end{aligned}$$
(38)

Based on the definition in LA theory, \(\varLambda\) represents the genericity maximum update rate. Through the above analysis, we find \(p_j'(t)\) for our algorithm can be represented as the function of \(p_j(t)\). Hence, we define a function \(F_j()\) as following

$$\begin{aligned} \begin{aligned}&F_j(p_j)=p_j\cdot \sum _{q\ne j}p_{q} \left[ \frac{\vartheta f}{\vartheta p_j}-\frac{\vartheta f}{\vartheta p_q}\right] \\&\text {where}\\&p_j'(t)=\varLambda \cdot F_j(p_j) \end{aligned} \end{aligned}$$
(39)

Using the interpolation method to rewrite the action probability p(t) for our routing algorithm

$$\begin{aligned} p(t)=p^{\varLambda }(\tau ) \end{aligned}$$
(40)

where \(\tau \in [t\varLambda ,(t+1)\varLambda ]\),\(P^{\varLambda }(\tau )\) is a piecewise constant interpolation function. Now we just only care about the convergence of the interpolation sequence \(P^{\varLambda }()\).

Generally, the genericity maximum update rate \(\varLambda\) is close to 0. Hence based on the weak convergence theory [38], we can make an assertion that \(p^{\varLambda }(\tau )\) can weakly converge to the solution set of an ordinary differential equation (ODE), which can be represented as

$$\begin{aligned} {\left\{ \begin{array}{ll} \frac{dx_{j}}{d\tau } =F_{j}(x) \\ x(0)=p^{\varLambda }(0)=p(0) \end{array}\right. } \end{aligned}$$
(41)

It should be explained why this conclusion is true. Firstly, based on the LA theory, \(p(t+1)\), \(\beta (t)\), \(\alpha (t)\) constitute a Markov process(from the long term point of view, we do not prove this fact). Secondly, the outputs of the learning automata are finite (owing to the fact that the neighbors of node are finite). Thirdly, the feedback results of p(t) still take values between 0 and 1 (it means the probability space can be guaranteed). Fourthly, the feedback function defined in our algorithm \(\varphi (t)\) is bounded, continuous and independent of \(\varLambda\). Finally, this ODE has the unique solution for each initial x(0). Hence, we get the conclusion that when \(\varLambda \rightarrow 0\), the sequence \(p^\varLambda ()\) weakly converges to the solution of this ODE. Clearly, in our algorithm, \(\varLambda\) meets this condition (\(\varLambda\) in our algorithm is represented as a and b, and they all close to 0(\(a=b=0.10\))).

Remark 5

It needs to note that this ODE is a particular case of the weak convergence theory [38].

Rights and permissions

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Hao, S., Zhang, Hy. & Wang, J. A Learning Automata Based Stable and Energy-Efficient Routing Algorithm for Discrete Energy Harvesting Mobile Wireless Sensor Network. Wireless Pers Commun 107, 437–469 (2019). https://doi.org/10.1007/s11277-019-06284-3

Download citation

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s11277-019-06284-3

Keywords

Navigation