A Learning Automata Based Stable and Energy-Efficient Routing Algorithm for Discrete Energy Harvesting Mobile Wireless Sensor Network

Hao, Sheng; Zhang, Hu-yin; Wang, Jing

doi:10.1007/s11277-019-06284-3

A Learning Automata Based Stable and Energy-Efficient Routing Algorithm for Discrete Energy Harvesting Mobile Wireless Sensor Network

Published: 03 April 2019

Volume 107, pages 437–469, (2019)
Cite this article

Wireless Personal Communications Aims and scope Submit manuscript

Sheng Hao¹,
Hu-yin Zhang¹ &
Jing Wang¹

242 Accesses
3 Citations
Explore all metrics

Abstract

Wireless sensor networks (WSN) have been widely used in urban network system and networked monitoring system, which provide easy connectivity and high physical data rate. Considering the battery-limited property of sensor nodes, recently, energy harvesting (EH) technology is introduced into WSN, which can alleviate traditional WSN problems (energy consumption, energy equilibrium, transmission efficiency, etc.). Current EH-WSN routing algorithms generally use the continuous energy harvesting mode, therefore, how to design an efficient routing algorithm for discrete energy harvesting mode and ensure the overall energy balance and conservation is still a great challenge and needs to be solved. Especially, under the mobile environment, the impact of route stability needs to be considered, which makes the design more complicated. To address the above problems, we propose a learning automata (LA) theory based stable and energy-efficient routing algorithm for discrete EH-mobile WSN (DEH-LA-SERA, for short). Firstly, we construct a multi-factors measurement model for sensor nodes, which contains node stability model, energy ratio function, expected harvesting energy model (using Markov decision process method) and direction judgement model. On this basis, we derive the node weighted value, i.e., selecting probability, which can be used to determine whether a node can be chosen as relay node. Secondly, with the help of LA theory, we construct a feedback mechanism to adjust the optimal path. With this solution, we can ensure the overall energy balance and conservation while holding the stability of selected path. As demonstrated in simulation experiments, our algorithm, DEH-LA-SERA, achieved the best performance in route survival time, energy consumption, energy balance and acceptable performance in end-to-end delay and packets delivery ratio.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

A Stable and Energy-Efficient Routing Algorithm Based on Learning Automata Theory for MANET

Article 12 April 2018

AI Assisted Energy Optimized Sustainable Model for Secured Routing in Mobile Wireless Sensor Network

Article 16 April 2024

Power allocation algorithm based on mixed integer nonlinear programming in WSN

Article 23 February 2018

Notes

The expression of feedback for LA theory has different derivative forms.
If the transmission area of node is a hexagonal shape [33], the transmission range R of node is bounded by the radius $R_h$ of this hexagonal shape, i.e., $R\approx 0.91 R_h$.
The vertical distance means the length from a node to the line segment connecting source and destination.
The probability of this extreme case occurrence is related to the degree of network sparsity and node distribution.
Overhearing technique [35, 36] has been greatly improved which only needs a small amount of energy consumption.
Noting that different mobility mode needs different mathematics modeling and different parameters setting.
The energy setting has many different patterns, here we only give one pattern, which is available for NS-3.
Based on our data transmission velocity setting, the forwarding delay is much less than the transmission delay.
In [9], frequency is used to measure the node stability.
It needs to note that the frequency is also affected by the transmission range, direction change and network density.

References

Akyildiz, I. F., Su, W., Sankarasubramaniam, Y., & Cayirci, E. (2002). Wireless sensor networks: A survey. Computer Networks, 38(4), 393–422.
Article Google Scholar
Al-Karaki, J. N., & Kamal, A. E. (2004). Routing techniques in wireless sensor networks: A survey (pp. 6–28). Piscataway: IEEE Press.
Google Scholar
Shaikh, F. K., & Zeadally, S. (2016). Energy harvesting in wireless sensor networks: A comprehensive review. Renewable and Sustainable Energy Reviews, 55, 1041–1054.
Article Google Scholar
Sarma, H. K. D., Mall, R., & Kar, A. (2016). $\text{ E }^2 \text{ R }^2$: Energy-efficient and reliable routing for mobile wireless sensor networks. IEEE Systems Journal, 10(2), 604–616.
Article Google Scholar
Sarma, H. K. D., Kar, A., & Mall, R. (2016). A hierarchical and role based secure routing protocol for mobile wireless sensor networks. Wireless Personal Communications, 90(3), 1067–1103.
Article Google Scholar
Tamandani, Y. K., & Bokhari, M. U. (2015). SEPFL routing protocol based on fuzzy logic control to extend the lifetime and throughput of the wireless sensor network. Wireless Networks, 22(2), 1–7.
Google Scholar
Ye, R., Boukerche, A., Wang, H., Zhou, X., & Yan, B. (2017). $\text{ E }^3$TX: an energy-efficient expected transmission count routing decision strategy for wireless sensor networks. Wireless Networks, 3, 1–14.
Article Google Scholar
Li, F., & Wang, L. (2018). Energy-aware routing algorithm for wireless sensor networks with optimal relay detecting. Wireless Personal Communications, 98(2), 1701–1717.
Article MathSciNet Google Scholar
Mottaghinia, Z., & Ghaffari, A. (2018). Fuzzy logic based distance and energy-aware routing protocol in delay-tolerant mobile sensor networks. Wireless Personal Communications, 3, 1–20.
Google Scholar
Khasawneh, A., Latiff, M. S. B. A., Kaiwartya, O., & Chizari, H. (2017). A reliable energy-efficient pressure-based routing protocol for underwater wireless sensor network. Wireless Networks, 24(6), 2061–2075.
Article Google Scholar
Kansal, A., & Srivastava, M. B. (2003). An environmental energy harvesting framework for sensor networks. In Proceedings of international symposium on low power electronics and design (pp. 481–486), Seoul, South Korea. IEEE.
Kansal, A., Hsu, J., Zahedi, S., & Srivastava, M. B. (2007). Power management in energy harvesting sensor networks. ACM Transactions on Embedded Computing Systems, 6(4), 1–35.
Article Google Scholar
Tan, Y. K., & Panda, S. K. (2010). Optimized wind energy harvesting system using resistance emulator and active rectifier for wireless sensor nodes. IEEE Transactions on Power Electronics, 26(1), 38–50.
Google Scholar
Kimball, J. W., Kuhn, B. T., & Balog, R. S. (2009). A system design approach for unattended solar energy harvesting supply. IEEE Transactions on Power Electronics, 24(4), 952–962.
Article Google Scholar
Muhammad, U. B., Ezugwu, A. E., Ofem, P. O., Rajamäki, J., & Aderemi, A. O. (2017). Energy neutral protocol based on hierarchical routing techniques for energy harvesting wireless sensor network. Proceedings of American Institute of Physics Conference Series, 1836(1), 921–960.
Google Scholar
Tang, W., Zhang, K., & Jiang, D. (2018). Physarum-inspired routing protocol for energy harvesting wireless sensor networks. Telecommunication Systems, 32, 1–18.
Google Scholar
Liu, Z., Yang, X., Zhao, P., & Yu, W. (2016). On energy-balanced backpressure routing mechanisms for stochastic energy harvesting wireless sensor networks. International Journal of Distributed Sensor Networks, 8(12), 1–11.
Google Scholar
Lu, T., Liu, G., & Chang, S. (2018). Energy-efficient data sensing and routing in unreliable energy-harvesting wireless sensor network. Wireless Networks, 24(2), 611–625.
Article Google Scholar
Hieu, T. D., Dung, l T., & Kim, B. S. (2016). Stability-aware geographic routing in energy harvesting wireless sensor networks. Sensors, 16(5), 1–15.
Article Google Scholar
Sun, G., Shang, X., & Zuo, Y. (2018). La-CTP: Loop-aware routing for energy-harvesting wireless sensor networks. Sensors, 18(2), 434–453.
Article Google Scholar
Tang, J., Liu, A., Zhang, J., et al. (2018). A trust-based secure routing scheme using the traceback approach for energy-harvesting wireless sensor networks. Sensors, 18(3), 751–793.
Article Google Scholar
Chin, K. W., Wang, L., & Soh, S. (2016). Joint routing and links scheduling in two-tier multi-hop RF-energy harvesting networks. IEEE Communications Letters, 20(9), 1864–1867.
Article Google Scholar
Ashraphijuo, M., Aggarwal, V., & Wang, X. (2015). On the capacity of energy harvesting communication link. IEEE Journal on Selected Areas in Communications, 33(12), 2671–2686.
Article Google Scholar
Trillingsgaard, K. F., & Popovski, P. (2014). Communication strategies for two models of discrete energy harvesting In Proceedings of IEEE international conference on communications (pp. 2081–2086), Sydney, NSW, Australia. IEEE.
Narendra, K. S., & Thathachar, M. A. L. (2012). Learning automata: An introduction. USA:DBLP.
Thathachar, M. A. L., & Sastry, P. S. (1987). A hierarchical system of learning automata that can learn the globally optimal path. New York: Elsevier Science Inc.
Book MATH Google Scholar
Beigy, H., & Meybodi, M. R. (2011). Utilizing distributed learning automata to solve stochastic shortest path problems. International Journal of Uncertainty, Fuzziness and Knowledge-Based Systems, 14(05), 591–615.
Article MathSciNet MATH Google Scholar
Kim, J., & Lee, J. W. (2017). Energy adaptive MAC for wireless sensor networks with RF energy transfer: Algorithm, analysis, and implementation (pp. 1–15). Alphen aan den Rijn: Kluwer Academic Publishers.
Google Scholar
Guo, S., Shi, Y., Yang, Y., et al. (2017). Energy efficiency maximization in mobile wireless energy harvesting sensor networks. IEEE Transactions on Mobile Computing, PP(99), 1–1.
Google Scholar
Huang, L. (2017). Optimal sleep-wake scheduling for energy harvesting smart mobile devices. IEEE Transactions on Mobile Computing, 14(2), 1394–1407.
Article MathSciNet Google Scholar
Zhang, H., Huang, S., Jiang, C., et al. (2017). Energy efficient user association and power allocation in millimeter wave based ultra dense networks with energy harvesting base stations. IEEE Journal on Selected Areas in Communications, PP(99), 1–1.
Google Scholar
West, D. B. (2005). Introduction to graph theory (2nd ed., p. 260). New York: McGraw-Hill Higher Education.
Google Scholar
Zonoozi, M. M., & Dassanayake, P. (1997). User mobility modeling and characterization of mobility patterns. IEEE Journal on Selected Areas in Communications, 15(7), 1239–1252.
Article Google Scholar
Mcdonald, A. B., & Znati, T. (1999). A path availability model for wireless ad-hoc networks. In Proceedings of wireless communications and networking conference (pp. 35–40). New Orleans, LA, USA. IEEE.
Biswas, S., & Datta, S. (2004). Reducing overhearing energy in 802.11 networks by low-power interface idling. In Proceedings of IEEE international conference on performance (pp. 695-700), Phoenix, AZ, USA. IEEE.
Le, H. C., Guyennet, H., & Felea, V. (2007). OBMAC: An overhearing based MAC protocol for wireless sensor networks. In Proceedings of international conference on sensor technologies and applications (pp. 547–553), Valencia, Spain. IEEE.
Riley, G. F., & Henderson, T. R. (2010). The ns-3 network simulator. In Modeling and tools for network simulation (pp. 15–34).
Kushner, H. J. (1984). Approximation and weak convergence methods for random processes, with applications to stochastic systems theory. Cambridge: MIT Press.
MATH Google Scholar

Download references

Acknowledgements

The work is supported by the National Natural Science Foundation of China (No.61772386).

Author information

Authors and Affiliations

School of Computer, Wuhan University, Wuhan, 430079, Hubei Province, People’s Republic of China
Sheng Hao, Hu-yin Zhang & Jing Wang

Authors

Sheng Hao
View author publications
You can also search for this author in PubMed Google Scholar
Hu-yin Zhang
View author publications
You can also search for this author in PubMed Google Scholar
Jing Wang
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Hu-yin Zhang.

Additional information

Publisher’s Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Appendix: Convergence Analysis for Using LA Theory

In appendix, we analyse the convergence results of our LA based routing algorithm in detail. To find the conclusion, we first derive the drift function of our LA based routing algorithm; then, we use the the interpolation method to represent the action probability as a interpolation sequence. With this, we can approximate our algorithm (the normalized selecting probability, in this analysis, it can be also called as action probability) to an ordinary differential equation (ODE). Finally, we can find that the this ODE converges to the solution of the optimization problem.

In our paper, each node has a learning automaton, the candidate next hop nodes can be regarded as the the chosen actions of LA. Let $\{N_i\}$ represents the set of node i’s candidate next hop nodes, there is a tight correspondence between the selecting probability of candidate next hop nodes and the chosen action probability p. We can use the following expression to represent it

$$\begin{aligned} p_j=\frac{Nw_{j}}{\sum ^{N_i}_{j=1}Nw_j} \end{aligned}$$

(32)

Thus, we can use the theory of stochastic process to represent the genericity one step update formula of the chosen action probability (it is called the drift function, which means the increment in the conditional expectation,)

$$\begin{aligned} p'_{j}(t+1)=E[p_{j}(t+1)|p_{j}(t)]-p_{j}(t) |j\in N_i \end{aligned}$$

(33)

Owing to the fact that p(t) is a Markov process and the dynamics of it depend on the update factor ($\varLambda$), $\beta (t)$ directly depends on p(t) (not on the iteration time). Therefore, $p'_{t}$ can be derived by the function of $p_{t}$. Combining the formula (25)–(26), $p'_{j}(t)$ for our algorithm can be obtained as following

$$\begin{aligned} \begin{aligned} p_{j}'(t)&= \varLambda \cdot p_{j}(t)\left[ 1-p_{j}(t)E[\beta _{j}(t)]- \varLambda \cdot \sum _{q\ne j}p_{q}(t)p_{j}(t)E[\beta _{q}(t)]\right] \\&= \varLambda \cdot p_{j}(t)\sum _{q\ne j}p_{q}(t)[E[\beta _{ij}(t)]-E[\beta _{iq}(t)]]\quad |j,q\in N_i \end{aligned} \end{aligned}$$

(34)

q represents the ID of other candidate actions for node i, which are not selected. Noting that $\beta$ has been defined in Sect. 2.

Defining a function f(.), where

$$\begin{aligned} \begin{aligned}&{\text {maxmize:}}\,f(p)=E[\beta |p] \\&{\text {subject to:}}\,p_{j}\ge 0; i\in [1,]N; j\in N_i\\&\sum ^{N_i}_{j=1}p_{j}=1 \end{aligned} \end{aligned}$$

(35)

We can get the conclusion as following

$$\begin{aligned} f(p)=\sum _{q\ne j}p_q(t)E[\beta (t)] \end{aligned}$$

(36)

Thus, we can reach the following conclusion by differentiating both sides of the formula (36)

$$\begin{aligned} \frac{\vartheta f}{\vartheta p_j}=E[\beta _j(t)] \end{aligned}$$

(37)

Thus, the drift function $p_j(t)$ for our LA based routing algorithm can be represented as [substituting formula (37) in formula (34).]

$$\begin{aligned} p_j'(t) = \varLambda \cdot p_{j}(t)\cdot \sum _{q\ne j}p_{q} \left[ \frac{\vartheta f}{\vartheta p_j}-\frac{\vartheta f}{\vartheta p_q}\right] | j,q\in N_i \end{aligned}$$

(38)

Based on the definition in LA theory, $\varLambda$ represents the genericity maximum update rate. Through the above analysis, we find $p_j'(t)$ for our algorithm can be represented as the function of $p_j(t)$. Hence, we define a function $F_j()$ as following

$$\begin{aligned} \begin{aligned}&F_j(p_j)=p_j\cdot \sum _{q\ne j}p_{q} \left[ \frac{\vartheta f}{\vartheta p_j}-\frac{\vartheta f}{\vartheta p_q}\right] \\&\text {where}\\&p_j'(t)=\varLambda \cdot F_j(p_j) \end{aligned} \end{aligned}$$

(39)

Using the interpolation method to rewrite the action probability p(t) for our routing algorithm

$$\begin{aligned} p(t)=p^{\varLambda }(\tau ) \end{aligned}$$

(40)

where $\tau \in [t\varLambda ,(t+1)\varLambda ]$,$P^{\varLambda }(\tau )$ is a piecewise constant interpolation function. Now we just only care about the convergence of the interpolation sequence $P^{\varLambda }()$.

Generally, the genericity maximum update rate $\varLambda$ is close to 0. Hence based on the weak convergence theory [38], we can make an assertion that $p^{\varLambda }(\tau )$ can weakly converge to the solution set of an ordinary differential equation (ODE), which can be represented as

$$\begin{aligned} {\left\{ \begin{array}{ll} \frac{dx_{j}}{d\tau } =F_{j}(x) \\ x(0)=p^{\varLambda }(0)=p(0) \end{array}\right. } \end{aligned}$$

(41)

It should be explained why this conclusion is true. Firstly, based on the LA theory, $p(t+1)$, $\beta (t)$, $\alpha (t)$ constitute a Markov process(from the long term point of view, we do not prove this fact). Secondly, the outputs of the learning automata are finite (owing to the fact that the neighbors of node are finite). Thirdly, the feedback results of p(t) still take values between 0 and 1 (it means the probability space can be guaranteed). Fourthly, the feedback function defined in our algorithm $\varphi (t)$ is bounded, continuous and independent of $\varLambda$. Finally, this ODE has the unique solution for each initial x(0). Hence, we get the conclusion that when $\varLambda \rightarrow 0$, the sequence $p^\varLambda ()$ weakly converges to the solution of this ODE. Clearly, in our algorithm, $\varLambda$ meets this condition ($\varLambda$ in our algorithm is represented as a and b, and they all close to 0($a=b=0.10$)).

Remark 5

It needs to note that this ODE is a particular case of the weak convergence theory [38].

Rights and permissions

Reprints and permissions

About this article

Cite this article

Hao, S., Zhang, Hy. & Wang, J. A Learning Automata Based Stable and Energy-Efficient Routing Algorithm for Discrete Energy Harvesting Mobile Wireless Sensor Network. Wireless Pers Commun 107, 437–469 (2019). https://doi.org/10.1007/s11277-019-06284-3

Download citation

Published: 03 April 2019
Issue Date: 01 July 2019
DOI: https://doi.org/10.1007/s11277-019-06284-3

Keywords

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

A Learning Automata Based Stable and Energy-Efficient Routing Algorithm for Discrete Energy Harvesting Mobile Wireless Sensor Network

Abstract

Access this article

Similar content being viewed by others

A Stable and Energy-Efficient Routing Algorithm Based on Learning Automata Theory for MANET

AI Assisted Energy Optimized Sustainable Model for Secured Routing in Mobile Wireless Sensor Network

Power allocation algorithm based on mixed integer nonlinear programming in WSN

Notes

References

Acknowledgements

Author information

Authors and Affiliations

Corresponding author

Additional information

Publisher’s Note

Appendix: Convergence Analysis for Using LA Theory

Remark 5

Rights and permissions

About this article

Cite this article

Keywords

Navigation

A Learning Automata Based Stable and Energy-Efficient Routing Algorithm for Discrete Energy Harvesting Mobile Wireless Sensor Network

Abstract

Access this article

Similar content being viewed by others

A Stable and Energy-Efficient Routing Algorithm Based on Learning Automata Theory for MANET

AI Assisted Energy Optimized Sustainable Model for Secured Routing in Mobile Wireless Sensor Network

Power allocation algorithm based on mixed integer nonlinear programming in WSN

Notes

References

Acknowledgements

Author information

Authors and Affiliations

Corresponding author

Additional information

Publisher’s Note

Appendix: Convergence Analysis for Using LA Theory

Appendix: Convergence Analysis for Using LA Theory

Remark 5

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Search

Navigation