Reinforcement Learning for Matrix Computations: PageRank as an Example

Borkar, Vivek S.; Mathkar, Adwaitvedant S.

doi:10.1007/978-3-319-04483-5_2

Vivek S. Borkar¹⁷ &
Adwaitvedant S. Mathkar¹⁷

Part of the book series: Lecture Notes in Computer Science ((LNISA,volume 8337))

Included in the following conference series:

International Conference on Distributed Computing and Internet Technology

1503 Accesses
3 Citations

Abstract

Reinforcement learning has gained wide popularity as a technique for simulation-driven approximate dynamic programming. A less known aspect is that the very reasons that make it effective in dynamic programming can also be leveraged for using it for distributed schemes for certain matrix computations involving non-negative matrices. In this spirit, we propose a reinforcement learning algorithm for PageRank computation that is fashioned after analogous schemes for approximate dynamic programming. The algorithm has the advantage of ease of distributed implementation and more importantly, of being model-free, i.e., not dependent on any specific assumptions about the transition probabilities in the random web-surfer model. We analyze its convergence and finite time behavior and present some supporting numerical experiments.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 39.99; Price excludes VAT (USA)

Softcover Book: USD 54.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

References

Thorndike, E.L.: Animal intelligence: an experimental study of the associative processes in animals. Psychological Review, Monograph Supplement 2(8) (1998)
Google Scholar
Bush, R.R., Mosteller, F.: A mathematical model of simple learning. Psychological Review 58, 313–323
Google Scholar
Estes, K.W.: Towards a statistical theory of learning. Psychological Review 57, 94–107
Google Scholar
Bertsekas, D.P.: Dynamic Programming and Optimal Control, 4th edn., vol. 2. Athena Scientific, Belmont (2007)
Google Scholar
Bertsekas, D.P., Tsitsiklis, J.N.: Neuro-dynamic Programming. Athena Scientific, Belmont (1996)
MATH Google Scholar
Gosavi, A.: Simulation-based Optimization, Parametric Optimization Techniques and Reinforcement Learning. Springer, New York (2003)
Book MATH Google Scholar
Powell, W.B.: Approximate Dynamic Programming: Solving the Curses of Dimensionality, 2nd edn. Wiley, New York (2011)
Book Google Scholar
Sutton, R.S., Barto, A.G.: Reinforcement Learning: An Introduction. MIT Press, Cambridge (1998)
Google Scholar
Szepesvari, C.: Algorithms for Reinforcement Learning. Morgan and Claypool Publishers (2010)
Google Scholar
Sargent, T.J.: Bounded Rationality in Macroeconomics. Oxford Uni. Press, Oxford (1994)
Google Scholar
Thrun, S., Burgard, W., Fox, D.: Probabilistic Robotics. MIT Press, Cambridge (2005)
MATH Google Scholar
Robbins, H., Monro, J.: A stochastic approximation method. Annals of Math. Stat. 22, 400–407 (1951)
Article MATH MathSciNet Google Scholar
Borkar, V.S., Makhijani, R., Sundaresan, R.: How to gossip if you must (preprint, 2013), http://arxiv.org/abs/1309.7841
Borkar, V.: Reinforcement Learning - A Bridge between Numerical Methods and Markov Chain Monte Carlo. In: Sastry, N.S.N., Rajeev, B., Delampady, M., Rao, T.S.S.R.K. (eds.) Perspectives in Mathematical Sciences. World Scientific (2008)
Google Scholar
Langville, A.N., Meyer, C.D.: Google’s PageRank and Beyond: The Science of Search Engine Rankings. Princeton Uni. Press, Princeton (2006)
Google Scholar
Langville, A.N., Meyer, C.D.: Deeper inside PageRank. Internet Mathematics 1(3), 335–380 (2004)
Article MATH MathSciNet Google Scholar
Avrachenkov, K., Litvak, N., Nemirovsky, D., Osipova, N.: Monte Carlo methods in PageRank computation: when one iteration is sufficient. SIAM J. Numer. Anal. 45(2), 890–904 (2007)
Article MATH MathSciNet Google Scholar
Avrachenkov, K., Litvak, N., Nemirovsky, D., Smirnova, E., Sokol, M.: Quick detection of top-k personalized PageRank lists. In: Frieze, A., Horn, P., Prałat, P. (eds.) WAW 2011. LNCS, vol. 6732, pp. 50–61. Springer, Heidelberg (2011)
Chapter Google Scholar
Polyak, B.T., Timonina, A.V.: PageRank: new regularizations and simulation models. In: Proc. of 11th IFAC World Congress, Milano, August 28-September, pp. 11202–11207 (2011)
Google Scholar
Ishii, H.: Distributed randomized algorithms for PageRank computation. IEEE Trans. Auto. Control 55(9), 1987–2002 (2010)
Article Google Scholar
Nazin, A.V., Polyak, B.T.: ‘The randomized algorithm for finding an eigenvector of the stochastic matrix with application to PageRank. Doklady Mathematics 79(3), 424–427 (2009)
Article MATH MathSciNet Google Scholar
Zhao, W., Chen, H-F. and Fang, H-T.: Convergence of distributed randomized PageRank algorithms. arXiv:1305.3178 [cs.SY] (2013)
Google Scholar
Vigna, S.: Spectral ranking, http://arxiv.org/abs/0912.0238
Borkar, V.S.: Stochastic Approximation: A Dynamical Systems Viewpoint. Hindustan Publ. Agency, Cambridge Uni. Press, New Delhi, Cambridge (2008)
Google Scholar
Ho, Y.-C.: An explanation of ordinal optimization: Soft computing for hard problems. Information Sciences 113(3-4), 169–192 (1999)
Article MATH MathSciNet Google Scholar

Download references

Author information

Authors and Affiliations

Department of Electrical Engineering, Indian Institute of Technology, Powai, Mumbai, 400076, India
Vivek S. Borkar & Adwaitvedant S. Mathkar

Authors

Vivek S. Borkar
View author publications
You can also search for this author in PubMed Google Scholar
Adwaitvedant S. Mathkar
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

School of Technology & Computer Science, Tata Institute of Fundamental Research, Homi Bhabha Road, Colaba, 400005, Mumbai, India
Raja Natarajan

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Borkar, V.S., Mathkar, A.S. (2014). Reinforcement Learning for Matrix Computations: PageRank as an Example. In: Natarajan, R. (eds) Distributed Computing and Internet Technology. ICDCIT 2014. Lecture Notes in Computer Science, vol 8337. Springer, Cham. https://doi.org/10.1007/978-3-319-04483-5_2

Download citation

DOI: https://doi.org/10.1007/978-3-319-04483-5_2
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-04482-8
Online ISBN: 978-3-319-04483-5
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics