Skip to main content

Maximum-Reward Motion in a Stochastic Environment: The Nonequilibrium Statistical Mechanics Perspective

  • Chapter
  • First Online:
  • 3438 Accesses

Part of the book series: Springer Tracts in Advanced Robotics ((STAR,volume 107))

Abstract

We consider the problem of computing the maximum-reward motion in a reward field in an online setting. We assume that the robot has a limited perception range, and it discovers the reward field on the fly. We analyze the performance of a simple, practical lattice-based algorithm with respect to the perception range. Our main result is that, with very little perception range, the robot can collect as much reward as if it could see the whole reward field, under certain assumptions. Along the way, we establish novel connections between this class of problems and certain fundamental problems of nonequilibrium statistical mechanics . We demonstrate our results in simulation examples.

This is a preview of subscription content, log in via an institution.

Buying options

Chapter
USD   29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD   129.00
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Hardcover Book
USD   169.99
Price excludes VAT (USA)
  • Durable hardcover edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Learn about institutional subscriptions

Notes

  1. 1.

    Ad-hoc sensor networks may also be very valuable for environmental monitoring. In fact, such technologies have been developed over past years. We note that the presented approach is for motivational purposes. Yet, it may be beneficial over the ad-hoc network approach due to substantial energy savings at the stationary sensors (as communication requirement is much lower); hence, the sensor nodes require less maintenance. The main drawbacks are additional complexity of a mobile vehicle, and communication delay due to the mobile vehicles physically carrying the data.

References

  1. Heer, C.V.: Statistical Mechanics, Kinetic Theory, and Stochastic Processes. Academic Press, New York (2012)

    Google Scholar 

  2. Mazonko, G.F.: Nonequilibrium Statistical Mechanics. Wiley, Weinheim (2008)

    Google Scholar 

  3. Chou, T., Mallick, K., Zia, R.K.P.: Non-equilibrium statistical mechanics: from a paradigmatic model to biological transport. Rep. Prog. Phys. (2011)

    Google Scholar 

  4. Shaw, L.B., Zia, R.K.P., Lee, K.H.: Totally asymmetric exclusion process with extended objects: a model for protein synthesis. Phys. Rev. E 68(2), (021910) (2003)

    Google Scholar 

  5. Ingber, L.: Statistical mechanics of nonlinear nonequilibrium financial markets. Math. Model. 5, 343–361 (1984)

    Article  MATH  MathSciNet  Google Scholar 

  6. Antal, T., Schutz, G.M.: Asymmetric exclusion process with next-nearest-neighbor interaction: some comments on traffic flow and a nonequilibrium reentrance transition. Phys. Rev. E 62(1), 83–93 (2000)

    Article  Google Scholar 

  7. Nagatani, T.: Bunching of cars in asymmetric exclusion models for freeway traffic. Phys. Rev. E 51(2), 922–928 (1995)

    Article  Google Scholar 

  8. Fleming, G.R., Ratner, M.A.: Grand challenges in basic energy sciences. Phys. Today 61(7), 28 (2008)

    Article  Google Scholar 

  9. Basic Energy Sciences Advisory Committee. Directing matter and energy: five challanges for science and imagination. pp. 1–144. November 2007

    Google Scholar 

  10. National Research Council Committee on CMMP 2010. Condensed-matter and materials physics: the science of world around us. January 2010

    Google Scholar 

  11. Otte, M., Correll, N., Frazzoli, E.: Navigation with foraging. In: 2013 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS), pp. 3150–3157. IEEE (2013)

    Google Scholar 

  12. Karaman, S., Frazzoli, E.: High-speed flight in an ergodic forest. In: IEEE International Conference on Robotics and Automation (2011)

    Google Scholar 

  13. Antoine, B., Jean-Christophe, Z., Dario, F.: Vision-based control of near-obstacle flight. Auton. Robots 27(3), 201–219 (2009)

    Google Scholar 

  14. Scherer, S., Singh, S., Chamberlain, L., Elgersma, M.: Flying fast and low among obstacles: methodology and experiments. Int. J. Robot. Res. 27(5), 549–574 (2008)

    Article  Google Scholar 

  15. Bertsimas, D.J.: A stochastic and dynamic vehicle routing problem in the Euclidean plane. Oper. Res. 39(4), 601–615 (2008)

    Article  Google Scholar 

  16. Boucheron, S., Lugosi, G., Massart, P.: Concentration Inequalities : A Nonasymptotic Theory of Independence. Oxford University Press, Oxford (2013)

    Book  Google Scholar 

  17. Dubhashi, D.P., Panconesi, A.: Concentration of Measure for the Analysis of Randomized Algorithms. Cambridge University Press, New York (2009)

    Book  MATH  Google Scholar 

  18. Dumaz, L., Virág, B: The right tail exponent of the tracy-widom-beta distribution. arXiv preprint arXiv:1102.4818 (2011)

  19. Johansson, K.: Shape fluctuations and random matrices. Commun. Math. Phys. 209(2), 437–476 (2000)

    Article  MATH  Google Scholar 

  20. Martin, J.B.: Linear growth for greedy lattice animals. Stoch. Process. Appl. 98(1), 43–66 (2002)

    Article  MATH  Google Scholar 

  21. Martin, J.B.: Last-passage percolation with general weight distribution. Markov Process. Relat. Fields 12(2), 273–299 (2006)

    MATH  Google Scholar 

  22. Zeng, X., Hou, Z., Guo, C., Guo, Y.: Directed last-passage percolation and random matrices. Adv. Math. 42(3), 3 (2013)

    Google Scholar 

  23. Urmson, C., et al.: Autonomous driving in urban environments: boss and the urban challenge. J. Field Robot. 25(8), 425–466 (2008)

    Article  Google Scholar 

  24. Koenig, S., Likhachev, M., Furcy, D.: Lifelong planning A\(\ast \). Artif. Intel. (2004)

    Google Scholar 

  25. Schrijver, A.: Combinatorial Optimization. Springer, Berlin (2003)

    MATH  Google Scholar 

  26. Steele, M.J.: Probability Theory and Combinatorial Optimization. SIAM, Philadelphia (1996)

    Google Scholar 

  27. Dumaz, L., Virág, B.: The right tail exponent of the Tracy-Widom \(\beta \) distribution. Annales de l’Institut Henri Poincaré, Probabilités et Statistiques 49(4), 915–933 (2013)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Fangchang Ma .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2015 Springer International Publishing Switzerland

About this chapter

Cite this chapter

Ma, F., Karaman, S. (2015). Maximum-Reward Motion in a Stochastic Environment: The Nonequilibrium Statistical Mechanics Perspective. In: Akin, H., Amato, N., Isler, V., van der Stappen, A. (eds) Algorithmic Foundations of Robotics XI. Springer Tracts in Advanced Robotics, vol 107. Springer, Cham. https://doi.org/10.1007/978-3-319-16595-0_23

Download citation

  • DOI: https://doi.org/10.1007/978-3-319-16595-0_23

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-319-16594-3

  • Online ISBN: 978-3-319-16595-0

  • eBook Packages: EngineeringEngineering (R0)

Publish with us

Policies and ethics