Abstract
Planetary rovers are small unmanned vehicles equipped with cameras and a variety of sensors used for scientific experiments. They must operate under tight constraints on resources such as operation time, power, storage capacity, and communication bandwidth. Moreover, the rover's limited computational resources restrict the complexity of on-line planning and scheduling. We describe two decision-theoretic approaches to maximizing the productivity of planetary rovers: one based on adaptive planning and the other on hierarchical reinforcement learning. Both approaches map the problem into a Markov decision problem and attempt to solve a large part of the problem off-line, exploiting the structure of the plan and the independence between plan components. We examine the advantages and limitations of these techniques and their scalability.
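The MDP formulation described above can be illustrated with a minimal sketch. The states, actions, transition probabilities, and rewards below are invented for illustration and are not taken from the paper; the sketch only shows the general pattern of solving a small rover-control MDP off-line by value iteration.

```python
# Toy rover MDP (all numbers hypothetical): states are remaining battery
# levels 0..3, with 0 absorbing; actions trade battery for science reward.
STATES = [0, 1, 2, 3]
ACTIONS = ["drive", "image", "idle"]

def transitions(s, a):
    """Return a list of (probability, next_state, reward) triples."""
    if s == 0:                      # battery depleted: absorbing, no reward
        return [(1.0, 0, 0.0)]
    if a == "drive":                # stochastic cost of 1 or 2 battery units
        return [(0.7, s - 1, 1.0), (0.3, max(s - 2, 0), 1.0)]
    if a == "image":                # deterministic cost, larger science payoff
        return [(1.0, s - 1, 2.0)]
    return [(1.0, s, 0.0)]          # idle: free, no reward

def value_iteration(gamma=0.95, eps=1e-6):
    """Compute the optimal value function by standard value iteration."""
    V = {s: 0.0 for s in STATES}
    while True:
        delta = 0.0
        for s in STATES:
            q = [sum(p * (r + gamma * V[s2]) for p, s2, r in transitions(s, a))
                 for a in ACTIONS]
            best = max(q)
            delta = max(delta, abs(best - V[s]))
            V[s] = best
        if delta < eps:
            return V

V = value_iteration()
# Greedy policy with respect to the converged value function.
policy = {s: max(ACTIONS,
                 key=lambda a: sum(p * (r + 0.95 * V[s2])
                                   for p, s2, r in transitions(s, a)))
          for s in STATES}
```

Solving such a model off-line yields a policy that can be executed on board with negligible computation; the paper's contribution lies in making this tractable for realistic plan structures by exploiting structure and decomposition rather than enumerating the full state space as done here.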
© 2002 Springer-Verlag Berlin Heidelberg
Zilberstein, S., Washington, R., Bernstein, D.S., Mouaddib, AI. (2002). Decision-Theoretic Control of Planetary Rovers. In: Beetz, M., Hertzberg, J., Ghallab, M., Pollack, M.E. (eds) Advances in Plan-Based Control of Robotic Agents. Lecture Notes in Computer Science(), vol 2466. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-37724-7_16
Print ISBN: 978-3-540-00168-3
Online ISBN: 978-3-540-37724-5