Skip to main content

Part of the book series: Studies in Computational Intelligence ((SCI,volume 603))

  • 577 Accesses

Abstract

This chapter introduces the Planning and Learning to Adapt Swiftly to Teammates to Improve Cooperation (PLASTIC) algorithms that enable an ad hoc team agent to cooperate with a variety of different teammates. One might think that the most appropriate thing for an ad hoc team agent to do is to “fit in” with its team by following the same behavior as its teammates. However, if the teammates’ behaviors are suboptimal, this approach will limit how much the ad hoc agent can help its team. Therefore, in this book, we adopt the approach of learning about different teammates and deciding how to act by leveraging this knowledge. This approach allows an ad hoc agent to reason about how well its knowledge of past teammates predicts its current teammates’ actions as well as to convert this knowledge into the actions it needs to take to accomplish its goals. If the knowledge of prior teammates accurately predicts the current teammates and the ad hoc agent is given enough time to plan, this approach will lead to optimal performance of the ad hoc agent, helping its team achieve the best possible outcome. Note that this may not be the optimal performance of any team, but it is optimal for the ad hoc agent given that the behaviors of its teammates are fixed.

This chapter contains material from three publications: [14]. Note that some of Sect. 5.2 is joint work with Sarit Kraus and Avi Rosenfeld in addition to Peter Stone [3].

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 109.00
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 139.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info
Hardcover Book
USD 139.99
Price excludes VAT (USA)
  • Durable hardcover edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

  1. Barrett, Samuel, and Peter Stone. 2014. Cooperating with unknown teammates in robot soccer. In AAAI workshop on multiagent interaction without prior coordination (MIPC 2014), July 2014.

    Google Scholar 

  2. Barrett, Samuel, Peter Stone, and Sarit Kraus. 2011. Empirical evaluation of ad hoc teamwork in the pursuit domain. In Proceedings of the tenth international conference on autonomous agents and multiagent systems (AAMAS), May 2011.

    Google Scholar 

  3. Barrett, Samuel, Peter Stone, Sarit Kraus, and Avi Rosenfeld. 2013. Teamwork with limited knowledge of teammates. In Proceedings of the twenty-seventh conference on artificial intelligence (AAAI), July 2013.

    Google Scholar 

  4. Barrett, Samuel, and Peter Stone. 2015. Cooperating with unknown teammates in complex domains: A robot soccer case study of ad hoc teamwork. In Proceedings of the twenty-ninth conference on artificial intelligence (AAAI), January 2015.

    Google Scholar 

  5. Blum, A, and Y. Mansour. 2007. Algorithmic game theory, chapter learning, regret minimization, and equilibria. Cambridge University Press.

    Google Scholar 

  6. Silver, David, and Joel Veness. 2010. Monte-Carlo planning in large POMDPs. In Advances in neural information processing systems 23 (NIPS). 2010.

    Google Scholar 

  7. Hall, Mark, Eibe Frank, Geoffrey Holmes, Bernhard Pfahringer, Peter Reutemann, and Ian H. Witten. 2009. The WEKA data mining software: An update. SIGKDD Explorations 11: 10–18.

    Article  Google Scholar 

  8. Pardoe, David, and Peter Stone. 2010. Boosting for regression transfer. In Proceedings of the twenty-seventh international conference on machine learning (ICML), June 2010.

    Google Scholar 

  9. Yao, Yi, and G. Doretto. Boosting for transfer learning with multiple sources. In Proceedings of the conference on computer vision and pattern recognition (CVPR), June 2010.

    Google Scholar 

  10. Huang, Pipei, Gang Wang, and Shiyin Qin. 2012. Boosting for transfer learning from multiple data sources. Pattern Recognition Letters 33(5): 568–579.

    Article  Google Scholar 

  11. Zhuang, Fuzhen, Xiaohu Cheng, SinnoJialin Pan, Yu. Wenchao, Qing He, and Zhongzhi Shi. 2014. Transfer learning with multiple sources via consensus regularized autoencoders. In Machine learning and knowledge discovery in databases, vol. 8726, ed. Toon Calders, Floriana Esposito, Eyke Hllermeier, and Rosa Meo, 417–431., Lecture notes in computer science Berlin Heidelberg: Springer.

    Google Scholar 

  12. Fang, Min, Yong Guo, Xiaosong Zhang, and Xiao Li. 2015. Multi-source transfer learning based on label shared subspace. Pattern Recognition Letters 51: 101–106.

    Article  Google Scholar 

  13. Ge,Liang, Jing Gao, and Aidong Zhang. 2013. OMS-TL: A framework of online multiple source transfer learning. In Proceedings of the 22nd ACM international conference on information & knowledge management, CIKM ’13, 2423–2428, ACM, New York, NY, USA, 2013.

    Google Scholar 

  14. Damien Ernst, Pierre Geurts, and Louis Wehenkel. 2005. Tree-based batch mode reinforcement learning. Journal of machine learning research (JMLR), 503–556.

    Google Scholar 

  15. Christopher John Cornish Hellaby Watkins. 1989. Learning from Delayed Rewards. Ph.D thesis, King’s College, Cambridge, May 1989.

    Google Scholar 

  16. Deisenroth, Marc Peter. 2013. Gerhard Neumann, and Jan Peters. A survey on policy search for robotics. Foundations and Trends in Robotics 2(1–2): 1–142.

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Samuel Barrett .

Rights and permissions

Reprints and permissions

Copyright information

© 2015 Springer International Publishing Switzerland

About this chapter

Cite this chapter

Barrett, S. (2015). The PLASTIC Algorithms. In: Making Friends on the Fly: Advances in Ad Hoc Teamwork. Studies in Computational Intelligence, vol 603. Springer, Cham. https://doi.org/10.1007/978-3-319-18069-4_5

Download citation

  • DOI: https://doi.org/10.1007/978-3-319-18069-4_5

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-319-18068-7

  • Online ISBN: 978-3-319-18069-4

  • eBook Packages: EngineeringEngineering (R0)

Publish with us

Policies and ethics