Abstract
Mutually beneficial behavior in repeated games can be enforced via the threat of punishment, as enshrined in game theory’s well-known “folk theorem.” There is a cost, however, to a player for generating these disincentives. In this work, we seek to minimize this cost by computing a “Stackelberg punishment,” in which the player selects a behavior that sufficiently punishes the other player while maximizing its own score under the assumption that the other player will adopt a best response. This idea generalizes the concept of a Stackelberg equilibrium. Known efficient algorithms for computing a Stackelberg equilibrium can be adapted to efficiently produce a Stackelberg punishment. We demonstrate an application of this idea in an experiment involving a virtual autonomous vehicle and human participants. We find that a self-driving car with a Stackelberg punishment policy discourages human drivers from bullying in a driving scenario requiring social negotiation.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Basu, C., Yang, Q., Hungerman, D., Singhal, M., Dragan, A.D.: Do you want your autonomous car to drive like you? In: ACM/IEEE International Conference on Human-Robot Interaction, pp. 417–425 (2017)
Brodsky, J.S.: Autonomous vehicle regulation: how an uncertain legal landscape may hit the brakes on self-driving cars. Berkeley Technol. Law J. 31, 851–878 (2016)
Brooks, R.: Unexpected consequences of self driving cars (2017). http://rodneybrooks.com/unexpected-consequences-of-self-driving-cars/
Bruce, A.: Planning for human-robot interaction: representing time and human intention. Ph.D. thesis, Thesis, Robotics Institute, Carnegie Mellon University (2005)
Chesterman, S.: Do driverless cars dream of electric sheep? SSRN (2016). SSRN. https://ssrn.com/abstract=2833701 or https://doi.org/10.2139/ssrn.2833701
Condon, A.: The complexity of stochastic games. Inf. Comput. 96(2), 203–224 (1992)
Letchford, J., Conitzer, V.: Computing optimal strategies to commit to in extensive-form games. In: Proceedings of the 11th ACM Conference on Electronic Commerce, pp. 83–92. ACM (2010)
Littman, M.L., Stone, P.: A polynomial-time Nash equilibrium algorithm for repeated games. Decis. Support Syst. 39(1), 55–66 (2005)
Littman, M.L.: Algorithms for sequential decision making. Ph.D. thesis, Department of Computer Science, Brown University, February 1996. Technical report CS-96-09
Munoz de Cote, E., Littman, M.L.: A polynomial-time Nash equilibrium algorithm for repeated stochastic games. In: 24th Conference on Uncertainty in Artificial Intelligence (UAI 2008) (2008)
Osborne, M.J., Rubinstein, A.: A Course in Game Theory. The MIT Press, Cambridge (1994)
Shapley, L.: Stochastic games. Proc. Nat. Acad. Sci. U.S.A. 39, 1095–1100 (1953)
Tennant, C., Howard, S., Franks, B., Bauer, M.W.: Autonomous vehicles: negotiating a place on the road (2016). http://www.lse.ac.uk/website-archive/newsAndMedia/PDF/AVs-negociating-a-place-on-the-road-1110.pdf
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2019 Springer Nature Switzerland AG
About this paper
Cite this paper
Cooper, M. et al. (2019). Stackelberg Punishment and Bully-Proofing Autonomous Vehicles. In: Salichs, M., et al. Social Robotics. ICSR 2019. Lecture Notes in Computer Science(), vol 11876. Springer, Cham. https://doi.org/10.1007/978-3-030-35888-4_34
Download citation
DOI: https://doi.org/10.1007/978-3-030-35888-4_34
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-35887-7
Online ISBN: 978-3-030-35888-4
eBook Packages: Computer ScienceComputer Science (R0)