Weaponizing Actions in Multi-Agent Reinforcement Learning: Theoretical and Empirical Study on Security and Robustness

Liu, Tongtong; McCalmon, Joe; Rahman, Md Asifur; Lischke, Cameron; Halabi, Talal; Alqahtani, Sarra

doi:10.1007/978-3-031-21203-1_21

Tongtong Liu¹²,
Joe McCalmon¹²,
Md Asifur Rahman¹²,
Cameron Lischke¹³,
Talal Halabi¹⁴ &
…
Sarra Alqahtani¹²

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 13753))

Included in the following conference series:

International Conference on Principles and Practice of Multi-Agent Systems

776 Accesses
1 Citations

Abstract

Cooperative Multi-Agent Reinforcement Learning (c-MARL) enables a team of agents to determine the global optimal policy that maximizes the sum of their accumulated rewards. This paper investigates the robustness of c-MARL to a novel adversarial threat, where we target and weaponize one agent, termed the compromised agent, to create natural observations that are adversarial for its team. The goal is to lure the compromised agent to follow an adversarial policy that pushes activations of its cooperative agents’ policy networks off distribution. This paper shows mathematically the exploitation steps of such an adversarial policy in the centralized-learning and decentralized-execution paradigm of c-MARL. We also empirically demonstrate the susceptibility of the state-of-the-art c-MARL algorithms, namely MADDPG and QMIX, to the compromised agent threat by deploying four attack strategies in three environments in white and black box settings. By targeting a single agent, our attacks yield highly negative impact on the overall team reward in all environments, reducing it by at least 33% and at most 89.6%. Finally, we provide recommendations on improving the robustness of c-MARL.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 84.99; Price excludes VAT (USA)

Softcover Book: USD 109.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Adversarial attacks on cooperative multi-agent deep reinforcement learning: a dynamic group-based adversarial example transferability method

Article Open access 03 July 2023

Security Analysis of Poisoning Attacks Against Multi-agent Reinforcement Learning

Learning Cooperative Behaviours in Adversarial Multi-agent Systems

Notes

1.
https://github.com/SarraAlqahtani22/MARL-Robustness.
2.
Our code and demos are available here:https://github.com/SarraAlqahtani22/MARL-Robustness.

References

Amoozadeh, M., et al.: Security vulnerabilities of connected vehicle streams and their impact on cooperative driving. IEEE Commun. Mag. 53(6), 126–132 (2015). https://doi.org/10.1109/MCOM.2015.7120028
Article Google Scholar
Bagnell, J.A.D.: An invitation to imitation. Technical report. CMU-RI-TR-15-08, Carnegie Mellon University, Pittsburgh, PA (2015)
Google Scholar
Bakakeu, J., Kisskalt, D., Franke, J., Baer, S., Klos, H.H., Peschke, J.: Multi-agent reinforcement learning for the energy optimization of cyber-physical production systems. In: 2020 IEEE Canadian Conference on Electrical and Computer Engineering (CCECE), pp. 1–8 (2020). https://doi.org/10.1109/CCECE47787.2020.9255795
Dadras, S., Dadras, S., Winstead, C.: Collaborative attacks on autonomous vehicle platooning. In: 2018 IEEE 61st International Midwest Symposium on Circuits and Systems (MWSCAS), pp. 464–467 (2018). https://doi.org/10.1109/MWSCAS.2018.8624026
Foerster, J.N., Farquhar, G., Afouras, T., Nardelli, N., Whiteson, S.: Counterfactual multi-agent policy gradients. In: AAAI (2018)
Google Scholar
Fu, J., Luo, K., Levine, S.: Learning robust rewards with adversarial inverse reinforcement learning (2018)
Google Scholar
Higgins, F., Tomlinson, A., Martin, K.M.: Survey on security challenges for swarm robotics. In: 2009 Fifth International Conference on Autonomic and Autonomous Systems, pp. 307–312 (2009). DOI: https://doi.org/10.1109/ICAS.2009.62
Huang, S., Papernot, N., Goodfellow, I., Duan, Y., Abbeel, P.: Adversarial attacks on neural network policies (2017)
Google Scholar
Jaques, N., et al.: Social influence as intrinsic motivation for multi-agent deep reinforcement learning (2019)
Google Scholar
Jo, U., Jo, T., Kim, W., Yoon, I., Lee, D., Lee, S.: Cooperative multi-agent reinforcement learning framework for scalping trading (2019)
Google Scholar
Kos, J., Song, D.: Delving into adversarial attacks on deep policies (2017)
Google Scholar
Levine, S.: Reinforcement learning and control as probabilistic inference: tutorial and review (2018)
Google Scholar
Lin, J., Dzeparoska, K., Zhang, S.Q., Leon-Garcia, A., Papernot, N.: On the robustness of cooperative multi-agent reinforcement learning (2020)
Google Scholar
Lin, Y.C., Hong, Z.W., Liao, Y.H., Shih, M.L., Liu, M.Y., Sun, M.: Tactics of adversarial attack on deep reinforcement learning agents (2019)
Google Scholar
Liu, M., et al.: Multi-agent interactions modeling with correlated policies (2020)
Google Scholar
Lowe, R., Wu, Y., Tamar, A., Harb, J., Abbeel, P., Mordatch, I.: Multi-agent actor-critic for mixed cooperative-competitive environments. In: Proceedings of the 31st International Conference on Neural Information Processing Systems, pp. 6382–6393. NIPS 2017, Curran Associates Inc., Red Hook, NY, USA (2017)
Google Scholar
Peake, A., McCalmon, J., Raiford, B., Liu, T., Alqahtani, S.: Multi-agent reinforcement learning for cooperative adaptive cruise control. In: 2020 IEEE 32nd International Conference on Tools with Artificial Intelligence (ICTAI), pp. 15–22 (2020). https://doi.org/10.1109/ICTAI50040.2020.00013
Rashid, T., Samvelyan, M., de Witt, C.S., Farquhar, G., Foerster, J., Whiteson, S.: Qmix: monotonic value function factorisation for deep multi-agent reinforcement learning (2018)
Google Scholar
Shapley, L.S.: Stochastic games. Proc. National Acad. Sci. 39(10), 1095–1100 (1953). https://doi.org/10.1073/pnas.39.10.1095, https://www.pnas.org/content/39/10/1095
Sutton, R.S., Barto, A.G.: Reinforcement Learning: An Introduction. A Bradford Book, Cambridge, MA, USA (2018)
MATH Google Scholar
Tang, Y.C.: Towards learning multi-agent negotiations via self-play (2020)
Google Scholar
Tian, Z., et al.: Learning to communicate implicitly by actions (2019)
Google Scholar
Wen, Y., Yang, Y., Luo, R., Wang, J., Pan, W.: Probabilistic recursive reasoning for multi-agent reinforcement learning (2019)
Google Scholar
Wiyatno, R., Xu, A.: Maximal Jacobian-based saliency map attack (2018)
Google Scholar
Wu, X., Guo, W., Wei, H., Xing, X.: Adversarial policy training against deep reinforcement learning. In: 30th USENIX Security Symposium (USENIX Security 21), pp. 1883–1900. USENIX Association (2021). https://www.usenix.org/conference/usenixsecurity21/presentation/wu-xian
Xia, Y., Qin, T., Chen, W., Bian, J., Yu, N., Liu, T.Y.: Dual supervised learning (2017)
Google Scholar
Yang, Y., Luo, R., Li, M., Zhou, M., Zhang, W., Wang, J.: Mean field multi-agent reinforcement learning (2018)
Google Scholar
Zhao, Y., Shumailov, I., Cui, H., Gao, X., Mullins, R., Anderson, R.: Blackbox attacks on reinforcement learning agents using approximated temporal information (2019)
Google Scholar

Download references

Acknowledgment

This material is based upon work supported by the National Science Foundation (NSF) under grant no. 2105007.

Author information

Authors and Affiliations

Computer Science Department, Wake Forest University, Winston-Salem, NC, USA
Tongtong Liu, Joe McCalmon, Md Asifur Rahman & Sarra Alqahtani
Computer Science Department, John Hopkins University, Baltimore, MD, USA
Cameron Lischke
Department of Computer Science and Software Engineering, Laval University, Quebec City, QC, Canada
Talal Halabi

Authors

Tongtong Liu
View author publications
You can also search for this author in PubMed Google Scholar
Joe McCalmon
View author publications
You can also search for this author in PubMed Google Scholar
Md Asifur Rahman
View author publications
You can also search for this author in PubMed Google Scholar
Cameron Lischke
View author publications
You can also search for this author in PubMed Google Scholar
Talal Halabi
View author publications
You can also search for this author in PubMed Google Scholar
Sarra Alqahtani
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Sarra Alqahtani .

Editor information

Editors and Affiliations

Özyeğin University, Istanbul, Turkey
Reyhan Aydoğan
Universitat Politècnica de València, Valencia, Spain
Natalia Criado
Université Paris-Dauphine, Paris, France
Jérôme Lang
Universitat Politècnica de València, Valencia, Spain
Victor Sanchez-Anguix
King's College London, London, UK
Marc Serramia

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Liu, T., McCalmon, J., Rahman, M.A., Lischke, C., Halabi, T., Alqahtani, S. (2023). Weaponizing Actions in Multi-Agent Reinforcement Learning: Theoretical and Empirical Study on Security and Robustness. In: Aydoğan, R., Criado, N., Lang, J., Sanchez-Anguix, V., Serramia, M. (eds) PRIMA 2022: Principles and Practice of Multi-Agent Systems. PRIMA 2022. Lecture Notes in Computer Science(), vol 13753. Springer, Cham. https://doi.org/10.1007/978-3-031-21203-1_21

Download citation

DOI: https://doi.org/10.1007/978-3-031-21203-1_21
Published: 12 November 2022
Publisher Name: Springer, Cham
Print ISBN: 978-3-031-21202-4
Online ISBN: 978-3-031-21203-1
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics

Weaponizing Actions in Multi-Agent Reinforcement Learning: Theoretical and Empirical Study on Security and Robustness

Abstract

Access this chapter

Similar content being viewed by others

Adversarial attacks on cooperative multi-agent deep reinforcement learning: a dynamic group-based adversarial example transferability method

Security Analysis of Poisoning Attacks Against Multi-agent Reinforcement Learning

Learning Cooperative Behaviours in Adversarial Multi-agent Systems

Notes

References

Acknowledgment

Author information

Authors and Affiliations

Corresponding author

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this paper

Cite this paper

Download citation

Publish with us

Navigation

Weaponizing Actions in Multi-Agent Reinforcement Learning: Theoretical and Empirical Study on Security and Robustness

Abstract

Access this chapter

Similar content being viewed by others

Adversarial attacks on cooperative multi-agent deep reinforcement learning: a dynamic group-based adversarial example transferability method

Security Analysis of Poisoning Attacks Against Multi-agent Reinforcement Learning

Learning Cooperative Behaviours in Adversarial Multi-agent Systems

Notes

References

Acknowledgment

Author information

Authors and Affiliations

Corresponding author

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this paper

Cite this paper

Download citation

Share this paper

Publish with us

Search

Navigation