Large language models associate Muslims with violence

Abid, Abubakar; Farooqi, Maheen; Zou, James

doi:10.1038/s42256-021-00359-2

Large language models associate Muslims with violence

Comment
Published: 17 June 2021

Volume 3, pages 461–463, (2021)
Cite this article

From

View current issue Submit your manuscript

3194 Accesses
42 Citations
251 Altmetric
34 Mentions
Explore all metrics

Large language models, which are increasingly used in AI applications, display undesirable stereotypes such as persistent associations between Muslims and violence. New approaches are needed to systematically reduce the harmful bias of language models in deployment.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

**Fig. 1: GPT-3 exhibits Muslim–violence bias.**

**Fig. 2: Debiasing GPT-3 completions.**

References

Mikolov, T., Chen, K., Corrado, G. & Dean, J. in Proc. International Conference on Learning Representations (ICLR, 2013).
Dai, A. M. & Le, Q. V. in Advances in Neural Information Processing Systems Vol. 28, 3079–3087 (NeurIPS, 2015).
Brown, T. et al. in Advances in Neural Information Processing Systems Vol. 33, 1877–1901 (NeurIPS, 2020).
Kitaev, N., Kaiser, L. & Levskaya, A. in Proc. International Conference on Learning Representations (ICLR, 2020).
Bolukbasi, T., Chang, K.-W., Zou, J. Y., Saligrama, V. & Kalai, A. T. in Advances in Neural Information Processing Systems Vol. 29, 4349–4357 (NeurIPS, 2016).
Nadeem, M., Bethke, A. & Reddy, S. Preprint at https://arxiv.org/abs/2004.09456 (2020).
Sheng, E., Chang, K.-W., Natarajan, P. & Peng, N. in Proc. Conference on Empirical Methods in Natural Language Processing 3407–3412 (ACL, 2019).
Bordia, S. & Bowman, S. R. in Proc. Conference of the North American Chapter of the Association for Computational Linguistics (ACL, 2019).
Lu, K., Mardziel, P., Wu, F., Amancharla, P. & Datta, A. in Logic, Language, and Security (eds Nigam, V. et al.) 189–202 (Springer, 2020).
Lewis, M. et al. in Proc. 58th Annual Meeting of the Association for Computational Linguistics 7871–7880 (ACL, 2020).
Wallace, E., Feng, S., Kandpal, N., Gardner, M. & Singh, S. in Proc. Conference on Empirical Methods in Natural Language Processing (EMNLP) 2153–2162 (ACL, 2019).
Qian, Y., Muaz, U., Zhang, B. & Hyun, J. W. Preprint at https://arxiv.org/abs/1905.12801 (2019).
Bender, E. M., Gebru, T., McMillan-Major, A. & Mitchell, S. in ACM Conference on Fairness, Accountability, and Transparency 610–623 (ACM, 2021).
Li, X. L. & Liang, P. Preprint at https://arxiv.org/abs/2101.00190 (2021).

Download references

Acknowledgements

We thank A. Abid, A. Abdalla, D. Khan, and M. Ghassemi for the helpful feedback on the manuscript and experiments. J.Z. is supported by NSF CAREER 1942926.

Author information

Authors and Affiliations

Department of Electrical Engineering, Stanford University, Stanford, CA, USA
Abubakar Abid
Department of Health Research Methods, Evidence, and Impact, McMaster University, Hamilton, Ontario, Canada
Maheen Farooqi
Department of Biomedical Data Science, Stanford University, Stanford, CA, USA
James Zou

Authors

Abubakar Abid
View author publications
You can also search for this author in PubMed Google Scholar
Maheen Farooqi
View author publications
You can also search for this author in PubMed Google Scholar
James Zou
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to James Zou.

Ethics declarations

Competing interests

The authors declare no competing interests.

Additional information

Peer review information Nature Machine Intelligence thanks Arvind Narayaran for their contribution to the peer review of this work.

Supplementary information

Supplementary Information

Supplementary discussions A–C

Rights and permissions

Reprints and permissions

About this article

Cite this article

Abid, A., Farooqi, M. & Zou, J. Large language models associate Muslims with violence. Nat Mach Intell 3, 461–463 (2021). https://doi.org/10.1038/s42256-021-00359-2

Download citation

Published: 17 June 2021
Issue Date: June 2021
DOI: https://doi.org/10.1038/s42256-021-00359-2
Springer Nature Limited

This article is cited by

Manifestations of xenophobia in AI systems
- Nenad Tomasev
- Jonathan Leader Maynard
- Iason Gabriel
AI & SOCIETY (2024)
Large language models in medicine
- Arun James Thirunavukarasu
- Darren Shu Jeng Ting
- Daniel Shu Wei Ting
Nature Medicine (2023)
Mitigating the impact of biased artificial intelligence in emergency decision-making
- Hammaad Adam
- Aparna Balagopalan
- Marzyeh Ghassemi
Communications Medicine (2022)
Shifting machine learning for healthcare from development to deployment and from models to data
- Angela Zhang
- Lei Xing
- Joseph C. Wu
Nature Biomedical Engineering (2022)
Identity of AI
- Vladan Devedzic
Discover Artificial Intelligence (2022)

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Large language models associate Muslims with violence

From

Access this article

References

Acknowledgements

Author information

Authors and Affiliations

Corresponding author

Ethics declarations

Competing interests

Additional information

Supplementary information

Supplementary Information

Rights and permissions

About this article

Cite this article

Share this article

This article is cited by

Manifestations of xenophobia in AI systems

Large language models in medicine

Mitigating the impact of biased artificial intelligence in emergency decision-making

Shifting machine learning for healthcare from development to deployment and from models to data

Identity of AI

Search

Navigation