Dangers of Bias in Data-Intensive Information Systems

Park, Baekkwan; Rao, Dhana L.; Gudivada, Venkat N.

doi:10.1007/978-981-15-4851-2_28

Part of the book series: Advances in Intelligent Systems and Computing (AISC, volume 1162)

Abstract

Data-intensive information systems (DIS) are pervasive and affect people in virtually all walks of life. Artificial intelligence and machine learning technologies are the backbone of DIS. The various types of bias embedded in DIS have serious implications for individuals as well as for society at large. In this paper, we discuss various types of bias, both human and machine, and suggest ways to eliminate or minimize them. We also make a case for digital ethics education and outline ways to incorporate such education into computing curricula.

Author information

Authors and Affiliations

Center for Survey Research, East Carolina University, Greenville, NC, 27855, USA
Baekkwan Park
Department of Biology, East Carolina University, Greenville, NC, 27855, USA
Dhana L. Rao
Department of Computer Science, East Carolina University, Greenville, NC, 27855, USA
Venkat N. Gudivada

Corresponding author

Correspondence to Venkat N. Gudivada.

Editor information

Editors and Affiliations

Department of Computer Engineering, Dr. Babasaheb Ambedkar Technological University, Lonere, Maharashtra, India
Prachi Deshpande
Machine Intelligence Research Labs (MIR Labs), Auburn, WA, USA
Ajith Abraham
Department of Electronics and Telecommunication Engineering, Dr. Babasaheb Ambedkar Technological University, Lonere, Maharashtra, India
Brijesh Iyer
School of Information Science and Engineering, University of Jinan, Jinan, Shandong, China
Kun Ma

About this paper

Cite this paper

Park, B., Rao, D.L., Gudivada, V.N. (2021). Dangers of Bias in Data-Intensive Information Systems. In: Deshpande, P., Abraham, A., Iyer, B., Ma, K. (eds) Next Generation Information Processing System. Advances in Intelligent Systems and Computing, vol 1162. Springer, Singapore. https://doi.org/10.1007/978-981-15-4851-2_28

DOI: https://doi.org/10.1007/978-981-15-4851-2_28
Published: 14 June 2020
Publisher Name: Springer, Singapore
Print ISBN: 978-981-15-4850-5
Online ISBN: 978-981-15-4851-2
eBook Packages: Intelligent Technologies and Robotics, Intelligent Technologies and Robotics (R0)
