Abstract
Racial discrimination remains a critical issue in the United States. Hence, the study employs a logistic regression model to predict racial arrests based on demographics. Using a secondary open dataset from the City of Albany’s official website from November 2020 to September 27, 2021, the results revealed that the variables with relative influence on the race of an individual arrested are the age, neighborhood, and sex variables, at 80%, 12%, and 8%, respectively. The model results demonstrated that the likelihood of a white male between the age of 18 and 25 years arrested in a predominantly white neighborhood is significantly less than being black. Limitations of the study include small sample size (10 months of historical data); and under-specification of the Logistic regression model due to excluding one or more relevant independent variables from the model.
Keywords
- Race
- United States
- Racial discrimination
- Racial profiling
- Neighborhoods
- Policing
- Predictive modeling
- Supervised learning
- Logistic regression
- Open data
Authors Alice Nneka Ottah and Yvonne Appiah Dadson have contributed equally to this work.
This is a preview of subscription content, access via your institution.
Buying options
Tax calculation will be finalised at checkout
Purchases are for personal use only
Learn about institutional subscriptionsReferences
Racial Discrimination | Country Page | World | Human Rights Watch. (n.d.). Retrieved September 16, 2021, from https://www.hrw.org/united-states/racial-discrimination
APD Arrests Dataset by Neighborhood | City of Albany. (n.d.). Retrieved September 16, 2021, from https://data.albanyny.gov/Public-Safety/APD-Arrests-Dataset-by-Neighborhood/7y34-47cz
B.E. Harcourt, Risk as a proxy for race: The dangers of risk assessment. Federal Sentencing Reporter 27(4), 237–243 (2015)
W.D. Heaven, Predictive policing algorithms are racist. They need to be dismantled. MIT Technology Review, 17, 2020 (2020)
Law Enforcement & Juvenile Crime, Office of Juvenile Justice and Delinquency Prevention, U. S. Department of Justice Office (2019). Retrieved on October 26 from https://www.ojjdp.gov/ojstatbb/crime/ucr.asp?table_in=2
J. Angwin, J. Larson, S. Mattu, L. Kirchner, Machine bias (2016). ProPublica, May 23, 2016. Retrieved October 24, 2021 from https://www.propublica.org/article/machine-bias-risk-assessments-in-criminal-sentencing
P.J. Brantingham, M. Valasik, G.O. Mohler, Does predictive policing lead to biased arrests? Results from a randomized controlled trial. Stat. Public Policy 5(1), 1–6 (2018) Calculate skewness in R. Programming R. (n.d.). Retrieved October 26, 2021, from https://www.programmingr.com/statistics/skewness/
J. Whitehead, An introduction to logistic regression. Department of Economics, Appalachian State University (1999). Retrieved December 2, 2021, from http://www.appstate.edu/~whiteheadjc/service/logit/
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2023 The Author(s), under exclusive license to Springer Nature Switzerland AG
About this paper
Cite this paper
Ottah, A.N., Dadson, Y.A., Caramancion, K.M. (2023). A Demographic Model to Predict Arrests by Race: An Exploratory Approach. In: Latifi, S. (eds) ITNG 2023 20th International Conference on Information Technology-New Generations. ITNG 2023. Advances in Intelligent Systems and Computing, vol 1445. Springer, Cham. https://doi.org/10.1007/978-3-031-28332-1_49
Download citation
DOI: https://doi.org/10.1007/978-3-031-28332-1_49
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-031-28331-4
Online ISBN: 978-3-031-28332-1
eBook Packages: EngineeringEngineering (R0)