Skip to main content

Abstract

This chapter provides an introduction to ordinary least squares (OLS) regression analysis in R. This is a technique used to explore whether one or multiple variables (the independent variable or X) can predict or explain the variation in another variable (the dependent variable or Y). OLS regression belongs to a family of techniques called generalized linear models, so the variables being examined must be measured at the ratio or interval level and have a linear relationship. The chapter also reviews how to assess model fit using regression error (the difference between the predicted and actual values of Y) and R2. While you learn these techniques in R, you will be using the Crime Survey for England and Wales data from 2013 to 2014; these data derive from a face-to-face survey that asks people about their experiences of crime during the 12 months prior to interview.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

eBook
USD 16.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 64.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info
Hardcover Book
USD 99.99
Price excludes VAT (USA)
  • Durable hardcover edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

  • Bock, D. E., Velleman, P. F., & De Veaux, R. D. (2012). Stats: modeling the world. Boston: Addison-Wesley.

    Google Scholar 

  • Cullen, F. T., Cao, L., Frank, J., Langworthy, R. H., Browning, S. L., Kopache, R., & Stevenson, T. J. (1996). Stop or I’ll shoot: Racial differences in support for police use of deadly force. American Behavioral Scientist, 39(4), 449–460.

    Article  Google Scholar 

  • Gelman, A., & Hill, J. (2007). Data analysis using regression and multilevel/hierarchical models. New York, NY, USA: Cambridge University Press.

    Google Scholar 

  • Tyler, T. R., & Fagan, J. (2008). Legitimacy and cooperation: Why do people help the police fight crime in their communities? Ohio State Journal of Criminal Law, 6, 231.

    Google Scholar 

  • Weisburd, D., Britt, C., Wilson, D., & Wooditch, A. (2021). Basics in statistics in criminology and criminal justice, Edition 5. Springer Science & Business Media.

    Google Scholar 

  • Weisburd, D., & Piquero, A. R. (2008). How well do criminologists explain crime? Statistical modeling in published studies. Crime and Justice, 37(1), 453–502.

    Article  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Electronic Supplementary Material

Data 15.1

(CSV 1.74 Mb)

Key Terms

Bivariate regression

A technique for predicting change in a dependent variable using one independent variable.

Dependent variable ( Y )

The variable assumed by the researcher to be influenced by one or more independent variables.

Heteroscedasticity

A situation in which the variances of scores on two or more variables are not equal. Heteroscedasticity violates one of the assumptions of the parametric test of statistical significance for the correlation coefficient.

Independent variable ( X )

A variable assumed by the researcher to have an impact on the value of the dependent variable, Y.

Multicollinearity

Condition in a multivariate regression model in which independent variables examined are very strongly intercorrelated. Multicollinearity leads to unstable regression coefficients.

OLS regression

See ordinary least squares regression analysis.

Ordinary least squares regression analysis

A type of regression analysis in which the sum of squared errors from the regression line is minimized.

Percent of variance explained ( R 2 )

A measure for evaluating how well the regression model predicts values of Y. It represents the improvement in predicting Y that the regression line provides over the mean of Y.

Regression coefficient ( b )

A statistic used to assess the influence of an independent variable, X, on a dependent variable, Y. The regression coefficient b is interpreted as the estimated change in Y that is associated with a one-unit change in X.

Regression error ( e )

The difference between the predicted value of Y and the actual value of Y.

Regression line

The line predicting values of Y. The line is plotted from knowledge of the Y-intercept and the regression coefficient.

Regression model

The hypothesized statement by the researcher of the factor or factors that define the value of the dependent variable, Y. The model is normally expressed in equation form.

Y -intercept ( b 0 )

The expected value of Y when X = 0. The Y-intercept is used in predicting values of Y.

Rights and permissions

Reprints and permissions

Copyright information

© 2021 The Author(s), under exclusive license to Springer Nature Switzerland AG

About this chapter

Check for updates. Verify currency and authenticity via CrossMark

Cite this chapter

Wooditch, A., Johnson, N.J., Solymosi, R., Medina Ariza, J., Langton, S. (2021). Ordinary Least Squares Regression. In: A Beginner’s Guide to Statistics for Criminology and Criminal Justice Using R. Springer, Cham. https://doi.org/10.1007/978-3-030-50625-4_15

Download citation

  • DOI: https://doi.org/10.1007/978-3-030-50625-4_15

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-030-50624-7

  • Online ISBN: 978-3-030-50625-4

  • eBook Packages: Law and CriminologyLaw and Criminology (R0)

Publish with us

Policies and ethics