Matching in R
The statistical package R is used to construct several matched samples from one data set. The focus is on the mechanics of using R, not on the design of observational studies. The process is made tangible by describing it in detail, closely inspecting intermediate results; however, essentially, three steps are involved, (i) creating a distance matrix, (ii) adding a propensity score caliper to the distance matrix, and (iii) finding an optimal match. One appendix contains a short introduction to R. A second appendix contains short R functions used to create distance matrices used in matching.
KeywordsPropensity Score Distance Matrix Statist Assoc High School Senior Optimal Match
Unable to display preview. Download preview PDF.
- Aitkin, M., Francis, B., Hinde, J., Darnell, R.: Statistical Modelling in R. New York: Oxford University Press (2009)Google Scholar
- Card, D., Krueger, A.: Minimum wages and employment: A case study of the fast-food industry in New Jersey and Pennsylvania. Am Econ Rev 84, 772–793 (1994) Data: http://www.irs.princeton.edu/
- Fleiss, J.L., Levin, B., Paik, M.C.: Statistical Methods for Rates and Proportions. New York: Wiley (2001)Google Scholar
- Hansen, B.B.: Optmatch: Flexible, optimal matching for observational studies. R News 7, 18–24 (2007)Google Scholar
- LaLonde, R.J.: Evaluating the econometric evaluations of training programs with experimental data. Am Econ Rev 76, 604–620 (1986)Google Scholar
- Maindonald, J., Braun, J.: Data Analysis and Graphics Using R. New York: Cambridge University Press (2001)Google Scholar
- R Development Core Team.: R: A Language and Environment for Statistical Computing. Vienna: R Foundation, http://www.R-project.org (2007)
- Rosenbaum, P.R.: Permutation tests for matched pairs with adjustments for covariates. Appl Statist 37, 401–411 (1988) (Correction: [27, §3])Google Scholar
- Shadish, W. R., Cook, T. D., Campbell, D.T.: Experimental and Quasi-Experimental Designs for Generalized Causal Inference. Boston: Houghton-Mifflin (2002)Google Scholar
- Wooldridge, J.M.: Econometric Analysis of Cross Section and Panel Data. Cambridge, MA: MIT Press. (2002)Google Scholar