Implications of Stratified Sampling for Fair Lending Binary Logit Models

Clarke, Judith A.; Courchane, Marsha J.

doi:10.1007/s11146-004-4829-5

Implications of Stratified Sampling for Fair Lending Binary Logit Models

Original Article
Published: February 2005

Volume 30, pages 5–31, (2005)
Cite this article

The Journal of Real Estate Finance and Economics Aims and scope Submit manuscript

Judith A. Clarke¹ &
Marsha J. Courchane²

146 Accesses
5 Citations
Explore all metrics

Abstract

We examine the effect of sample design on estimation and inference for disparate treatment in binary logistic models used to assess for fair lending. Our Monte Carlo experiments provide information on how sample design affects efficiency (in terms of mean squared error) of estimation of the disparate treatment parameter and power of a test for statistical insignificance of this parameter. The sample design requires two decision levels: first, the degree of stratification of the loan applicants (Level I Decision) and secondly, given a Level I Decision, how to allocate the sample across strata (Level II Decision). We examine four Level I stratification strategies: no stratification (simple random sampling), exogenously stratifying loan cases by race, endogenously stratifying cases by loan outcome (denied or approved), and stratifying exogenously by race and endogenously by outcome. Then, we consider five Level II methods: proportional, balanced, and three designs based on applied studies. Our results strongly support the use of stratifying by both race and loan outcome coupled with a balanced sample design when interest is in estimation of, or testing for statistical significance of, the disparate treatment parameter.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

References

Anderson, J. A. (1972). “Separate Sample Logistic Discrimination,” Biometrika 59, 19–35.
Google Scholar
Avery, R. B., P. E. Beeson, and P. S. Calem. (1997). “Using HMDA Data as a Regulatory Screen for Fair Lending Compliance,” Journal of Financial Services Research 11, 9–42.
Google Scholar
Breslow, N. E., and N. Chatterjee. (1999). “Design and Analysis of Two Phase Studies with Binary Outcome Applied to Wilms Tumour Prognosis,” Applied Statistics 48, 457–468.
Google Scholar
Bull, S. B. (1993). “Sample Size and Power Determination for a Binary Outcome and an Ordinal Exposure when Logistic Regression Analysis is Planned,” American Journal of Epidemiology 137, 676–684.
Google Scholar
Calem, P. S., and S. D. Longhofer. (2002). “Anatomy of Fair Lending Exam: The Uses and Limitations of Statistics,” Journal of Real Estate Finance and Economics 24, 207–237.
Google Scholar
Calem, P. S., and M. Stutzer. (1995). “The Simple Analytics of Observed Discrimination in Credit Markets,” Journal of Financial Intermediation 4, 189–212.
Google Scholar
Carr, J. H., and I. F. Megbolugbe. (1993). “The Federal Reserve Bank of Boston Study on Mortgage Lending Revisited,” Journal of Housing Research 4, 277–313.
Google Scholar
Cosslett, S. (1981a). “Maximum Likelihood Estimators for Choice-Based Samples,” Econometrica 49, 1289–1316.
Google Scholar
Cosslett, S. (1981b). “Efficient Estimation of Discrete-Choice Models.” In C. F. Manski and D. McFadden (eds.), Structural Analysis of Discrete Data with Econometric Applications, Cambridge, Mass.: MIT Press, pp. 51–111.
Google Scholar
Courchane, M., D. Nebhut, and D. Nickerson. (2000). “Lessons Learned: Statistical Techniques and Fair Lending,” Journal of Housing Research 11, 277–295.
Google Scholar
Day, T. E. and S. J. Liebowitz (1998). “Mortgage Lending to Minorities: Where’s the Bias?,” Economic Inquiry 36, 3–28.
Google Scholar
Glennon, D., and M. Stengel. (1994). “An Evaluation of the Federal Reserve Bank of Boston’s Study of Racial Discrimination in Mortgage Lending.” Office of the Comptroller of the Currency, Economic and Policy Analysis Working Paper 94–2.
Harrison, G. W. (1998). “Mortgage Lending in Boston: A Reconsideration of the Evidence,” Economic Inquiry 36, 29–38.
Google Scholar
Horne, D. K. (1994). “Evaluating the Role of Race in Mortgage Lending,” FDIC Banking Review 7, 1–15.
Google Scholar
Horne, D. K. (1997). “Mortgage Lending, Race, and Model Specification,” Journal of Financial Services Research 11, 43–68.
Google Scholar
Kao, T.-C., and G. P. McCabe. (1991). “Optimal Sample Allocation for Normal Discrimination and Logistic Regression under Stratified Sampling,” Journal of the American Statistical Association 86, 432–436.
Google Scholar
Ladd, H. F. (1998). “Evidence on Discrimination in Mortgage Lending,” Journal of Economic Perspectives 12, 41–62.
Google Scholar
Longhofer, S., and S. Peters. (1999). “Beneath the Rhetoric: Clarifying the Debate on Mortgage Lending Discrimination,” Federal Reserve Bank of Cleveland, Economic Review 34, 2–13.
Google Scholar
Manski, C. F., and S. R. Lerman. (1977). “The Estimation of Choice Probabilities from Choice Based Samples,” Econometrica 45, 1977–1988.
Google Scholar
Munnell, A. H., L. E. Browne, J. McEneaney, and G. M. B. Tootell. (1992). “Mortgage Lending in Boston: Interpreting HMDA Data,” Federal Reserve Bank of Boston, Working Paper 92-7.
Munnell, A. H., G. M. B. Tootell, L. E. Browne, and J. McEneaney. (1996). “Mortgage Lending in Boston: Interpreting HMDA Data,” American Economic Review 86, 25–53.
Google Scholar
Prentice, R. L., and R. Pyke. (1979). “Logistic Disease Incidence Models and Case-Control Studies,” Biometrika 66, 403–411.
Google Scholar
Scott, A. J., and C. J. Wild. (1986). “Fitting Logistic Models under Case-Control or Choice Based Sampling,” Journal of the Royal Statistical Society, Series B 23, 469–476.
Google Scholar
Scott, A. J., and C. J. Wild. (1991). “Fitting Logistic Regression Models in Stratified Case-Control Studies,” Biometrics 47, 497–510.
Google Scholar
Self, S. G., and R. H. Mauritsen. (1988). “Power/Sample Size Calculations for Generalized Linear Models,” Biometrics 44, 79–86.
Google Scholar
Stengel, M., and D. Glennon. (1999). “Evaluating Statistical Models of Mortgage Lending Discrimination: A Bank-Specific Analysis,” Real Estate Economics 27, 299–334.
Google Scholar
Whittemore, A. (1981). “Sample Size for Logistic Regression with Small Response Probability,” Journal of the American Statistical Association 76, 27–32.
Google Scholar
Windmeijer, F. A. G. (1995). “Goodness-of-fit Measure in Binary Choice Models,” Econometric Reviews 14, 101–116.
Google Scholar

Download references

Author information

Authors and Affiliations

Department of Economics, University of Victoria, PO Box 1700, Victoria, BC, Canada, V8X 5A3
Judith A. Clarke
Welch Consulting, 8757 Georgia Avenue, Suite 520, Silver Spring, MD, 20910, USA
Marsha J. Courchane

Authors

Judith A. Clarke
View author publications
You can also search for this author in PubMed Google Scholar
Marsha J. Courchane
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Judith A. Clarke.

Rights and permissions

Reprints and permissions

About this article

Cite this article

Clarke, J.A., Courchane, M.J. Implications of Stratified Sampling for Fair Lending Binary Logit Models. J Real Estate Finan Econ 30, 5–31 (2005). https://doi.org/10.1007/s11146-004-4829-5

Download citation

Issue Date: February 2005
DOI: https://doi.org/10.1007/s11146-004-4829-5

Key Words

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Implications of Stratified Sampling for Fair Lending Binary Logit Models

Abstract

Access this article

Similar content being viewed by others

Racial differences in access to mortgage lending: comparison across major institutions

Intergroup disparity among student loan borrowers

Investigating bank lending discrimination in the US using CRA-rated banks’ HMDA loan data

References

Author information

Authors and Affiliations

Corresponding author

Rights and permissions

About this article

Cite this article

Key Words

Navigation

Implications of Stratified Sampling for Fair Lending Binary Logit Models

Abstract

Access this article

Similar content being viewed by others

Racial differences in access to mortgage lending: comparison across major institutions

Intergroup disparity among student loan borrowers

Investigating bank lending discrimination in the US using CRA-rated banks’ HMDA loan data

References

Author information

Authors and Affiliations

Corresponding author

Rights and permissions

About this article

Cite this article

Share this article

Key Words

Search

Navigation