Skip to main content

Gini’s Multiple Regressions: Two Approaches and Their Interaction

  • Chapter
  • First Online:
  • 3360 Accesses

Part of the book series: Springer Series in Statistics ((SSS,volume 272))

Abstract

Our target in this chapter is to illustrate one of the major advantages of the GMD regressions: they offer a complete framework for checking and dealing with some of the assumptions imposed on the data in a multiple regression problem. There are two approaches that are related to the Gini—the semi-parametric approach and the minimization approach. The interaction between the two gives tools for assessing the adequacy of the model. In addition, there are two tools that enable the researcher to investigate the curvature of the regression curve: the extended Gini regression and the NLMA curve. The basic idea is the following: there is an unknown regression curve that relates the dependent variable Y and (all or some out of) a set of explanatory variables X1,…,Xn. The shape of the curve is not known. The curve is approximated by a linear model (which is then estimated from the data). However, each approach mentioned above leads to a (possibly different) linear model. The interaction between the two approaches can help to decide whether the original curve is linear (in each individual explanatory variable) or not. The suggested stages are the following: first one estimates the regression coefficients according to the semi-parametric approach without specifying a linear model. This means that at this stage the researcher decides only on the set of explanatory variables to be included in the regression model but not on the functional form. Then one uses the residuals from the fitted curve and tests whether they fulfill the necessary conditions for the minimization approach (which were obtained assuming linearity) for each explanatory variable separately. If for any given explanatory variable the above conditions are fulfilled; that is, if the hypothesis that the two regression coefficients are equal is not rejected, then one concludes that the regression curve is linear in this variable. Otherwise it is not (see Chap. 7 for details or below for a brief review). This property is especially important in regressions with several explanatory variables. It enables the investigator to find a set of variables that allows linear predictions without having to commit to the linearity of the model as a whole. Provided that the linearity hypothesis is not rejected for all explanatory variables one can examine the properties of the residuals such as their distribution, whether it is symmetric around the regression line or not, the serial correlation between them, etc., using the methodologies that will keep the analysis under the Gini framework. Although each stage could be performed by alternative methods, we are not aware of any methodology that can offer a complete set of tests that is governed by a unified framework and therefore offers a method to test the assumptions behind the regression with an internal consistency. We note in passing that the suggested test for linearity does not require replications of observations, as is the case in the common tests for linearity.

This is a preview of subscription content, log in via an institution.

Buying options

Chapter
USD   29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD   89.00
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD   119.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info
Hardcover Book
USD   129.99
Price excludes VAT (USA)
  • Durable hardcover edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Learn about institutional subscriptions

Notes

  1. 1.

    To be accurate, the investigator has also to decide whether the model is multiplicative or additive.

  2. 2.

    See, for example, Frick et al. (2006) who developed ANOGI—the Gini equivalent of ANOVA, and Shalit (2010) for a test for normality.

  3. 3.

    If one is interested in overcoming the restriction, then one should use EG regression. See Chap. 21.

  4. 4.

    In the empirical application we use GR* = 1 − cov(e,r(e))/cov(y,r(y)).

  5. 5.

    It is worth emphasizing that the connection between R-regression and GMD was not recognized in the literature mentioned above. Many of the properties of those regressions can be traced to the properties of GMD. Bowie and Bradfield (1998) compare the robustness of several alternative estimation methods in the simple regression case and find the minimization of the GMD of the residuals among the most robust methods.

  6. 6.

    Because (20.8), the GMD of the residuals, is a piecewise linear function, its partial derivative with respect to bM may not exist because the derivative is a step function. In this case the solutions bM to (20.9) form a segment on the real line and bM is determined up to a range. The larger the sample the lower the probability that such an event occurs.

  7. 7.

    The semi-parametric estimators can be viewed as OLS instrumental variable (IV) estimators, with the rank of each variable being used as an IV. However, note that the assumptions that are assumed here are entirely different (see Yitzhaki and Schechtman (2004)). Therefore the inference cannot be drawn from there.

  8. 8.

    In some applications the model used is \( {\rm{T}}\left( {{\rm{N}},{\rm{Y}}} \right) = {\rm{Nt(}}\frac{\rm{Y}}{{{\rm{a(N)}}}}) \), so that each member of the household is counted as one (see Ebert (2005, 2010) and Ben-Porath’s comment by Bruno and Habib (1976)).

  9. 9.

    The French tax system resembles this structure.

  10. 10.

    There are two problems with these results. The first problem is that household’s size is a discrete variable. In this case there is a mismatch between the LMA curve and the definition of cumulative distribution, because the empirical cumulative distribution is defined as a step function, while in an LMA (and Lorenz) curve one connects different points of the curve by straight lines, which implies continuity (see Chap. 5). The other problem is the issue of rounding errors because of small numbers involved. Therefore one should be careful in interpreting this result. Further research is required to resolve this issue.

  11. 11.

    Standard errors were calculated using Jackknife fast method.

References

  • Bowie, C. D., & Bradfield, D. J. (1998). Robust estimation of beta coefficients: Evidence from small stock market. Journal of Business Finance & Accounting, 25(3/4 (April/May)), 439–454.

    Article  Google Scholar 

  • Bruno, M., & Habib, J. (1976). Taxes, family grants and redistribution. Journal of Public Economics, 5, 57–79.

    Article  Google Scholar 

  • D’Agostino, R. B. (1972). Small sample probability points for the D test of normality. Biometrika, 59, 219–221.

    Article  Google Scholar 

  • D’Agostino, R. B. (1971). An omnibus test of normality for moderate and large size samples. Biometrika, 58, 341–348.

    Article  MathSciNet  MATH  Google Scholar 

  • Ebert, U. (2005). Optimal anti poverty programmes: Horizontal equity and the paradox of targeting. Economica, 72, 453–468.

    Article  Google Scholar 

  • Ebert, U. (2010). The decomposition of inequality reconsidered: Weakly decomposable measures. Mathematical Social Sciences, 60, 94–103.

    Article  MathSciNet  MATH  Google Scholar 

  • Feldstein, M. S. (1976). On the theory of tax reform. Journal of Public Economic, 6, 77–104.

    Article  Google Scholar 

  • Frick, R. J., Goebel, J., Schechtman, E., Wagner, G. G., & Yitzhaki, S. (2006). Using analysis of Gini (ANOGI) for detecting whether two sub-samples represent the same universe: The German Socio-Economic Panel study (SOEP) experience. Sociological Methods and Research, 34(4 (May)), 427–468.

    Article  MathSciNet  Google Scholar 

  • Hettmansperger, T. P. (1984). Statistical inference based on ranks. New York: John Wiley and Sons.

    MATH  Google Scholar 

  • Jaeckel, L. A. (1972). Estimating regression coefficients by minimizing the dispersion of the residuals. Annals of Mathematical Statistics, 43, 1449–1458.

    Article  MathSciNet  MATH  Google Scholar 

  • Jurečková, J. (1969). Asymptotic linearity of a rank statistic in regression parameter. Annals of Mathematical Statistics, 40, 1889–1900.

    Article  MathSciNet  Google Scholar 

  • Jurečková, J. (1971). Nonparametric estimates of regression coefficients. Annals of Mathematical Statistics, 42, 1328–1338.

    Article  MathSciNet  MATH  Google Scholar 

  • McKean, J. W., & Hettmansperger, T. P. (1978). A robust analysis of the general linear model based on one step R-estimates. Biometrika, 65, 571–579.

    MathSciNet  MATH  Google Scholar 

  • Olkin, I., & Yitzhaki, S. (1992). Gini regression analysis. International Statistical Review, 60(2), 185–196.

    Article  MATH  Google Scholar 

  • Schechtman, E., Shelef, A., Yitzhaki, S., & Zitikis, R. (2008). Testing hypotheses about absolute concentration curves and marginal conditional stochastic dominance. Econometric Theory, 24, 1044–1062.

    Article  MATH  Google Scholar 

  • Schechtman, E., Soffer, E., & Yitzhaki, S. (2008). The robustness of conclusions based on TIMSS mean grades, first draft.

    Google Scholar 

  • Schechtman, E., & Yitzhaki, S. (1987). A measure of association based on Gini’s mean difference. Communications in Statistics, Theory and Methods, 16, 207–231.

    Article  MathSciNet  MATH  Google Scholar 

  • Schechtman, E., Yitzhaki, S., & Pudalov, T. (2011). Gini’s multiple regressions: Two approaches and their interaction. Metron, LXIX(1), 65–97.

    Google Scholar 

  • Shalit, H. (2010). Using OLS to test for normality, DP No. 09-12. Monaster Center for Economic Research, Ben-Gurion University of the Negev, Beer Sheva, Israel.

    Google Scholar 

  • Yitzhaki, S. (1991). Calculating jackknife variance estimators for parameters of the Gini method. Journal of Business & Economics Statistics, 9(2), 235–239.

    Google Scholar 

  • Yitzhaki, S., & Schechtman, E. (2004). The Gini instrumental variable, or the “double instrumental variable” estimator. Metron, LXII(3), 287–313.

    MathSciNet  Google Scholar 

  • Yitzhaki, S., & Schechtman, E. (2012). Identifying monotonic and non-monotonic relationships. Economics Letters, Economics Letters, 116, 23–25.

    Article  MathSciNet  MATH  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Rights and permissions

Reprints and permissions

Copyright information

© 2013 Springer Science+Business Media New York

About this chapter

Cite this chapter

Yitzhaki, S., Schechtman, E. (2013). Gini’s Multiple Regressions: Two Approaches and Their Interaction. In: The Gini Methodology. Springer Series in Statistics, vol 272. Springer, New York, NY. https://doi.org/10.1007/978-1-4614-4720-7_20

Download citation

Publish with us

Policies and ethics