Gini’s Multiple Regressions: Two Approaches and Their Interaction

Yitzhaki, Shlomo; Schechtman, Edna

doi:10.1007/978-1-4614-4720-7_20

Chapter
First Online: 01 January 2012

Part of the book series: Springer Series in Statistics (SSS, volume 272)

Abstract

Our target in this chapter is to illustrate one of the major advantages of Gini’s mean difference (GMD) regressions: they offer a complete framework for checking and dealing with some of the assumptions imposed on the data in a multiple regression problem. Two approaches are related to the Gini: the semi-parametric approach and the minimization approach. The interaction between the two provides tools for assessing the adequacy of the model. In addition, two tools enable the researcher to investigate the curvature of the regression curve: the extended Gini regression and the NLMA curve.
The basic idea is the following: there is an unknown regression curve that relates the dependent variable Y to all or some of a set of explanatory variables X₁, …, Xₙ. The shape of the curve is not known, so the curve is approximated by a linear model, which is then estimated from the data. However, each of the two approaches leads to a (possibly different) linear model, and the interaction between them can help to decide whether the original curve is linear in each individual explanatory variable.
The suggested stages are the following. First, one estimates the regression coefficients according to the semi-parametric approach without specifying a linear model. This means that at this stage the researcher decides only on the set of explanatory variables to be included in the regression model, not on the functional form. Then one takes the residuals from the fitted curve and tests, for each explanatory variable separately, whether they fulfill the necessary conditions for the minimization approach (which were obtained assuming linearity). If, for a given explanatory variable, these conditions are fulfilled, that is, if the hypothesis that the regression coefficients from the two approaches are equal is not rejected, then one concludes that the regression curve is linear in this variable. Otherwise one concludes that it is not (see Chap. 7 for details or below for a brief review).
This property is especially important in regressions with several explanatory variables. It enables the investigator to find a set of variables that allows linear predictions without having to commit to the linearity of the model as a whole. Provided that the linearity hypothesis is not rejected for any of the explanatory variables, one can examine the properties of the residuals, such as their distribution, whether it is symmetric around the regression line, and the serial correlation between them, using methodologies that keep the analysis within the Gini framework. Although each stage could be performed by alternative methods, we are not aware of any methodology that offers a complete set of tests governed by a unified framework and that therefore tests the assumptions behind the regression in an internally consistent way. We note in passing that the suggested test for linearity does not require replications of observations, unlike the common tests for linearity.
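
The staged procedure described above can be sketched for the case of a single explanatory variable. The Python sketch below is an illustration under stated assumptions, not the authors' implementation. The function names are hypothetical. The semi-parametric slope is taken to be cov(y, r(x))/cov(x, r(x)), where r(·) denotes the rank. The minimization slope is computed as a weighted median of pairwise slopes, which minimizes the sum of absolute pairwise differences of the residuals, a quantity proportional to their GMD. The formal test of equality of the two coefficients (Chap. 7) is not reproduced; only the diagnostic quantity cov(x, r(e)), the first-order condition of the minimization approach evaluated at the semi-parametric residuals, is printed.

```python
import math
import random


def ranks(values):
    """1-based ranks; tied values share their average rank."""
    order = sorted(range(len(values)), key=values.__getitem__)
    out = [0.0] * len(values)
    start = 0
    while start < len(order):
        end = start
        while end + 1 < len(order) and values[order[end + 1]] == values[order[start]]:
            end += 1
        for k in range(start, end + 1):
            out[order[k]] = (start + end) / 2 + 1
        start = end + 1
    return out


def cov(a, b):
    """Sample covariance with the n - 1 divisor."""
    ma, mb = sum(a) / len(a), sum(b) / len(b)
    return sum((u - ma) * (v - mb) for u, v in zip(a, b)) / (len(a) - 1)


def semi_parametric_slope(x, y):
    """Assumed form b_N = cov(y, r(x)) / cov(x, r(x)); no linearity imposed."""
    rx = ranks(x)
    return cov(y, rx) / cov(x, rx)


def minimization_slope(x, y):
    """Slope b_M minimizing sum_{i<j} |e_i - e_j| for e = y - b*x.

    A minimizer is a weighted median of the pairwise slopes with
    weights |x_i - x_j|. This is O(n^2), so small samples only.
    """
    pairs = []
    for i in range(len(x)):
        for j in range(i + 1, len(x)):
            dx = x[i] - x[j]
            if dx != 0:
                pairs.append(((y[i] - y[j]) / dx, abs(dx)))
    pairs.sort()
    half = 0.5 * sum(w for _, w in pairs)
    running = 0.0
    for slope, w in pairs:
        running += w
        if running >= half:
            return slope


def residuals(x, y, b):
    return [yi - b * xi for xi, yi in zip(x, y)]


def residual_gmd(x, y, b):
    """cov(e, r(e)), which is proportional to the GMD of the residuals."""
    e = residuals(x, y, b)
    return cov(e, ranks(e))


# Simulated data (an assumption of this sketch, not data from the chapter).
rng = random.Random(0)
x = [rng.gauss(0.0, 1.0) for _ in range(150)]
y_linear = [1.0 + 2.0 * xi + rng.gauss(0.0, 0.5) for xi in x]
y_curved = [math.exp(xi) + rng.gauss(0.0, 0.5) for xi in x]

for label, y in [("linear", y_linear), ("curved", y_curved)]:
    b_n = semi_parametric_slope(x, y)
    b_m = minimization_slope(x, y)
    foc = cov(x, ranks(residuals(x, y, b_n)))  # cov(x, r(e_N))
    print(f"{label}: b_N={b_n:.3f}  b_M={b_m:.3f}  cov(x, r(e_N))={foc:.3f}")
```

When the data come from a line, one would expect the two slopes to be close and cov(x, r(e)) to be near zero; for the curved data they will typically differ. With several explanatory variables the same comparison is made variable by variable, which this one-variable sketch does not cover.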

Notes

1. To be accurate, the investigator also has to decide whether the model is multiplicative or additive.
2. See, for example, Frick et al. (2006), who developed ANOGI, the Gini equivalent of ANOVA, and Shalit (2010) for a test for normality.
3. If one is interested in overcoming the restriction, then one should use EG regression. See Chap. 21.
4. In the empirical application we use GR* = 1 − cov(e,r(e))/cov(y,r(y)).
5. It is worth emphasizing that the connection between R-regression and GMD was not recognized in the literature mentioned above. Many of the properties of those regressions can be traced to the properties of GMD. Bowie and Bradfield (1998) compare the robustness of several alternative estimation methods in the simple regression case and find the minimization of the GMD of the residuals to be among the most robust methods.
6. Because (20.8), the GMD of the residuals, is a piecewise linear function, its partial derivative with respect to b_M may not exist, because the derivative is a step function. In this case the solutions b_M to (20.9) form a segment on the real line and b_M is determined only up to a range. The larger the sample, the lower the probability that such an event occurs.
7. The semi-parametric estimators can be viewed as OLS instrumental variable (IV) estimators, with the rank of each variable being used as an IV. However, note that the assumptions made here are entirely different (see Yitzhaki and Schechtman (2004)). Therefore, inference results for OLS IV estimators do not carry over.
8. In some applications the model used is \( T(N, Y) = N\,t\left(\frac{Y}{a(N)}\right) \), so that each member of the household is counted as one (see Ebert (2005, 2010) and Ben-Porath’s comment by Bruno and Habib (1976)).
9. The French tax system resembles this structure.
10. There are two problems with these results. The first problem is that household size is a discrete variable. In this case there is a mismatch between the LMA curve and the definition of the cumulative distribution, because the empirical cumulative distribution is defined as a step function, while in an LMA (and Lorenz) curve one connects different points of the curve by straight lines, which implies continuity (see Chap. 5). The other problem is the issue of rounding errors because of the small numbers involved. Therefore one should be careful in interpreting this result. Further research is required to resolve this issue.
11. Standard errors were calculated using the fast jackknife method.
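
The goodness-of-fit measure in Note 4 and the instrumental-variable reading in Note 7 can be illustrated together. The following is a minimal sketch under stated assumptions: e is taken to be the vector of regression residuals, r(·) the rank (with ties averaged), and the slope is computed for a single explanatory variable by using the rank of x as its instrument. The function names `gr_star` and `rank_iv_slope` and the small data set are hypothetical and serve only as illustration.

```python
def ranks(values):
    """1-based ranks; tied values share their average rank."""
    order = sorted(range(len(values)), key=values.__getitem__)
    out = [0.0] * len(values)
    start = 0
    while start < len(order):
        end = start
        while end + 1 < len(order) and values[order[end + 1]] == values[order[start]]:
            end += 1
        for k in range(start, end + 1):
            out[order[k]] = (start + end) / 2 + 1
        start = end + 1
    return out


def cov(a, b):
    """Sample covariance with the n - 1 divisor."""
    ma, mb = sum(a) / len(a), sum(b) / len(b)
    return sum((u - ma) * (v - mb) for u, v in zip(a, b)) / (len(a) - 1)


def rank_iv_slope(x, y):
    """IV-style slope cov(y, z) / cov(x, z) with instrument z = r(x)."""
    z = ranks(x)
    return cov(y, z) / cov(x, z)


def gr_star(y, e):
    """GR* = 1 - cov(e, r(e)) / cov(y, r(y)), as written in Note 4."""
    return 1.0 - cov(e, ranks(e)) / cov(y, ranks(y))


# Hypothetical data, for illustration only.
x = [1.0, 2.0, 3.0, 4.0, 5.0, 6.0]
y = [1.2, 1.9, 3.4, 3.8, 5.3, 5.9]

b = rank_iv_slope(x, y)
# The intercept is omitted: covariances are unaffected by a constant shift.
e = [yi - b * xi for xi, yi in zip(x, y)]
print(f"slope = {b:.3f}, GR* = {gr_star(y, e):.3f}")
```

Under this definition a perfect linear fit gives constant residuals and hence GR* = 1, while residuals equal to y itself give GR* = 0.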

References

Bowie, C. D., & Bradfield, D. J. (1998). Robust estimation of beta coefficients: Evidence from a small stock market. Journal of Business Finance & Accounting, 25(3/4), 439–454.
Bruno, M., & Habib, J. (1976). Taxes, family grants and redistribution. Journal of Public Economics, 5, 57–79.
D’Agostino, R. B. (1971). An omnibus test of normality for moderate and large size samples. Biometrika, 58, 341–348.
D’Agostino, R. B. (1972). Small sample probability points for the D test of normality. Biometrika, 59, 219–221.
Ebert, U. (2005). Optimal anti poverty programmes: Horizontal equity and the paradox of targeting. Economica, 72, 453–468.
Ebert, U. (2010). The decomposition of inequality reconsidered: Weakly decomposable measures. Mathematical Social Sciences, 60, 94–103.
Feldstein, M. S. (1976). On the theory of tax reform. Journal of Public Economics, 6, 77–104.
Frick, J. R., Goebel, J., Schechtman, E., Wagner, G. G., & Yitzhaki, S. (2006). Using analysis of Gini (ANOGI) for detecting whether two sub-samples represent the same universe: The German Socio-Economic Panel study (SOEP) experience. Sociological Methods and Research, 34(4), 427–468.
Hettmansperger, T. P. (1984). Statistical inference based on ranks. New York: John Wiley and Sons.
Jaeckel, L. A. (1972). Estimating regression coefficients by minimizing the dispersion of the residuals. Annals of Mathematical Statistics, 43, 1449–1458.
Jurečková, J. (1969). Asymptotic linearity of a rank statistic in regression parameter. Annals of Mathematical Statistics, 40, 1889–1900.
Jurečková, J. (1971). Nonparametric estimates of regression coefficients. Annals of Mathematical Statistics, 42, 1328–1338.
McKean, J. W., & Hettmansperger, T. P. (1978). A robust analysis of the general linear model based on one step R-estimates. Biometrika, 65, 571–579.
Olkin, I., & Yitzhaki, S. (1992). Gini regression analysis. International Statistical Review, 60(2), 185–196.
Schechtman, E., Shelef, A., Yitzhaki, S., & Zitikis, R. (2008). Testing hypotheses about absolute concentration curves and marginal conditional stochastic dominance. Econometric Theory, 24, 1044–1062.
Schechtman, E., Soffer, E., & Yitzhaki, S. (2008). The robustness of conclusions based on TIMSS mean grades, first draft.
Schechtman, E., & Yitzhaki, S. (1987). A measure of association based on Gini’s mean difference. Communications in Statistics, Theory and Methods, 16, 207–231.
Schechtman, E., Yitzhaki, S., & Pudalov, T. (2011). Gini’s multiple regressions: Two approaches and their interaction. Metron, LXIX(1), 65–97.
Shalit, H. (2010). Using OLS to test for normality, DP No. 09-12. Monaster Center for Economic Research, Ben-Gurion University of the Negev, Beer Sheva, Israel.
Yitzhaki, S. (1991). Calculating jackknife variance estimators for parameters of the Gini method. Journal of Business & Economic Statistics, 9(2), 235–239.
Yitzhaki, S., & Schechtman, E. (2004). The Gini instrumental variable, or the “double instrumental variable” estimator. Metron, LXII(3), 287–313.
Yitzhaki, S., & Schechtman, E. (2012). Identifying monotonic and non-monotonic relationships. Economics Letters, 116, 23–25.

Author information

Authors and Affiliations

Shlomo Yitzhaki: Department of Economics, The Hebrew University, Mount Scopus, Jerusalem, Israel
Edna Schechtman: Department of Industrial Engineering and Management, Ben-Gurion University of the Negev, Beer-Sheva, Israel

About this chapter

Cite this chapter

Yitzhaki, S., Schechtman, E. (2013). Gini’s Multiple Regressions: Two Approaches and Their Interaction. In: The Gini Methodology. Springer Series in Statistics, vol 272. Springer, New York, NY. https://doi.org/10.1007/978-1-4614-4720-7_20

DOI: https://doi.org/10.1007/978-1-4614-4720-7_20
Published: 25 August 2012
Publisher Name: Springer, New York, NY
Print ISBN: 978-1-4614-4719-1
Online ISBN: 978-1-4614-4720-7
eBook Packages: Mathematics and Statistics; Mathematics and Statistics (R0)
