Computation of determinantal subset influence in regression
- 61 Downloads
One of the important goals of regression diagnostics is the detection of cases or groups of cases which have an inordinate impact on the regression results. Such observations are generally described as ‘influential’. A number of influence measures have been proposed, each focusing on a different aspect of the regression. For single cases, these measures are relatively simple and inexpensive to calculate. However, the detection of multiple-case or joint influence is more difficult on two counts. First, calculation of influence for a single subset is more involved than for an individual case, and second, the sheer number of subsets of cases makes the computation overwhelming for all but the smallest data sets.
Barrett and Gray (1992) described methods for efficiently examining subset influence for those measures that can be expressed as the trace of a product of positive semidefinite (psd) matrices. There are, however, other popular measures that do not take this form, but rather are expressible as the ratio of determinants of psd matrices. This article focuses on reducing the computation for the determinantal ratio measures by making use of upper and lower bounds on the influence to limit the number of subsets for which the actual influence must be explicitly determined.
KeywordsCovariance ratio multiple-case influence regression diagnostics
Unable to display preview. Download preview PDF.
- Barrett, B. E. and Gray, J. B. (1992) Efficient computation of subset influence in regression. Journal of Computational and Graphical Statistics, 1, 271–286.Google Scholar
- Barrett, B. E. and Ling, R. F. (1992) General classes of influence measures for multivariate regression. Journal of the American Statistical Association, 87, 184–191.Google Scholar
- Belsley, D. A., Kuh, E. and Welsch, R. E. (1980) Regression Diagnostics: Identifying Influential Data and Sources of Collinearity, Wiley, New York.Google Scholar
- Cook, R. D. and Weisberg, S. (1980) Characterizations of an empirical influence function for detecting influential cases in regression. Technometrics, 22, 495–508.Google Scholar
- Cook, R. D. and Weisberg, S. (1982) Residuals and Influence in Regression, Chapman & Hall, New York.Google Scholar