A toolbox of permutation tests for structural change

Zeileis, Achim; Hothorn, Torsten

doi:10.1007/s00362-013-0503-4

A toolbox of permutation tests for structural change

Regular Article
Published: 09 February 2013

Volume 54, pages 931–954, (2013)
Cite this article

Statistical Papers Aims and scope Submit manuscript

Achim Zeileis¹ &
Torsten Hothorn²

608 Accesses
11 Citations
Explore all metrics

Abstract

The sup$LM$ test for structural change is embedded into a permutation test framework for a simple location model. The resulting conditional permutation distribution is compared to the usual (unconditional) asymptotic distribution, showing that the power of the test can be clearly improved in small samples. Furthermore, the permutation test is embedded into a general framework that encompasses tools for binary and multivariate dependent variables as well as model-based permutation testing for structural change. It is also demonstrated that the methods can not only be employed for analyzing structural changes in time series data but also for recursive partitioning of cross-section data. The procedures suggested are illustrated using both artificial data and empirical applications (number of youth homicides, employment discrimination data, carbon flux in tropical forests, stock returns, and demand for economics journals).

This is a preview of subscription content, log in via an institution to check access.

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Permutation Tests in Linear Regression

Permutation tests for experimental data

Article Open access 01 April 2023

Permutation Tests Using Arbitrary Permutation Distributions

Article Open access 03 June 2023

References

Andrews DWK (1991) Heteroskedasticity and autocorrelation consistent covariance matrix estimation. Econometrica 59:817–858
Article MathSciNet MATH Google Scholar
Andrews DWK (1993) Tests for parameter instability and structural change with unknown change point. Econometrica 61:821–856
Article MathSciNet MATH Google Scholar
Antoch J, Hušková M (2001) Permutation tests in change point analysis. Stat Probab Lett 53:37–46
Article MATH Google Scholar
Bergstrom TC (2001) Free labor for costly journals? J Econ Perspect 15:183–198
Article Google Scholar
Bonal D, Bosc A, Ponton S, Goret JY, Burban B, Gross P, Bonnefond JM, Elbers J, Longdoz B, Epron D, Guehl JM, Granier A (2008) Impact of severe dry season on net ecosystem exchange in the neotropical rainforest of French Guiana. Glob Chang Biol 14(8):1917–1933
Article Google Scholar
Boulesteix AL, Strobl C (2007) Maximally selected chi-squared statistics and non-monotonic associations: an exact approach based on two cutpoints. Comput Stat Data Anal 51(12):6295–6306
Article MathSciNet MATH Google Scholar
Brown RL, Durbin J, Evans JM (1975) Techniques for testing the constancy of regression relationships over time. J R Stat Soc B 37:149–163
MathSciNet MATH Google Scholar
Chambers JQ, Schimel JP, Nobre AD (2001) Respiration from coarse wood litter in central Amazon forests. Biogeochemistry 52(2):115–131
Article Google Scholar
Chow GC (1960) Tests of equality between sets of coefficients in two linear regressions. Econometrica 28:591–605
Article MathSciNet MATH Google Scholar
Ernst MD (2004) Permutation methods: a basis for exact inference. Stat Sci 19(4):676–685
Article MathSciNet MATH Google Scholar
Fisher RA (1935) The design of experiments. Oliver and Boyd, Edinburgh
Google Scholar
Freidlin B, Gastwirth JL (2000) Changepoint tests designed for the analysis of hiring data arising in employment discrimination cases. J Bus Econ Stat 18(3):315–322
Google Scholar
Genz A (1992) Numerical computation of multivariate normal probabilities. J Comput Graph Stat 1:141–149
Google Scholar
Hansen BE (1997) Approximate asymptotic $p$ values for structural-change tests. J Bus Econ Stat 15:60–67
Google Scholar
Hansen BE (2000) Testing for structural change in conditional models. J Econ 97:93–115
Article MATH Google Scholar
Hastie T, Tibshirani R, Friedman J (2009) The elements of statistical learning, 2nd edn. Springer, New York
Book MATH Google Scholar
Hothorn T, Lausen B (2003) On the exact distribution of maximally selected rank statistics. Comput Stat Data Anal 43(2):121–137
Article MathSciNet MATH Google Scholar
Hothorn T, Zeileis A (2008) Generalized maximally selected statistics. Biometrics 64:1263–1269
Article MathSciNet MATH Google Scholar
Hothorn T, Hornik K, van de Wiel MA, Zeileis A (2006a) A Lego system for conditional inference. Am Stat 60(3):257–263
Article Google Scholar
Hothorn T, Hornik K, Zeileis A (2006b) Unbiased recursive partitioning: a conditional inference framework. J Comput Graph Stat 15(3):651–674
Article MathSciNet Google Scholar
Hothorn T, Hornik K, van de Wiel MA, Zeileis A (2008) Implementing a class of permutation tests: the coin package. J Stat Softw 28(8):1–23. http://www.jstatsoft.org/v28/i08/
Google Scholar
Hsu DA (1979) Detecting shifts of parameter in gamma sequences with applications to stock price and air traffic flow analysis. J Am Stat Assoc 74:31–40
Article Google Scholar
Kennedy PE (1995) Randomization tests in econometrics. J Bus Econ Stat 13(1):85–94
Google Scholar
Kirch C (2007) Block permutation principles for the change analysis of dependent data. J Stat Plan Inference 137:2453–2474
Article MathSciNet MATH Google Scholar
Kirch C, Steinebach J (2006) Permutation principles for the change analysis of stochastic processes under strong invariance. J Comput Appl Math 186(1):64–88
Article MathSciNet MATH Google Scholar
Krämer W, Sonnberger H (1986) The linear regression model under test. Physica, Heidelberg
Book Google Scholar
Krämer W, Ploberger W, Alt R (1988) Testing for structural change in dynamic models. Econometrica 56(6):1355–1369
Article MathSciNet MATH Google Scholar
Lausen B, Schumacher M (1992) Maximally selected rank statistics. Biometrics 48:73–85
Article Google Scholar
Ludbrook J, Dudley H (1998) Why permutation tests are superior to $t$ and $F$ tests in biomedical research. Am Stat 52(2):127–132
Google Scholar
Luger R (2006) Exact permutation tests for non-nested non-linear regression models. J Econ 133:513–529
Article MathSciNet Google Scholar
McCullagh P (2005) Exchangeability and regression models. In: Davison AC, Dodge Y, Wermuth N (eds) Celebrating statistics—papers in honour of Sir David Cox on his 80th birthday. Oxford University Press, Oxford, pp 89–115
Google Scholar
Pesarin F (2001) Multivariate permutation tests: with applications to biostatistics. Wiley, Chichester
Google Scholar
Piehl AM, Cooper SJ, Braga AA, Kennedy DM (2003) Testing for structural breaks in the evaluation of programs. Rev Econ Stat 85(3):550–558
Article Google Scholar
Pitman EJG (1938) Significance tests which may be applied to samples from any populations: III. The analysis of variance test. Biometrika 29:322–335
MATH Google Scholar
Quandt RE (1960) Tests of the hypothesis that a linear regression obeys two separate regimes. J Am Stat Assoc 55:324–330
Article MathSciNet MATH Google Scholar
R Development Core Team (2012) R: a language and environment for statistical computing. R Foundation for Statistical Computing, Vienna, Austria, http://www.R-project.org/, ISBN 3-900051-07-0
Stahl C, Burban B, Goret JY, Bonal D (2011) Seasonal variations in stem CO$_2$ efflux in the neotropical rainforest of French Guiana. Ann For Sci 68(4):771–782
Article Google Scholar
Stock JH, Watson MW (1996) Evidence on structural instability in macroeconomic time series relations. J Bus Econ Stat 14:11–30
Google Scholar
Stock JH, Watson MW (2007) Introduction to econometrics, 2nd edn. Addison-Wesley, Reading
Google Scholar
Strasser H, Weber C (1999) On the asymptotic theory of permutation statistics. Math Methods Stat 8:220–250. Preprint available from http://epub.wu.ac.at/102/
Strobl C, Boulesteix AL, Augustin T (2007) Unbiased split selection for classification trees based on the Gini index. Comput Stat Data Anal 52:483–501
Article MathSciNet MATH Google Scholar
Zeileis A (2005) A unified approach to structural change tests based on ML scores, $F$ statistics, and OLS residuals. Econ Rev 24(4):445–466
Article MathSciNet MATH Google Scholar
Zeileis A, Leisch F, Hornik K, Kleiber C (2002) strucchange: An R package for testing for structural change in linear regression models. J Stat Softw 7(2):1–38, http://www.jstatsoft.org/v07/i02/
Zeileis A, Hothorn T, Hornik K (2008) Model-based recursive partitioning. J Comput Graph Stat 17(2):492–514
Article MathSciNet Google Scholar

Download references

Acknowledgments

The coarse woody debris respiration data were kindly provided by Lucy Rowland (School of GeoSciences, University of Edinburgh).

Author information

Authors and Affiliations

Department of Statistics, Faculty of Economics and Statistics, Universität Innsbruck, Universitätsstraße 15, 6020 , Innsbruck, Austria
Achim Zeileis
Institut für Sozial- und Präventivmedizin, Abteilung Biostatistik, Universität Zürich, Hirschengraben 84, 8001 , Zürich, Switzerland
Torsten Hothorn

Authors

Achim Zeileis
View author publications
You can also search for this author in PubMed Google Scholar
Torsten Hothorn
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Achim Zeileis.

Appendix

1.1 Proofs

The (asymptotic) distribution of the multivariate statistic $Z = (Z_{\pi _1}, \dots , Z_{\pi _m})^\top $ is derived by embedding the statistic into the framework of Strasser and Weber (1999), as discussed in Hothorn et al. (2006a). More precisely, the test statistic $\max _{\pi \in \Pi } Z_\pi $ considered above is the maximum-type statistic $c_{\max }$ of Hothorn et al. (2006a) if the influence function $h(Y) = Y$ is used for the observations $Y_i$ and the transformation $g(t) = (\mathbf{1 }_{[0, \pi _1]}(t), \dots , \mathbf{1 }_{[0, \pi _m]}(t))^\top $ is used for the associated timings $t_i$. The transformation $g$ used the indicator function $\mathbf{1 }_I$ of the interval $I$ and thus corresponds to a vector of indicators for the time up to the timings $\pi _j$ ($j = 1, \dots , k$).

Using these transformations $h(\cdot )$ and $g(\cdot )$, the unstandardized test statistic $T$ is in the notation of Hothorn et al. (2006a)

$$\begin{aligned} T \, = \, \text{ vec} \left( \sum _{i = 1}^n g(t_i) h(Y_i)^\top \right) \, = \, \left( n_{1, \pi _1} \bar{Y}_{1, \pi _1}, \dots , n_{1, \pi _m} \bar{Y}_{1, \pi _m} \right)^\top . \end{aligned}$$

(8)

Under $H_0$, given all permutations $\sigma \in S$ of the observations $Y_1, \dots , Y_n$, the unstandardized statistic has expectation

$$\begin{aligned} \text{ E}_\sigma [T] \, = \, \text{ vec} \left( \left( \sum _{i = 1}^n g(t_i) \right) n^{-1} \sum _{i = 1}^n h(Y_i)^\top \right) \, = \, \left( n_{1, \pi _1}, \dots , n_{1, \pi _m} \right)^\top \bar{Y} \end{aligned}$$

(9)

and each unstandardized statistic has variance

$$\begin{aligned} \text{ Var}_\sigma [T_\pi ] \, = \, \left( n_{1, \pi } - \frac{n_{1, \pi }^2}{n} \right) \frac{RSS_{0}}{n-1} \, = \, \frac{n_{1, \pi } n_{2, \pi }}{n} \frac{RSS_{0}}{n-1}, \end{aligned}$$

(10)

where the residual sum of squares is $RSS_0 = \sum _{i = 1}^n (Y_i - \bar{Y})^2$. The equations directly follow from Eq. 7 in Strasser and Weber (1999).

Standardizing the vector of raw statistics $T = (T_{\pi _1}, \dots , T_{\pi _m})^\top $ by their respective mean and standard deviation yields the vector of statistics $Z = (Z_{\pi _1}, \dots , Z_{\pi _m})^\top $:

$$\begin{aligned} Z_\pi&= \frac{T_\pi - \text{ E}_\sigma [T_\pi ]}{\sqrt{\text{ Var}_\sigma [T_\pi ]}} \\&= \frac{n_{1, \pi } \bar{Y}_{1, \pi } - n_{1, \pi } \bar{Y}}{\sqrt{\frac{n_{1, \pi } n_{2, \pi }}{n} \frac{RSS_{0}}{n-1}}} \\&= \sqrt{ \frac{n_{1, \pi } n_{2, \pi }}{n} } \frac{\bar{Y}_{1, \pi } - \bar{Y}_{2, \pi }}{\sqrt{ RSS_0 / (n-1) }}, \end{aligned}$$

because of the following simple relationship between $\bar{Y}_{1, \pi }$, $\bar{Y}_{2, \pi }$ and $\bar{Y}$:

$$\begin{aligned} \bar{Y} \, = \, \frac{n_{1, \pi } \bar{Y}_{1, \pi } + n_{2, \pi } \bar{Y}_{2, \pi }}{n}. \end{aligned}$$

Consequently, $Z$ has zero mean and unit variance given all permutations $\sigma \in S$. Similarly, the covariance between two elements of $Z$, $Z_\pi $ and $Z_\tau $ say, is

$$\begin{aligned}&\frac{RSS_0}{n-1} \left( \sum _{i = 1}^n \mathbf{1 }_{[0, \pi ]}(t_i) \mathbf{1 }_{[0, \tau ]}(t_i) \right) - \frac{1}{n} \frac{RSS_0}{n-1} \left( \sum _{i = 1}^n \mathbf{1 }_{[0, \pi ]}(t_i) \right) \left( \sum _{i = 1}^n \mathbf{1 }_{[0, \tau ]}(t_i) \right) \\&\qquad \quad = \frac{n \min (n_{1, \pi }, n_{1, \tau }) - n_{1, \pi } n_{1, \tau }}{n} \frac{RSS_0}{n-1}. \end{aligned}$$

Assuming that $\pi < \tau $ and using the variance computed above, the correlation is thus

$$\begin{aligned} \frac{n_{1, \pi } n_{2, \tau }}{\sqrt{n_{1, \pi } n_{2, \pi } n_{1, \tau } n_{2, \tau }}}. \end{aligned}$$

Given that we derived the first two moments of $Z$ by embedding the statistic into the framework of Strasser and Weber (1999), the asymptotic normality of $Z$ follows by application of their Theorem 2.3.

1.2 Power and size for autocorrelated series

To study the performance of the tests with dependent data, we use a simulation setup as in Sect. 2.3. The only difference is that the errors are now autocorrelated with $\varrho = 0, 0.1, 0.2, 0.3, 0.5, 0.9$. The length of the time series considered is either very short ($n = 10$) or moderate ($n = 50$) and either there is no change ($\delta = 0$) or a large shift in the mean ($\delta = 15$). Five different versions of the tests are assessed: the unconditional and conditional test ($\mathcal D _{\infty }$ and $\mathcal D _{\sigma |Y}$, respectively) on the original data (as in Sect. 2.3), the unconditional test computed with a robust HAC covariance estimate and the unconditional and conditional test computed on the residuals of an AR(1) model (fitted by OLS).

Using the uncorrected tests on the original data ($\mathcal D _{\infty }$ and $\mathcal D _{\sigma |Y}$, respectively), it can be seen that size distortions occur for $\varrho > 0$ even if there is no change ($\delta = 0$). For $\varrho $ up to $0.2$ these are still moderate but become very large afterwards and are even more pronounced for the conditional version of the test. However, in moderately large time series ($n = 50$), the problem can be remedied by either using a HAC correction or applying the tests to the AR(1) residuals. These three tests keep their size and have reasonable power with the conditional test having the highest power. However, none of the tests is able to distinguish between autocorrelation and a shift in the mean (with $\delta = 15$) if the time series is very short ($n = 10$) or if the autocorrelation is very high ($\varrho \ge 0.7$) and the length only moderate ($n = 50$).

Rights and permissions

Reprints and permissions

About this article

Cite this article

Zeileis, A., Hothorn, T. A toolbox of permutation tests for structural change. Stat Papers 54, 931–954 (2013). https://doi.org/10.1007/s00362-013-0503-4

Download citation

Received: 16 June 2012
Revised: 22 December 2012
Published: 09 February 2013
Issue Date: November 2013
DOI: https://doi.org/10.1007/s00362-013-0503-4

Keywords

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

A toolbox of permutation tests for structural change

Abstract

Access this article

Similar content being viewed by others

Permutation Tests in Linear Regression

Permutation tests for experimental data

Permutation Tests Using Arbitrary Permutation Distributions

References

Acknowledgments

Author information

Authors and Affiliations

Corresponding author

Appendix

1.1 Proofs

1.2 Power and size for autocorrelated series

Rights and permissions

About this article

Cite this article

Keywords

Navigation

A toolbox of permutation tests for structural change

Abstract

Access this article

Similar content being viewed by others

Permutation Tests in Linear Regression

Permutation tests for experimental data

Permutation Tests Using Arbitrary Permutation Distributions

References

Acknowledgments

Author information

Authors and Affiliations

Corresponding author

Appendix

Appendix

1.1 Proofs

1.2 Power and size for autocorrelated series

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Search

Navigation