Skip to main content
Log in

A toolbox of permutation tests for structural change

  • Regular Article
  • Published:
Statistical Papers Aims and scope Submit manuscript

Abstract

The sup\(LM\) test for structural change is embedded into a permutation test framework for a simple location model. The resulting conditional permutation distribution is compared to the usual (unconditional) asymptotic distribution, showing that the power of the test can be clearly improved in small samples. Furthermore, the permutation test is embedded into a general framework that encompasses tools for binary and multivariate dependent variables as well as model-based permutation testing for structural change. It is also demonstrated that the methods can not only be employed for analyzing structural changes in time series data but also for recursive partitioning of cross-section data. The procedures suggested are illustrated using both artificial data and empirical applications (number of youth homicides, employment discrimination data, carbon flux in tropical forests, stock returns, and demand for economics journals).

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig. 3
Fig. 4
Fig. 5
Fig. 6
Fig. 7

Similar content being viewed by others

References

  • Andrews DWK (1991) Heteroskedasticity and autocorrelation consistent covariance matrix estimation. Econometrica 59:817–858

    Article  MathSciNet  MATH  Google Scholar 

  • Andrews DWK (1993) Tests for parameter instability and structural change with unknown change point. Econometrica 61:821–856

    Article  MathSciNet  MATH  Google Scholar 

  • Antoch J, Hušková M (2001) Permutation tests in change point analysis. Stat Probab Lett 53:37–46

    Article  MATH  Google Scholar 

  • Bergstrom TC (2001) Free labor for costly journals? J Econ Perspect 15:183–198

    Article  Google Scholar 

  • Bonal D, Bosc A, Ponton S, Goret JY, Burban B, Gross P, Bonnefond JM, Elbers J, Longdoz B, Epron D, Guehl JM, Granier A (2008) Impact of severe dry season on net ecosystem exchange in the neotropical rainforest of French Guiana. Glob Chang Biol 14(8):1917–1933

    Article  Google Scholar 

  • Boulesteix AL, Strobl C (2007) Maximally selected chi-squared statistics and non-monotonic associations: an exact approach based on two cutpoints. Comput Stat Data Anal 51(12):6295–6306

    Article  MathSciNet  MATH  Google Scholar 

  • Brown RL, Durbin J, Evans JM (1975) Techniques for testing the constancy of regression relationships over time. J R Stat Soc B 37:149–163

    MathSciNet  MATH  Google Scholar 

  • Chambers JQ, Schimel JP, Nobre AD (2001) Respiration from coarse wood litter in central Amazon forests. Biogeochemistry 52(2):115–131

    Article  Google Scholar 

  • Chow GC (1960) Tests of equality between sets of coefficients in two linear regressions. Econometrica 28:591–605

    Article  MathSciNet  MATH  Google Scholar 

  • Ernst MD (2004) Permutation methods: a basis for exact inference. Stat Sci 19(4):676–685

    Article  MathSciNet  MATH  Google Scholar 

  • Fisher RA (1935) The design of experiments. Oliver and Boyd, Edinburgh

    Google Scholar 

  • Freidlin B, Gastwirth JL (2000) Changepoint tests designed for the analysis of hiring data arising in employment discrimination cases. J Bus Econ Stat 18(3):315–322

    Google Scholar 

  • Genz A (1992) Numerical computation of multivariate normal probabilities. J Comput Graph Stat 1:141–149

    Google Scholar 

  • Hansen BE (1997) Approximate asymptotic \(p\) values for structural-change tests. J Bus Econ Stat 15:60–67

    Google Scholar 

  • Hansen BE (2000) Testing for structural change in conditional models. J Econ 97:93–115

    Article  MATH  Google Scholar 

  • Hastie T, Tibshirani R, Friedman J (2009) The elements of statistical learning, 2nd edn. Springer, New York

    Book  MATH  Google Scholar 

  • Hothorn T, Lausen B (2003) On the exact distribution of maximally selected rank statistics. Comput Stat Data Anal 43(2):121–137

    Article  MathSciNet  MATH  Google Scholar 

  • Hothorn T, Zeileis A (2008) Generalized maximally selected statistics. Biometrics 64:1263–1269

    Article  MathSciNet  MATH  Google Scholar 

  • Hothorn T, Hornik K, van de Wiel MA, Zeileis A (2006a) A Lego system for conditional inference. Am Stat 60(3):257–263

    Article  Google Scholar 

  • Hothorn T, Hornik K, Zeileis A (2006b) Unbiased recursive partitioning: a conditional inference framework. J Comput Graph Stat 15(3):651–674

    Article  MathSciNet  Google Scholar 

  • Hothorn T, Hornik K, van de Wiel MA, Zeileis A (2008) Implementing a class of permutation tests: the coin package. J Stat Softw 28(8):1–23. http://www.jstatsoft.org/v28/i08/

    Google Scholar 

  • Hsu DA (1979) Detecting shifts of parameter in gamma sequences with applications to stock price and air traffic flow analysis. J Am Stat Assoc 74:31–40

    Article  Google Scholar 

  • Kennedy PE (1995) Randomization tests in econometrics. J Bus Econ Stat 13(1):85–94

    Google Scholar 

  • Kirch C (2007) Block permutation principles for the change analysis of dependent data. J Stat Plan Inference 137:2453–2474

    Article  MathSciNet  MATH  Google Scholar 

  • Kirch C, Steinebach J (2006) Permutation principles for the change analysis of stochastic processes under strong invariance. J Comput Appl Math 186(1):64–88

    Article  MathSciNet  MATH  Google Scholar 

  • Krämer W, Sonnberger H (1986) The linear regression model under test. Physica, Heidelberg

    Book  Google Scholar 

  • Krämer W, Ploberger W, Alt R (1988) Testing for structural change in dynamic models. Econometrica 56(6):1355–1369

    Article  MathSciNet  MATH  Google Scholar 

  • Lausen B, Schumacher M (1992) Maximally selected rank statistics. Biometrics 48:73–85

    Article  Google Scholar 

  • Ludbrook J, Dudley H (1998) Why permutation tests are superior to \(t\) and \(F\) tests in biomedical research. Am Stat 52(2):127–132

    Google Scholar 

  • Luger R (2006) Exact permutation tests for non-nested non-linear regression models. J Econ 133:513–529

    Article  MathSciNet  Google Scholar 

  • McCullagh P (2005) Exchangeability and regression models. In: Davison AC, Dodge Y, Wermuth N (eds) Celebrating statistics—papers in honour of Sir David Cox on his 80th birthday. Oxford University Press, Oxford, pp 89–115

    Google Scholar 

  • Pesarin F (2001) Multivariate permutation tests: with applications to biostatistics. Wiley, Chichester

    Google Scholar 

  • Piehl AM, Cooper SJ, Braga AA, Kennedy DM (2003) Testing for structural breaks in the evaluation of programs. Rev Econ Stat 85(3):550–558

    Article  Google Scholar 

  • Pitman EJG (1938) Significance tests which may be applied to samples from any populations: III. The analysis of variance test. Biometrika 29:322–335

    MATH  Google Scholar 

  • Quandt RE (1960) Tests of the hypothesis that a linear regression obeys two separate regimes. J Am Stat Assoc 55:324–330

    Article  MathSciNet  MATH  Google Scholar 

  • R Development Core Team (2012) R: a language and environment for statistical computing. R Foundation for Statistical Computing, Vienna, Austria, http://www.R-project.org/, ISBN 3-900051-07-0

  • Stahl C, Burban B, Goret JY, Bonal D (2011) Seasonal variations in stem CO\(_2\) efflux in the neotropical rainforest of French Guiana. Ann For Sci 68(4):771–782

    Article  Google Scholar 

  • Stock JH, Watson MW (1996) Evidence on structural instability in macroeconomic time series relations. J Bus Econ Stat 14:11–30

    Google Scholar 

  • Stock JH, Watson MW (2007) Introduction to econometrics, 2nd edn. Addison-Wesley, Reading

    Google Scholar 

  • Strasser H, Weber C (1999) On the asymptotic theory of permutation statistics. Math Methods Stat 8:220–250. Preprint available from http://epub.wu.ac.at/102/

  • Strobl C, Boulesteix AL, Augustin T (2007) Unbiased split selection for classification trees based on the Gini index. Comput Stat Data Anal 52:483–501

    Article  MathSciNet  MATH  Google Scholar 

  • Zeileis A (2005) A unified approach to structural change tests based on ML scores, \(F\) statistics, and OLS residuals. Econ Rev 24(4):445–466

    Article  MathSciNet  MATH  Google Scholar 

  • Zeileis A, Leisch F, Hornik K, Kleiber C (2002) strucchange: An R package for testing for structural change in linear regression models. J Stat Softw 7(2):1–38, http://www.jstatsoft.org/v07/i02/

  • Zeileis A, Hothorn T, Hornik K (2008) Model-based recursive partitioning. J Comput Graph Stat 17(2):492–514

    Article  MathSciNet  Google Scholar 

Download references

Acknowledgments

The coarse woody debris respiration data were kindly provided by Lucy Rowland (School of GeoSciences, University of Edinburgh).

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Achim Zeileis.

Appendix

Appendix

1.1 Proofs

The (asymptotic) distribution of the multivariate statistic \(Z = (Z_{\pi _1}, \dots , Z_{\pi _m})^\top \) is derived by embedding the statistic into the framework of Strasser and Weber (1999), as discussed in Hothorn et al. (2006a). More precisely, the test statistic \(\max _{\pi \in \Pi } Z_\pi \) considered above is the maximum-type statistic \(c_{\max }\) of Hothorn et al. (2006a) if the influence function \(h(Y) = Y\) is used for the observations \(Y_i\) and the transformation \(g(t) = (\mathbf{1 }_{[0, \pi _1]}(t), \dots , \mathbf{1 }_{[0, \pi _m]}(t))^\top \) is used for the associated timings \(t_i\). The transformation \(g\) used the indicator function \(\mathbf{1 }_I\) of the interval \(I\) and thus corresponds to a vector of indicators for the time up to the timings \(\pi _j\) (\(j = 1, \dots , k\)).

Using these transformations \(h(\cdot )\) and \(g(\cdot )\), the unstandardized test statistic \(T\) is in the notation of Hothorn et al. (2006a)

$$\begin{aligned} T \, = \, \text{ vec} \left( \sum _{i = 1}^n g(t_i) h(Y_i)^\top \right) \, = \, \left( n_{1, \pi _1} \bar{Y}_{1, \pi _1}, \dots , n_{1, \pi _m} \bar{Y}_{1, \pi _m} \right)^\top . \end{aligned}$$
(8)

Under \(H_0\), given all permutations \(\sigma \in S\) of the observations \(Y_1, \dots , Y_n\), the unstandardized statistic has expectation

$$\begin{aligned} \text{ E}_\sigma [T] \, = \, \text{ vec} \left( \left( \sum _{i = 1}^n g(t_i) \right) n^{-1} \sum _{i = 1}^n h(Y_i)^\top \right) \, = \, \left( n_{1, \pi _1}, \dots , n_{1, \pi _m} \right)^\top \bar{Y} \end{aligned}$$
(9)

and each unstandardized statistic has variance

$$\begin{aligned} \text{ Var}_\sigma [T_\pi ] \, = \, \left( n_{1, \pi } - \frac{n_{1, \pi }^2}{n} \right) \frac{RSS_{0}}{n-1} \, = \, \frac{n_{1, \pi } n_{2, \pi }}{n} \frac{RSS_{0}}{n-1}, \end{aligned}$$
(10)

where the residual sum of squares is \(RSS_0 = \sum _{i = 1}^n (Y_i - \bar{Y})^2\). The equations directly follow from Eq. 7 in Strasser and Weber (1999).

Standardizing the vector of raw statistics \(T = (T_{\pi _1}, \dots , T_{\pi _m})^\top \) by their respective mean and standard deviation yields the vector of statistics \(Z = (Z_{\pi _1}, \dots , Z_{\pi _m})^\top \):

$$\begin{aligned} Z_\pi&= \frac{T_\pi - \text{ E}_\sigma [T_\pi ]}{\sqrt{\text{ Var}_\sigma [T_\pi ]}} \\&= \frac{n_{1, \pi } \bar{Y}_{1, \pi } - n_{1, \pi } \bar{Y}}{\sqrt{\frac{n_{1, \pi } n_{2, \pi }}{n} \frac{RSS_{0}}{n-1}}} \\&= \sqrt{ \frac{n_{1, \pi } n_{2, \pi }}{n} } \frac{\bar{Y}_{1, \pi } - \bar{Y}_{2, \pi }}{\sqrt{ RSS_0 / (n-1) }}, \end{aligned}$$

because of the following simple relationship between \(\bar{Y}_{1, \pi }\), \(\bar{Y}_{2, \pi }\) and \(\bar{Y}\):

$$\begin{aligned} \bar{Y} \, = \, \frac{n_{1, \pi } \bar{Y}_{1, \pi } + n_{2, \pi } \bar{Y}_{2, \pi }}{n}. \end{aligned}$$

Consequently, \(Z\) has zero mean and unit variance given all permutations \(\sigma \in S\). Similarly, the covariance between two elements of \(Z\), \(Z_\pi \) and \(Z_\tau \) say, is

$$\begin{aligned}&\frac{RSS_0}{n-1} \left( \sum _{i = 1}^n \mathbf{1 }_{[0, \pi ]}(t_i) \mathbf{1 }_{[0, \tau ]}(t_i) \right) - \frac{1}{n} \frac{RSS_0}{n-1} \left( \sum _{i = 1}^n \mathbf{1 }_{[0, \pi ]}(t_i) \right) \left( \sum _{i = 1}^n \mathbf{1 }_{[0, \tau ]}(t_i) \right) \\&\qquad \quad = \frac{n \min (n_{1, \pi }, n_{1, \tau }) - n_{1, \pi } n_{1, \tau }}{n} \frac{RSS_0}{n-1}. \end{aligned}$$

Assuming that \(\pi < \tau \) and using the variance computed above, the correlation is thus

$$\begin{aligned} \frac{n_{1, \pi } n_{2, \tau }}{\sqrt{n_{1, \pi } n_{2, \pi } n_{1, \tau } n_{2, \tau }}}. \end{aligned}$$

Given that we derived the first two moments of \(Z\) by embedding the statistic into the framework of Strasser and Weber (1999), the asymptotic normality of \(Z\) follows by application of their Theorem 2.3.

1.2 Power and size for autocorrelated series

To study the performance of the tests with dependent data, we use a simulation setup as in Sect. 2.3. The only difference is that the errors are now autocorrelated with \(\varrho = 0, 0.1, 0.2, 0.3, 0.5, 0.9\). The length of the time series considered is either very short (\(n = 10\)) or moderate (\(n = 50\)) and either there is no change (\(\delta = 0\)) or a large shift in the mean (\(\delta = 15\)). Five different versions of the tests are assessed: the unconditional and conditional test (\(\mathcal D _{\infty }\) and \(\mathcal D _{\sigma |Y}\), respectively) on the original data (as in Sect. 2.3), the unconditional test computed with a robust HAC covariance estimate and the unconditional and conditional test computed on the residuals of an AR(1) model (fitted by OLS).

Using the uncorrected tests on the original data (\(\mathcal D _{\infty }\) and \(\mathcal D _{\sigma |Y}\), respectively), it can be seen that size distortions occur for \(\varrho > 0\) even if there is no change (\(\delta = 0\)). For \(\varrho \) up to \(0.2\) these are still moderate but become very large afterwards and are even more pronounced for the conditional version of the test. However, in moderately large time series (\(n = 50\)), the problem can be remedied by either using a HAC correction or applying the tests to the AR(1) residuals. These three tests keep their size and have reasonable power with the conditional test having the highest power. However, none of the tests is able to distinguish between autocorrelation and a shift in the mean (with \(\delta = 15\)) if the time series is very short (\(n = 10\)) or if the autocorrelation is very high (\(\varrho \ge 0.7\)) and the length only moderate (\(n = 50\)).

Rights and permissions

Reprints and permissions

About this article

Cite this article

Zeileis, A., Hothorn, T. A toolbox of permutation tests for structural change. Stat Papers 54, 931–954 (2013). https://doi.org/10.1007/s00362-013-0503-4

Download citation

  • Received:

  • Revised:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s00362-013-0503-4

Keywords

Navigation