Abstract
A class of parametric models for locally stationary processes is introduced. The class depends on a power parameter that applies to the time-varying spectrum so that it can be locally represented by a (finite low dimensional) Fourier polynomial. The coefficients of the polynomial have an interpretation as time-varying autocovariances, whose dynamics are determined by a linear combination of smooth transition functions, depending on some static parameters. Frequency domain estimation is based on the generalized Whittle likelihood and the pre-periodogram, while model selection is performed through information criteria. Change points are identified via a sequence of score tests. Consistency and asymptotic normality are proved for the parametric estimators considered in the paper, under weak assumptions on the time-varying parameters.
This is a preview of subscription content, access via your institution.
Buying options
Tax calculation will be finalised at checkout
Purchases are for personal use only
Learn about institutional subscriptionsReferences
Adak, S. (1998). Time-dependent spectral analysis of nonstationary time series. Journal of the American Statistical Association 93 1488–1501.
Amado, C. and Teräsvirta, T. (2013). Modelling volatility by variance decomposition. Journal of Econometrics 175 142–153.
Barndorff-Nielsen, O. and Schou, G. (1973). On the parametrization of autoregressive models by partial autocorrelations. Journal of Multivariate Analysis 3 408–419.
Bloomfield, P. (1973). An exponential model for the spectrum of a scalar time series. Biometrika 60 217–226.
Bogert, B. P., Healy, M. J. R. and Tukey, J. W. (1963). The frequency analysis of time series for echoes: cepstrum, pseudo-autocovariance, cross-cepstrum, and saphe cracking. Proceedings of the Symposium on Time Series Analysis, Wiley, New York. Chap 15, 209–243.
Box, G. E. and Cox, D. R. (1964). An analysis of transformations. Journal of the Royal Statistical Society: Series B 26 211–243.
Brockwell, P. and Davis, R. (1991). Time Series: Theory and Methods. Springer Series in Statistics, Springer.
Cressie, N. (1993). Statistics for Spatial Data. Wiley Series in Probability and Statistics, Wiley.
Dahlhaus, R. (1996). Maximum likelihood estimation and model selection for locally stationary processes. Journal of Nonparametric Statistics 6 171–191.
Dahlhaus, R. (1997). Fitting Time Series Models to Nonstationary Processes. The Annals of Statistics 25 1–37.
Dahlhaus, R. (2000). A likelihood approximation for locally stationary processes. The Annals of Statistics 28 1762–1794.
Dahlhaus, R. (2012). Locally stationary processes. 351–408.
Dahlhaus, R. and Polonik, W. (2006). Nonparametric quasi-maximum likelihood estimation for gaussian locally stationary processes. Ann Statist 34 2790–2824.
Dahlhaus, R. and Polonik, W. (2009). Empirical spectral processes for locally stationary time series. Bernoulli 15 1–39.
Dahlhaus, R. and Subba-Rao, S. (2006). Statistical inference for time-varying ARCH processes. The Annals of Statistics 34 1075–1114.
Davis, R. A., Lee, T. C. M. and Rodriguez-Yam, G. A. (2006). Structural break estimation for nonstationary time series models. Journal of the American Statistical Association 101 223–239.
Dette, H., Preuss, P. and Vetter, M. (2011). A measure of stationarity in locally stationary processes with applications to testing. Journal of the American Statistical Association 106 1113–1124.
Durbin, J. (1960). The fitting of time-series models. Revue de l’Institut International de Statistique 28 233–244.
van de Geer, S. (1993). Hellinger-consistency of certain nonparametric maximum likelihood estimators. The Annals of Statistics 21 14–44.
van de Geer, S. (2000). Empirical Processes in M-Estimation. Cambridge Series in Statistical and Probabilistic Mathematics, Cambridge University Press.
González, A. and Teräsvirta, T. (2008). Modelling autoregressive processes with a shifting mean. Studies in Nonlinear Dynamics & Econometrics 12.
Gould, H. W. (1974). Coefficient identities for powers of taylor and dirichlet series. The American Mathematical Monthly 81 3–14.
Grenander, U. and Szegö, G. (1958). Toeplitz forms and their applications. Univ of California Press.
Guo, D. M. O. H.W. and von Sachs, R. (2003). Smoothing spline anova for time-dependent spectral analysis. Journal of the American Statistical Association 98 643–652.
Krampe, J., Kreiss, J. P. and Paparoditis, E. (2018). Estimated Wold Representation and Spectral Density Driven Bootstrap for Time Series. Journal of the Royal Statistical Society, Series B 80 703–726.
Kreiss, J.-P. and Paparoditis, E. (2015). Bootstrapping locally stationary processes. Journal of the Royal Statistical Society Series B 77 267–290.
Levinson, N. (1946). The Wiener (root mean square) error criterion in filter design and prediction. Studies in Applied Mathematics 25 261–278.
Liu, Y., Xue, Y. and Taniguchi, M. (2020). Robust linear interpolation and extrapolation of stationary time series in \(L^p\). Journal of Time Series Analysis 41 229–248.
Liu, Y., Taniguchi, M. and Ombao, H. (2021). Statistical inference for local Granger causality. arXiv: arxiv.org/abs/2103.00209.
Monahan, J. (1984). A note enforcing stationarity in autoregressive-moving average models. Biometrika 71 403–404.
Nason, G. P. (2013). A test for second-order stationarity and approximate confidence intervals for localized autocovariances for locally stationary time series. Journal of the Royal Statistical Society Series B 75 879–904.
Nason, G. P., von Sachs, R. and Kroisandt, G. (2000). Wavelet processes and adaptive estimation of the evolutionary wavelet spectrum. Journal of the Royal Statistical Society Series B 62 271–292.
Neumann, M. H. and von Sachs, R. (1997). Wavelet thresholding in anisotropic function classes and applications to adaptive estimation of evolutionary spectra. The Annals of Statistics 25 38–76.
Ombao, H. C., Raz, J. A., von Sachs, R. and Malow, B. A. (2001). Automatic statistical analysis of bivariate nonstationary time series. Journal of the American Statistical Association 96 543–560.
Ombao, H. C., Von Sachs, R. and Guo, W. (2005). Slex analysis of multivariate nonstationary time series. Journal of the American Statistical Association 100 519–531.
Pourahmadi, M. (1984). Taylor expansion of and some applications. The American Mathematical Monthly 91 303–307.
Priestley, M. B. (1965). Evolutionary spectra and non-stationary processes. 204–237. .
Proietti, T. and Luati, A. (2015). The generalised autocovariance function. Journal of Econometrics 186 245–257.
Proietti, T. and Luati, A. (2019). Generalized linear cepstral models for the spectrum of a time series. Statistica Sinica 29 1561–1583.
Qin, L. and Wang, Y. (2008). Nonparametric spectral analysis with applications to seizure characterization using eeg time series. 1432–1451. .
Rosen, O., Stoffer, D. and Wood, S. (2009). Local spectral analysis via a Bayesian mixture of smoothing splines. Journal of the American Statistical Association 104 249–262.
Rosen, O., Wood, S. and Stoffer, D. (2012). Adaptspec: Adaptive spectral estimation for nonstationary time series. Journal of the American Statistical Association 107 1575–1589.
Sakiyama, K. and Taniguchi, M. (2003). Testing composite hypotheses for locally stationary processes. Journal of Time Series Analysis 24 483–504.
Silverman, R. (1957). Locally stationary random processes. IRE Transactions on Information Theory 3 182–187.
Taniguchi, M. (1980). On estimation of the integrals of certain functions of spectral density. Journal of Applied Probability 17 73–83.
West, M., Prado, R. and Krystal, A. D. (1999). Evaluation and comparison of eeg traces: Latent structure in nonstationary time series. Journal of the American Statistical Association 94 375–387.
White, H. (2006). Approximate nonlinear forecasting methods. Handbook of economic forecasting 1 459–512.
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Appendices
13.6 Proofs
Proof of Theorem 13.1
The proofs of consistency and asymptotic normality of the estimator \(\hat{\theta }_n\) are based on results by [14] on the empirical spectral process
evaluated at \(\phi = f_\theta (\cdot ,\omega )^{-1}\) and at \(\phi = \frac{\partial }{\partial \theta } f_\theta (\cdot ,\omega )^{-1} \), respectively, where
is a spectral measure and
is an empirical spectral measure based on the pre-periodogram, defined in Eq. (13.17).
Under the assumptions of Theorem 13.1, almost sure consistency of \(\hat{\theta }_n\) requires the uniform convergence of \(\ell _n(\theta )\) to \(\ell (\theta )\), defined in Eqs. (13.19) and (13.20), respectively,
Note that ([14, Example 3.1])
where \(F,F_n\) are defined above and
Hence, if \(\sup _{\theta \in \Theta }|R_{\log }(f_\theta )|\) is small, the convergence of \(\ell _n(\theta )-\ell (\theta )\) may be controlled by \(E_n(1/f_\theta )\), since, by Assumptions 13.1 and 13.2, we obtain
Note that by Assumptions 13.1 and 13.2, we obtain that the empirical spectral process \(E_n(1/f_\theta )\) has bounded variation in both the components of \(f_\theta (u, \omega )\), see also the discussion in [14]. Then, the uniform convergence of \(\ell _n(\theta )-\ell (\theta )\) could be easily obtained by Glivenko–Cantelli-type convergence results, see, for example, [14, Theorem 2.12], which ensures that we have
To show that this result is valid also in our case, it suffices to verify [15, Assumption 2.4 (c)].
First, we note that by construction the reciprocal \(1/(f_\theta (u,\omega ))\) is an element of \(\mathcal {F}_\lambda \) (when \(\lambda =-1\)) and the functions in \(\mathcal {F}_\lambda \) are not indexed by n. Moreover, the implied spectral density \(f_\theta (u,\omega )\in \mathcal {F}_\lambda \) is bounded away from zero.
Let us consider
By Kolmogorov’s formula ([7, Theorem 5.8.1]) we have
We define the class of functions
and note that for uniformly bounded monotonic functions, the metric entropy is bounded
for some \(\epsilon > 0\), where \(C_2\) is a constant, see [19]. Moreover, \(\exists \,\, 0< M_{\star } \le 1 \le M^{\star } < \infty \,, \, M_{\star } \le f_\theta (u,\omega ) \le M^{\star } \,\,, \forall \,\, u,\omega \,\, \text {and} \,\, f_\theta (u,\omega ) \in \mathcal {F}_\lambda \). However, we do not need the lower bound \(M_{\star }\) and then we may set \(M_{\star } = 1\).
In addition, to ensure that
we choose
such that \(\mathcal {B}_{\lambda } \in \mathcal {W}_{2}^{\alpha }\), that is, the Sobolev class of functions with \(m\in \mathbb {N}\) smoothness parameter, such that the first \(m \ge 1\) derivatives exist with finite \(L_2\)-norm, see [20]. For this class of functions, the metric entropy can be bounded by
for some \(A>0\) and \(\forall \epsilon > 0\). It follows that the metric entropy of the class of the time-varying sdf is bounded by
As a result, all the conditions of [13][Assumption 2.4 (c)] are easily satisfied, and hence we apply [13, Lemma A.2] to conclude that
which implies \(\sup _{\theta \in \Theta }|R_{\log }(f_\theta )| \rightarrow 0\) as \(n \rightarrow \infty \).
Moreover, by [13, Theorem 2.6], we obtain the convergence of \(\rho _2 (1/\hat{f}_\theta - 1/\hat{f}_\theta )\) by setting \(\gamma = (1+1/m)\), so that
In conclusion, by compactness of \(\Theta \) and the uniqueness of \(\theta _0\) implied by Assumptions 13.3 and 13.4, we obtain the claimed convergence \(\hat{\theta }_n \rightarrow _p \theta _0\), which concludes the proof. \(\square \)
13.7 Proof of Theorem 13.2
Before presenting the proof of Theorem 13.2, we need to check the smoothness conditions given in [9, Assumption 2.1].
First, we note that
where the \((q+1)\)-vector \(\Psi (u)\) is defined as
with \(\Psi (u, \gamma _i, \tau _i)= [1+\exp \{ - \gamma _i(u-\tau _i) \}]^{-1}\) for \(i = 1, \dots , q\). Moreover \(\theta _\sigma = (\theta _{00}, \dots , \theta _{0q})^\prime \) and define \(\boldsymbol{\gamma }=(\gamma _1, \dots , \gamma _q)^\prime \).
Therefore, the first derivatives of \(\sigma ^2_\lambda (u)\) with respect to \(\theta _\sigma \) and \(\boldsymbol{\gamma }\) are
where
and
with \(\boldsymbol{1}_{q+1}\) being a \((q+1)\)-vector of ones, \(\boldsymbol{u} = (0, u, \dots , u)^\prime \in [0, 1]^{q+1}\) and \(\boldsymbol{\tau } = (0, \tau _1, \dots , \tau _{q})^\prime \).
The second derivatives of \(\sigma ^2_\lambda (u)\) with respect to \(\theta _\sigma \) and \(\boldsymbol{\gamma }\) are
where
and
Furthermore, we require the first and second derivatives of \(\beta _\lambda (u, \omega )\) with respect to \(\theta _\sigma \) and \( \boldsymbol{\gamma }\). However, we recall that for estimation purposes we adopt the parametrization \(\beta _\lambda (u, k)\) for \(k=1,\dots ,K\) and each \(\beta _\lambda (u, k)\) from the last iteration of the Durbin–Levinson recursion. As a result, we have \(\beta _\lambda (u, k) = \zeta _\lambda (u, k)\), where \(\zeta _\lambda (u, k)\) denotes the generalized partial autocorrelation coefficients, and we set, for \(k=1, \dots , K\),
Thus, it suffices to derive \(\zeta _\lambda (u, k)\) with respect to \(\theta _\sigma \) and \(\boldsymbol{\gamma }\). Then we have
where
and
The second derivatives of \(\zeta _\lambda (u, k)\) with respect to \(\theta _\sigma \) and \(\boldsymbol{\gamma }\) are
where
and finally
Then, we conclude that all the derivatives derived above reveal that the first and second derivatives \(\nabla \sigma _\lambda ^2(u)\), \(\nabla ^2 \sigma _\lambda ^2(u)\), \(\nabla \beta _\lambda (u, \omega )\), and \(\nabla ^2 \beta _\lambda (u, \omega )\) satisfy all the smoothness conditions given in [9, Assumption 2.1]. Therefore, we can prove the asymptotic normality of our estimator in (13.18) by checking the remaining relevant conditions.
Our estimator solves the likelihood equations \(\nabla \ell _n(\hat{\theta }_n ) = 0\). Hence the mean value theorem gives
where \(|\theta _n^\star - \theta _0| \le |\hat{\theta }_n - \theta _0|\) and, since \(\hat{\theta }_n \in \Theta \), \(\nabla \ell _n(\hat{\theta }_n ) = 0\). As we have proved in Theorem 1, Assumptions 13.1–13.4 ensure the consistency of \(\hat{\theta }_n\), and thus a central limit theorem for \(\sqrt{n}(\hat{\theta }_n - \theta _0)\) could be obtained by following analogous arguments discussed in the proof of [9, Theorem 2.4]. Then, the result follows by proving that
-
1.
\(\sqrt{n}\nabla \ell _n(\theta _0) \rightarrow _d N(\boldsymbol{0}, \Gamma )\),
-
2.
\(\nabla ^2 \ell _n(\theta _n^\star ) - \nabla ^2 \ell _n(\theta _0) \rightarrow _p 0\), and
-
3.
\(\nabla ^2 \ell _n(\theta _0) \rightarrow _p \mathbb {E}[\nabla ^2 \ell _n(\theta _0)]\) where \(\mathbb {E}[\nabla ^2 \ell _n(\theta _0)] = V\).
However, as in [9, Remark 2.6], if the model is correctly specified we have \( \Gamma =V\), where
As far as convergence in distribution of the score vector in the item Assumption 13.7, we first prove the equicontinuity of \(\nabla \ell _n(\theta )\). We use similar arguments as those used in the proof of Theorem 13.1. From Eq. (13.21), one has
By Leibniz’s rule, it is easy to see that the preceding equation is equivalent to
where \(R_{\log } (f_\theta )\), defined in (13.25), was proved to be asymptotically negligible. The same bounds obtained in the proof of Theorem 13.1 can be applied to the class of functions
such that, by using Assumptions 13.1 and 13.2, we get Glivenko–Cantelli-type convergence result
which implies the equicontinuity in probability of \(\nabla \ell _n(\theta ) \). Thus,
Clearly, since by Assumptions 13.3 and 13.4 the parameter space \(\Theta \) is compact with a unique minimum at \(\theta _0\in \Theta \), we have \(\nabla \ell (\theta _0) = 0\), and so, evaluating Eq. (13.21) at the true value \(\theta _0\) is equivalent to write \(\nabla \ell _n(\theta _0)\) as an empirical process of the form
Note also that, under the assumption that the model is correctly specified, it holds that \(\mathbb {E}[\nabla \ell _n(\theta _0) ] = 0\), and therefore, we obtain
since all the required Assumptions in [9, Lemma A.5] are fulfilled.
Now we focus on the item Assumption 13.7, which is satisfied if \(\nabla ^2 \ell _n(\theta )\) is equicontinuous. From Eq. (13.22) and the results obtained in the proof of condition Assumption 13.7, one can express \(\nabla ^2 \ell _n(\theta )\) as
where
Therefore, by similar arguments as above, under Assumptions 13.1 and 13.2, we can show the uniform convergence
which further implies the equicontinuity in probability of \(\nabla ^2 \ell _n(\theta )\). Indeed, the second term in (13.26) converges to its expectation as \(n \rightarrow \infty \), and, hence,
The proof of the item Assumption 13.7 follows directly, since, under the maintained Assumptions, Theorem 13.1 implies that the second term of Eq. (13.26) tends to V in probability for \(\theta \rightarrow \theta _0\). To conclude, all the relevant conditions stated in [9, Theorem 2.4] are fulfilled and then the asymptotic normality follows, see also [11, Theorem 3.1]. \(\square \)
Rights and permissions
Copyright information
© 2023 The Author(s), under exclusive license to Springer Nature Singapore Pte Ltd.
About this chapter
Cite this chapter
Proietti, T., Luati, A., D’Innocenzo, E. (2023). Generalized Linear Spectral Models for Locally Stationary Processes. In: Liu, Y., Hirukawa, J., Kakizawa, Y. (eds) Research Papers in Statistical Inference for Time Series and Related Models. Springer, Singapore. https://doi.org/10.1007/978-981-99-0803-5_13
Download citation
DOI: https://doi.org/10.1007/978-981-99-0803-5_13
Published:
Publisher Name: Springer, Singapore
Print ISBN: 978-981-99-0802-8
Online ISBN: 978-981-99-0803-5
eBook Packages: Mathematics and StatisticsMathematics and Statistics (R0)