Abstract
SNP is a method of nonparametric multivariate time series analysis. It employs an expansion in Hermite functions to approximate the conditional density of a multivariate process. An appealing feature of the expansion is that it is a nonlinear nonparametric model that directly nests the Gaussian VAR model, the semiparametric ARCH model, the Gaussian GARCH model, and the semiparametric GARCH model. The unrestricted SNP expansion is more general than any of these models. The SNP model is fitted using conventional maximum likelihood together with a model selection strategy that determines the appropriate order of expansion.
SNP is a method of multivariate nonparametric time series analysis. SNP is an abbreviation of ‘semi-nonparametric’ which was introduced by Gallant and Nychka (1987) to suggest the notion of a statistical inference methodology that lies halfway between parametric and nonparametric inference. The method employs an expansion in Hermite functions to approximate the conditional density of a multivariate process.
The leading term of this expansion can be chosen through selection of model parameters to be a Gaussian vector autoregression (VAR) model, a semi-parametric VAR model, a Gaussian ARCH model (Engle 1982), a semiparametric ARCH model, a Gaussian GARCH model (Bollerslev 1986), or a semiparametric GARCH model, either univariate or multivariate in each case. The unrestricted SNP expansion is more general than that of any of these models. The SNP model is fitted using maximum likelihood together with a model selection strategy that determines the appropriate order of expansion. Because the SNP model possesses a score, it is an ideal candidate for the auxiliary model in connection with efficient method of moment estimation (Gallant and Tauchen 1996). Due to its leading term, the SNP approach does not suffer from the curse of dimensionality to the same extent as kernels and splines. In regions where data are sparse, the leading term helps to fill in smoothly between data points. Where data are plentiful, the higher-order terms accommodate deviations from the leading term. The method was first proposed by Gallant and Tauchen (1989) in connection with an asset pricing application. A C + + implementation of SNP is at http://econ.duke.edu/webfiles/arg/snp/, together with a User’s Guide, which is an excellent tutorial introduction to the method.
Important adjuncts to SNP estimation are a rejection method for simulating from the SNP density developed in Gallant and Tauchen (1992), which can be used, for example, to set bootstrapped confidence intervals as in Gallant et al. (1992); nonlinear error shock analysis as described in Gallant et al. (1993), which develops the nonlinear analog of conventional error shock analysis for linear VAR models; and re-projection, which is a form of nonlinear Kalman filtering that can be used to forecast the unobservables of nonlinear latent variables models (Gallant and Tauchen 1998).
As stated above, the SNP method is based on the notion that a Hermite expansion can be used as a general purpose approximation to a density function. Letting z denote an M–vector, we can write the Hermite density as where denotes a multivariate polynomial of degree Kz and φ(z) denotes the density function of the (multivariate) Gaussian distribution with mean zero and variance the identity matrix. Denote the coefficients of by a, which is a vector whose length depends on Kz and M. When we wish to call attention to the coefficients, we write
The constant of proportionality is which makes h(z) integrate to one. As seen from the expression that results, namely
we are effectively expanding the square root of the density in Hermite functions of the form . Because the square root of a density is always square integrable and because the Hermite functions of the form are dense for the collection of square integrable functions (Fenton and Gallant 1996), every density has such an expansion. Because is a homogeneous function of the coefficients of the polynomial , the coefficients can only be determined to within a scalar multiple. To achieve a unique representation, the constant term of the polynomial part is put to 1. Customarily the Hermite density is written with its terms orthogonalized and the C ++ code is written in the orthogonalized form for numerical efficiency. But reflecting that here would lead to cluttered notation and add nothing to the ideas.
A change of variables using the location-scale transformation y = Rz + μ, where R is an upper triangular matrix and μ is an M-vector, gives
The constant of proportionality is the same as above, Because {φ[R−1(y − μ)]/| det(R)|} is the density function of the M-dimensional, multivariate, Gaussian distribution with mean μ and variance-covariance matrix Σ = RR′, and because the leading term of the polynomial part is 1, the leading term of the entire expansion is proportional to the multivariate, Gaussian density function. Denote the Gaussian density of dimension M with mean vector μ and variance matrix Σ by nM (y|μ, Σ) and write
where z = R−1(y − μ) for the density above.
When Kz is put to zero, one gets f(y| θ) = nM(y| μ, Σ) exactly. When Kz is positive, one gets a Gaussian density whose shape is modified due to multiplication by a polynomial in z = R−1(y − μ). The shape modifications thus achieved are rich enough to accurately approximate densities from a large class that includes densities with fat, t-like tails, densities with tails that are thinner than Gaussian, and skewed densities (Gallant and Nychka 1987).
The parameters θ of f(y|θ) are made up of the coefficients a of the polynomial plus μ and R and are estimated by maximum likelihood which is accomplished by minimizing \( {s}_n\left(\theta \right)=\left(-1/n\right){\sum}_{t=1}^n\mathit{\log}\left[f\left({y}_t|\theta \right)\right] \) As mentioned above, if the number of parameters pθ grows with the sample size n, the true density and various features of it such as derivatives and moments are estimated consistently (Gallant and Nychka 1987).
This basic approach can be adapted to the estimation of the conditional density of a multiple time series {yt} that has a Markovian structure. Here, the term ‘Markovian structure’ is taken to mean that the conditional density of the M–vector yt given the entire past yt−1,yt−2,... depends only on L lags from the past. For convenience, we will presume that the data are from a process with a Markovian structure, but one should be aware that, if L is sufficiently large, then non-Markovian data can be well approximated by an SNP density (Gallant and Long 1997). Collect these lags together as xt−1 = (yt−1,yt−2,...,yt−L), where L exceeds all lags in the following discussion.
To approximate the conditional density of {yt} using the ideas above, begin with a sequence of innovations {zt}. First consider the case of homogeneous innovations; that is, the distribution of zt does not depend on xt−1. Then, as above, the density of zt can be approximated by where is a polynomial of degree Kz. Follow with the location-scale transformation yt = Rzt + μx where μx is a linear function that depends on Lu lags
(If Lu < L, then some elements of B are zero.) The density that results is
where z = R−1(y − μx). The constant of proportionality is as above, . The leading term of the expansion is nM(y| μx, Σ) which is a Gaussian vector autoregression or Gaussian VAR. When Kz is put to zero, one gets nM(y| μx, Σ) exactly. When Kz is positive, one gets a semiparametric VAR density.
To approximate conditionally heterogeneous processes, proceed as above but let each coefficient of the polynomial be a polynomial of degree Kx in x. A polynomial in z of degree Kz whose coefficients are polynomials of degree Kx in x is, of course, a polynomial in (z, x) of degree Kz + Kx. Denote this polynomial by Denote the mapping from x to the coefficients a of such that by ax and the number of lags on which it depends by Lp. The form of the density with this modification is
where z = R−1(y − μx ). The constant of proportionality is . When Kx is zero, the density reverts to the density above. When Kx is positive, the shape of the density will depend upon x. Thus, all moments can depend upon x and the density can, in principal, approximate any form of conditional heterogeneity (Gallant and Tauchen, 1989).
In practice the second moment can exhibit marked dependence upon x. In an attempt to track the second moment, Kx can get quite large. To keep Kx small when data are markedly conditionally heteroskedastic, the leading term nM (y|μx, Σ) of the expansion can be put to a Gaussian GARCH rather than a Gaussian VAR. SNP uses a modified BEKK expression as described in Engle and Kroner (1995); the modifications are to add leverage and level effects.
Above, R0 is an upper triangular matrix. The matrices Pi, Qi, Vi, and Wi can be scalar, diagonal, or full M by M matrices. The notation x(1),t−i indicates that only the first column of xt−i enters the computation. The max(0, x) function is applied elementwise. Because \( {\sum}_{x_{t-1}} \) must be differentiable with respect to the parameters of \( {\mu}_{x_{t-2-i}} \), the max(0,x) function is approximated by a twice continuously differentiable cubic spline. Defining \( {R}_{x_{t-1}} \) by the factorization \( {\sum}_{x_{t-1}}={R}_{x_{t-1}}{R}_{x_{t-1}}^{\prime } \) and writing x for xt−1, the SNP density becomes
where \( z={R}_x^{-1}\left(y-\mu \right) \). The constant of proportionality is . The leading term nM(y|μx, Σx) is Gaussian ARCH if Lg = 0 and Lr> 0 and Gaussian GARCH if both Lg> 0 and Lr> 0 (leaving aside the implications of Lv and Lw).
Bibliography
Bollerslev, T. 1986. Generalized autoregressive conditional heteroskedasticity. Journal of Econometrics 31: 307–327.
Engle, R.F. 1982. Autoregressive conditional heteroskedasticity with estimates of the variance of United Kingdom inflation. Econometrica 50: 987–1007.
Engle, R.F., and K.F. Kroner. 1995. Multivariate simultaneous generalized ARCH. Econometric Theory 11: 122–150.
Fenton, V.M., and A.R. Gallant. 1996. Qualitative and asymptotic performance of SNP density estimators. Journal of Econometrics 74: 77–118.
Gallant, A.R., D. Hsieh, and G. Tauchen. 1997. Estimation of stochastic volatility models with diagnostics. Journal of Econometrics 81: 159–192.
Gallant, A.R., and J.R. Long. 1997. Estimating stochastic differential equations efficiently by minimum chi-square. Biometrika 84: 125–141.
Gallant, A.R., and D.W. Nychka. 1987. Seminonparametric maximum likelihood estimation. Econometrica 55: 363–390.
Gallant, A.R., P.E. Rossi, and G. Tauchen. 1992. Stock prices and volume. Review of Financial Studies 5: 199–242.
Gallant, A.R., P.E. Rossi, and G. Tauchen. 1993. Nonlinear dynamic structures. Econometrica 61: 871–907.
Gallant, A.R., and G. Tauchen. 1989. Seminonparametric estimation of conditionally constrained heterogeneous processes: Asset pricing applications. Econometrica 57: 1091–1120.
Gallant, A.R., and G. Tauchen. 1992. A nonparametric approach to nonlinear time series analysis: Estimation and simulation. In New directions in time series analysis, part II, ed. D. Brillinger et al. New York: Springer-Verlag.
Gallant, A.R., and G. Tauchen. 1996. Which moments to match? Econometric Theory 12: 657–681.
Gallant, A.R., and G. Tauchen. 1998. Reprojecting partially observed systems with application to interest rate diffusions. Journal of the American Statistical Association 93: 10–24.
Author information
Authors and Affiliations
Editor information
Additional information
Research for this article was supported by the National Science Foundation.
Copyright information
© 2018 Macmillan Publishers Ltd.
About this entry
Cite this entry
Gallant, A.R. (2018). SNP: Nonparametric Time Series Analysis. In: The New Palgrave Dictionary of Economics. Palgrave Macmillan, London. https://doi.org/10.1057/978-1-349-95189-5_1956
Download citation
DOI: https://doi.org/10.1057/978-1-349-95189-5_1956
Published:
Publisher Name: Palgrave Macmillan, London
Print ISBN: 978-1-349-95188-8
Online ISBN: 978-1-349-95189-5
eBook Packages: Economics and FinanceReference Module Humanities and Social SciencesReference Module Business, Economics and Social Sciences