Skip to main content
Log in

Nonlinear Regime-Switching State-Space (RSSS) Models

  • Published:
Psychometrika Aims and scope Submit manuscript

Abstract

Nonlinear dynamic factor analysis models extend standard linear dynamic factor analysis models by allowing time series processes to be nonlinear at the latent level (e.g., involving interaction between two latent processes). In practice, it is often of interest to identify the phases—namely, latent “regimes” or classes—during which a system is characterized by distinctly different dynamics. We propose a new class of models, termed nonlinear regime-switching state-space (RSSS) models, which subsumes regime-switching nonlinear dynamic factor analysis models as a special case. In nonlinear RSSS models, the change processes within regimes, represented using a state-space model, are allowed to be nonlinear. An estimation procedure obtained by combining the extended Kalman filter and the Kim filter is proposed as a way to estimate nonlinear RSSS models. We illustrate the utility of nonlinear RSSS models by fitting a nonlinear dynamic factor analysis model with regime-specific cross-regression parameters to a set of experience sampling affect data. The parallels between nonlinear RSSS models and other well-known discrete change models in the literature are discussed briefly.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Figure 1.
Figure 2.
Figure 3.
Figure 4.

Similar content being viewed by others

Notes

  1. Note that although the two lag-1 cross-regression parameters, \(b_{\mathit{PN},S_{\mathit{it}}}\) and \(b_{\mathit{NP},S_{\mathit{it}}}\), were allowed to vary over time and could be modeled as latent variables as in Chow et al. (2011b), it was not necessary to do so here because these two parameters did not have their own process noise components. Thus, the model comprises only two latent variables, namely, PA it and NA it . However, as in Chow et al. (2011a), the logistic functions still render the dynamic model nonlinear in PA it and NA it .

  2. It may be worth mentioning that Equations (12) differ from the model considered in Chow et al. (2011a) in a number of ways. For instance, Chow et al. (2011a) did not use the absolute function in Equation (2) and allowed the cross-lagged dependencies to materialize in the AR, as opposed to the cross-regression parameters, at extremely high levels of PA and NA from the previous day. They also allowed for random effects in some of the time series parameters, all of which are hypothesized to conform to nonparametric distributions modeled within a Bayesian framework.

  3. The items included in the three parcels included: (1) for PA parcel 1, elated, affectionate, lively, attentive, active, satisfied and calm; (2) for PA parcel 2, excited, love, enthusiastic, alert, interested, pleased and happy; (3) for PA parcel 3, aroused, inspired, proud, determined, strong and relaxed; (4) for NA parcel 1, angry, sad, distressed, jittery, guilty and afraid; (5) for NA parcel 2, upset, hostile, irritable, tense and ashamed, and (6) for NA parcel 3, depressed, agitated, nervous, anxious and scared.

  4. Although the proposed approach can handle missing values assumed to be missing completely at random or missing at random (Little & Rubin 2002), the wide-ranging time intervals in the original data would necessitate the insertion of too many “missing values” between some of the observed time points to create a set of equally spaced data. Thus, the data set in its original form is not particularly conducive for the illustration in the present article.

  5. For the composite scores, we aggregated each participant’s ratings across item parcels to obtain a composite PA score and a composite NA score for each person and time point. The lagged correlation matrix computed using these composite scores was then compared to the lagged correlation matrix computed using the latent variable scores estimated using Equations (A.9A.11) in the Appendix.

  6. The parameter \(\sigma^{2}_{\epsilon_{1}}\) was omitted in computing these average coverage rates because this parameter was characterized by very low coverage rate due to systematic underestimation in the point estimates. To avoid skewing the comparisons across sample size conditions, this parameter was omitted in the computation of the estimates shown in Figure 3, Panel E.

  7. Note that this was only a rough estimate. In the case of linear state-space models that are stationary, closed-form expressions of E(y it |S it ) and \(\mathop {\mathrm {Cov}}(\boldsymbol {y}_{\mathit{it}},\boldsymbol {y}'_{\mathit{it}}|S_{\mathit{it}})\) can be obtained analytically (see, e.g., p. 121, Harvey 2001; Du Toit & Browne 2007). In the case of our general modeling equations, \(E(\boldsymbol {y}_{\mathit{it}}|S_{\mathit{it}}) = \mathbf {d}_{S_{\mathit{it}}}+ \boldsymbol{\Lambda}_{S_{\mathit{it}}}[\mathbf {b}_{S_{\mathit{it}}}(\boldsymbol {\eta}_{i,t-1}, \boldsymbol {x}_{\mathit{it}})]\) whereas \(\mathop {\mathrm {Cov}}(\boldsymbol {y}_{\mathit{it}},\boldsymbol {y}'_{\mathit{it}}|S_{\mathit{it}}) = \boldsymbol{\Lambda}_{S_{\mathit{it}}}\mathop {\mathrm {Cov}}[\mathbf {b}_{S_{\mathit{it}}}(\boldsymbol {\eta}_{i,t-1}, \boldsymbol {x}_{\mathit{it}})] \boldsymbol{\Lambda}'_{S_{\mathit{it}}}\) and closed-form expressions of these functions cannot be obtained analytically. To yield an approximation, we generated data using the simulation model and a large sample size (i.e., with T=1000 and n=1000). Subsequently, we obtained the empirical means and covariance matrices of y it |S it over all people and time points.

References

  • Akaike, H. (1973). Information theory and an extension of the maximum likelihood principle. In B.N. Petrov & F. Csaki (Eds.), Second international symposium on information theory (pp. 267–281). Budapest: Akademiai Kiado.

    Google Scholar 

  • Anderson, B.D.O., & Moore, J.B. (1979). Optimal filtering. Englewood Cliffs: Prentice Hall.

    Google Scholar 

  • Aptech Systems Inc. (2009). GAUSS (version 10) [computer software manual]. Black Diamond: Aptech Systems Inc.

    Google Scholar 

  • Bar-Shalom, Y., Li, X.R., & Kirubarajan, T. (2001). Estimation with applications to tracking and navigation: theory algorithms and software. New York: Wiley.

    Book  Google Scholar 

  • Boker, S.M., Neale, H., Maes, H., Wilde, M., Spiegel, M., Brick, T., et al. (2011). Openmx: an open source extended structural equation modeling framework. Psychometrika, 76(2), 306–317.

    Article  PubMed  Google Scholar 

  • Browne, M.W., & Nesselroade, J.R. (2005). Representing psychological processes with dynamic factor models: some promising uses and extensions of autoregressive moving average time series models. In A. Maydeu-Olivares & J.J. McArdle (Eds.), Contemporary psychometrics: a festschrift for Roderick P. McDonald (pp. 415–452). Mahwah: Erlbaum.

    Google Scholar 

  • Browne, M.W., & Zhang, G. (2007). Developments in the factor analysis of individual time series. In R. Cudeck & R.C. MacCallum (Eds.), Factor analysis at 100: historical developments and future directions (pp. 265–291). Mahwah: Erlbaum.

    Google Scholar 

  • Cacioppo, J.T., & Berntson, G.G. (1999). The affect system: architecture and operating characteristics. Current Directions in Psychological Science, 8, 133–137.

    Article  Google Scholar 

  • Cattell, R., & Barton, K. (1974). Changes in psychological state measures and time of day. Psychological Reports, 35, 219–222.

    Article  PubMed  Google Scholar 

  • Chow, S.-M., Ho, M.-H.R., Hamaker, E.J., & Dolan, C.V. (2010). Equivalences and differences between structural equation and state-space modeling frameworks. Structural Equation Modeling, 17, 303–332.

    Article  Google Scholar 

  • Chow, S.-M., Nesselroade, J.R., Shifren, K., & McArdle, J. (2004). Dynamic structure of emotions among individuals with Parkinson’s disease. Structural Equation Modeling, 11, 560–582.

    Article  Google Scholar 

  • Chow, S.-M., Tang, N., Yuan, Y., Song, X., & Zhu, H. (2011a). Bayesian estimation of semiparametric dynamic latent variable models using the Dirichlet process prior. British Journal of Mathematical & Statistical Psychology, 64(1), 69–106.

    Article  Google Scholar 

  • Chow, S.-M., Zu, J., Shifren, K., & Zhang, G. (2011b). Dynamic factor analysis models with time-varying parameters. Multivariate Behavioral Research, 46(2), 303–339.

    Article  Google Scholar 

  • Cohen, S., Kamarck, T., & Mermelstein, R. (1983). A global measure of perceived stress. Journal of Health and Social Behavior, 24(4), 385–396.

    Article  PubMed  Google Scholar 

  • Collins, L.M., & Wugalter, S.E. (1992). Latent class models for stage-sequential dynamic latent variables. Multivariate Behavioral Research, 28, 131–157.

    Article  Google Scholar 

  • De Jong, P. (1991). The diffuse Kalman filter. The Annals of Statistics, 19, 1073–1083.

    Article  Google Scholar 

  • Dolan, C.V. (2009). Structural equation mixture modeling. In R.E. Millsap & A. Maydeu-Olivares (Eds.), The SAGE handbook of quantitative methods in psychology (pp. 568–592). Thousand Oaks: Sage.

    Chapter  Google Scholar 

  • Dolan, C.V., Schmittmann, V.D., Lubke, G.H., & Neale, M.C. (2005). Regime switching in the latent growth curve mixture model. Structural Equation Modeling, 12(1), 94–119.

    Article  Google Scholar 

  • Dolan, C.V., & Van der Maas, H.L.J. (1998). Fitting multivariate normal finite mixtures subject to structural equation modeling. Psychometrika, 63(3), 227–253.

    Article  Google Scholar 

  • Doornik, J.A. (1998). Object-oriented matrix programming using Ox 2.0. London: Timberlake Consultants Press.

    Google Scholar 

  • Du Toit, S.H.C., & Browne, M.W. (2007). Structural equation modeling of multivariate time series. Multivariate Behavioral Research, 42, 67–101.

    Article  Google Scholar 

  • Durbin, J., & Koopman, S.J. (2001). Time series analysis by state space methods. New York: Oxford University Press.

    Google Scholar 

  • Elliott, R.J., Aggoun, L., & Moore, J. (1995). Hidden Markov models: estimation and control. New York: Springer.

    Google Scholar 

  • Emotions and Dynamic Systems Laboratory (2010). The affective dynamics and individual differences (ADID) study: developing non-stationary and network-based methods for modeling the perception and physiology of emotions Unpublished manual, University of North Carolina at Chapel Hill.

  • Engle, R.F., & Watson, M. (1981). A one-factor multivariate time series model of metropolitan wage rates. Journal of the American Statistical Association, 76, 774–781.

    Article  Google Scholar 

  • Ferrer, E., & Nesselroade, J.R. (2003). Modeling affective processes in dyadic relations via dynamic factor analysis. Emotion, 3, 344–360.

    Article  PubMed  Google Scholar 

  • Fukuda, K., & Ishihara, K. (1997). Development of human sleep and wakefulness rhythm during the first six months of life: discontinuous changes at the 7th and 12th week after birth. Biological Rhythm Research, 28, 94–103.

    Article  Google Scholar 

  • Geweke, J.F., & Singleton, K.J. (1981). Maximum likelihood confirmatory factor analysis of economic time series. International Economic Review, 22, 133–137.

    Google Scholar 

  • Hamaker, E.L., Dolan, C.V., & Molenaar, P.C.M. (2003). ARMA-based SEM when the number of time points T exceeds the number of cases N: raw data maximum likelihood. Structural Equation Modeling, 10, 352–379.

    Article  Google Scholar 

  • Hamaker, E.L., & Grasman, R.P.P.P. (2012). Regime switching state-space models applied to psychological processes: handling missing data and making inferences. Psychometrika, 77(2), 400–422.

    Article  Google Scholar 

  • Hamilton, J.D. (1989). A new approach to the economic analysis of nonstationary time series and the business cycle. Econometrica, 57, 357–384.

    Article  Google Scholar 

  • Harvey, A.C. (2001). Forecasting, structural time series models and the Kalman filter. Cambridge: Cambridge University Press.

    Google Scholar 

  • Heron, K.E., & Smyth, J.M. (2010). Ecological momentary interventions: incorporating mobile technology into psychosocial and health behavior treatments. British Journal of Health Psychology, 15, 1–39.

    Article  PubMed  Google Scholar 

  • Hosmer, D.W. (1974). Maximum likelihood estimates of parameters of a mixture of two regression lines. Communications in Statistics. Theory and Methods, 3, 995–1006.

    Article  Google Scholar 

  • Jackson, C.H. (2011). Multi-state models for panel data: the msm package for R. Journal of Statistical Software, 38(8), 1–29. Available from http://www.jstatsoft.org/v38/i08/.

    Google Scholar 

  • Kenny, D.A., & Judd, C.M. (1984). Estimating the nonlinear and interactive effects of latent variables. Psychological Bulletin, 96, 201–210.

    Article  Google Scholar 

  • Kim, C.-J., & Nelson, C.R. (1999). State-space models with regime switching: classical and Gibbs-sampling approaches with applications. Cambridge: MIT Press.

    Google Scholar 

  • Kishton, J.M., & Widaman, K.F. (1994). Unidimensional versus domain representative parceling of questionnaire items: an empirical example. Educational and Psychological Measurement, 54, 757–765.

    Article  Google Scholar 

  • Koopman, S.J. (1997). Exact initial Kalman filtering and smoothing for nonstationary time series models. Journal of the American Statistical Association, 92, 1630–1638.

    Article  Google Scholar 

  • Kuppens, P., Allen, N.B., & Sheeber, L.B. (2010). Emotional inertia and psychological adjustment. Psychological Science, 21, 984–991.

    Article  PubMed  Google Scholar 

  • Lanza, S.T., & Collins, L.M. (2008). A new SAS procedure for latent transition analysis: transitions in dating and sexual risk behavior. Developmental Psychology, 44(2), 446–456.

    Article  PubMed  Google Scholar 

  • Larsen, R.J., & Diener, E. (1992). Promises and problems with the circumplex model of emotion. Review of Personality and Social Psychology, 13, 25–59.

    Google Scholar 

  • Li, F., Duncan, T.E., & Acock, A. (2000). Modeling interaction effects in latent growth curve models. Structural Equation Modeling, 7(4), 497–533.

    Article  Google Scholar 

  • Little, R.J.A., & Rubin, D.B. (2002). Statistical analysis with missing data. New York: Wiley.

    Google Scholar 

  • Marsh, W.H., Wen, Z.L., & Hau, J.-T. (2004). Structural equation models of latent interactions: evaluation of alternative estimation strategies and indicator construction. Psychological Methods, 9, 275–300.

    Article  PubMed  Google Scholar 

  • Molenaar, P.C.M. (1985). A dynamic factor model for the analysis of multivariate time series. Psychometrika, 50, 181–202.

    Article  Google Scholar 

  • Molenaar, P.C.M. (1994a). Dynamic factor analysis of psychophysiological signals. In J.R. Jennings, P.K. Ackles, & M.G.H. Coles (Eds.), Advances in psychophysiology: a research annual (Vol. 5, pp. 229–302). London: Jessica Kingsley Publishers.

    Google Scholar 

  • Molenaar, P.C.M. (1994b). Dynamic latent variable models in developmental psychology. In A. von Eye & C. Clogg (Eds.), Latent variables analysis: applications for developmental research (pp. 155–180). Thousand Oaks: Sage.

    Google Scholar 

  • Molenaar, P.C.M., & Nesselroade, J.R. (1998). A comparison of pseudo-maximum likelihood and asymptotically distribution-free dynamic factor analysis parameter estimation in fitting covariance-structure models to block-Toeplitz matrices representing single-subject multivariate time series. Multivariate Behavioral Research, 33, 313–342.

    Article  Google Scholar 

  • Molenaar, P.C.M., & Newell, K.M. (2003). Direct fit of a theoretical model of phase transition in oscillatory finger motions. British Journal of Mathematical & Statistical Psychology, 56, 199–214.

    Article  Google Scholar 

  • Muthén, B.O., & Asparouhov, T. (2011). LTA in Mplus: transition probabilities influenced by covariates (Mplus Web Notes: No. 13).

  • Muthén, L.K., & Muthén, B.O. (2001). Mplus: the comprehensive modeling program for applied researchers: user’s guide. Los Angeles: Muthén & Muthén. 1998–2001.

    Google Scholar 

  • Nesselroade, J.R., McArdle, J.J., Aggen, S.H., & Meyers, J.M. (2002). Dynamic factor analysis models for representing process in multivariate time-series. In D.S. Moskowitz & S.L. Hershberger (Eds.), Modeling intraindividual variability with repeated measures data: methods and applications (pp. 235–265). Mahwah: Erlbaum.

    Google Scholar 

  • Nylund-Gibson, K., Muthen, B., Nishina, A., Bellmore, A., & Graham, S. (2013, under review). Stability and instability of peer victimization during middle school: using latent transition analysis with covariates, distal outcomes, and modeling extensions.

  • Piaget, J., & Inhelder, B. (1969). The psychology of the child. New York: Basic Books.

    Google Scholar 

  • R Development Core Team (2009). R: a language and environment for statistical computing [computer software manual]. Vienna, Austria: R Foundation for Statistical Computing. ISBN 3-900051-07-0. Available from. http://www.R-project.org.

    Google Scholar 

  • Russell, J.A. (1980). A circumplex model of affect. Journal of Personality and Social Psychology, 39(1–sup–6), 1161–1178.

    Article  Google Scholar 

  • SAS Institute Inc. (2008). SAS 9.2 help and documentation [computer software manual]. Cary: SAS Institute Inc.

    Google Scholar 

  • Sbarra, D.A., & Ferrer, E. (2006). The structure and process of emotional experience following non-marital relationship dissolution: dynamic factor analyses of love, anger, and sadness. Emotion, 6, 224–238.

    Article  PubMed  Google Scholar 

  • Schmittmann, V.D., Dolan, C.V., van der Maas, H., & Neale, M.C. (2005). Discrete latent Markov models for normally distributed response data. Multivariate Behavioral Research, 40(2), 207–233.

    Article  Google Scholar 

  • Schumacker, R.E. & Marcoulides, G.A. (Eds.) (1998). Interaction and nonlinear effects in structural equation modeling. Mahwah: Erlbaum.

    Google Scholar 

  • Schwarz, G. (1978). Estimating the dimension of a model. The Annals of Statistics, 6, 461–464.

    Article  Google Scholar 

  • Schweppe, F. (1965). Evaluation of likelihood functions for Gaussian signals. IEEE Transactions on Information Theory, 11, 61–70.

    Article  Google Scholar 

  • Tiao, G.C., & Tsay, R.S. (1994). Some advances in non-linear and adaptive modelling in time-series. Journal of Forecasting, 13, 109–131.

    Article  Google Scholar 

  • Tong, H., & Lim, K.S. (1980). Threshold autoregression, limit cycles and cyclical data. Journal of the Royal Statistical Society. Series B, 42, 245–292.

    Google Scholar 

  • Van der Maas, H.L.J., & Molenaar, P.C.M. (1992). Stagewise cognitive development: an application of catastrophe theory. Psychological Review, 99(3), 395–417.

    Article  PubMed  Google Scholar 

  • Van Dijk M., & Van Geert, P. (2007). Wobbles, humps and sudden jumps: a case study of continuity, discontinuity and variability in early language development. Infant and Child Development, 16(1), 7–33.

    Article  Google Scholar 

  • Visser, I. (2007). Depmix: an R-package for fitting mixture models on mixed multivariate data with Markov dependencies (Tech. Rep.). University of Amsterdam. Available from http://cran.r-project.org.

  • Watson, D., Clark, L.A., & Tellegen, A. (1988). Development and validation of brief measures of positive and negative affect: the PANAS scale. Journal of Personality and Social Psychology, 54(6), 1063–1070.

    Article  PubMed  Google Scholar 

  • Wen, Z., Marsh, H.W., & Hau, K.-T. (2002). Interaction effects in growth modeling: a full model. Structural Equation Modeling, 9(1), 20–39.

    Article  Google Scholar 

  • Yang, M., & Chow, S.-M. (2010). Using state-space model with regime switching to represent the dynamics of facial electromyography (EMG) data. Psychometrika, 74(4), 744–771. Application and Case Studies.

    Article  Google Scholar 

  • Yung, Y.F. (1997). Finite mixtures in confirmatory factor-analysis models. Psychometrika, 62, 297–330.

    Article  Google Scholar 

  • Zautra, A.J., Potter, P.T., & Reich, J.W. (1997). The independence of affect is context-dependent: an integrative model of the relationship between positive and negative affect. In M.P. Lawton, K.W. Schaie, & M.P. Lawton (Eds.), Annual review of gerontology and geriatrics: Vol. 17. Focus on adult development (pp. 75–103). New York: Springer.

    Google Scholar 

  • Zautra, A.J., Reich, J.W., Davis, M.C., Potter, P.T., & Nicolson, N.A. (2000). The role of stressful events in the relationship between positive and negative affects: evidence from field and experimental studies. Journal of Personality, 68, 927–951.

    Article  PubMed  Google Scholar 

  • Zhang, Z., Hamaker, E.L., & Nesselroade, J.R. (2008). Comparisons of four methods for estimating a dynamic factor model. Structural Equation Modeling, 15, 377–402.

    Article  Google Scholar 

  • Zhang, Z., & Nesselroade, J.R. (2007). Bayesian estimation of categorical dynamic factor models. Multivariate Behavioral Research, 42, 729–756.

    Article  Google Scholar 

Download references

Acknowledgements

Funding for this study was provided by a grant from NSF (BCS-0826844). We would like to thank Manshu Yang for thoughtful comments on earlier drafts of this manuscript.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Sy-Miin Chow.

Appendix

Appendix

1.1 A.1 The Extended Kim Filter and Extended Kim Smoother

We outline the key procedures for implementing the extended Kim filter and extended Kim smoother here. Besides the modifications added to accommodate linearization constraints, the estimation process is identical to that associated with the linear Kim filter and Kim smoother. Interested readers are referred to Kim and Nelson (1999) for further details.

The extended Kim filter algorithm can be decomposed into three parts: the extended Kalman filter (for latent variable estimation), the Hamilton filter (to estimate the latent regime indicator, S it ) and a collapsing procedure (to consolidate regime-specific estimates to reduce computational burden). For didactic reasons, we will describe the extended Kalman filter followed by the collapsing process and finally, the Hamilton filter. In actual implementation, however, the Hamilton filter step has to be executed before the collapsing process takes place. MATLAB scripts for implementing these procedures with annotated comments can be downloaded from the first author’s website at http://www.personal.psu.edu/quc16/.

1.2 A.2 The Extended Kalman Filter (EKF)

The extended Kalman filter (EKF) essentially provides a way to derive longitudinal factor or latent variable scores in real time as a new observation, y it , is brought in. Let \(\boldsymbol {\eta }_{i,t|t-1}^{j,k}=E(\boldsymbol {\eta}_{\mathit{it}}|S_{\mathit{it}}=k,S_{i,t-1}=j, \mathbf {Y}_{i,t-1})\), \(\mathbf {P}_{i,t|t-1}^{j,k} = \mathop {\mathrm {Cov}}(\boldsymbol {\eta}_{\mathit{it}}|S_{\mathit{it}}=k,S_{i,t-1}=j, \mathbf {Y}_{i,t-1})\), \(\boldsymbol {v}_{\mathit{it}}^{j,k}\) is the one-step-ahead prediction errors and \(\mathbf {F}_{\mathit{it}}^{j,k}\) is the associated covariance matrix; j and k are indices for the previous regime and current regime, respectively. The extended Kalman filter can be expressed as

(A.1)
(A.2)
(A.3)
(A.4)
(A.5)
(A.6)

where \(\mathbf {K}_{k,it}= \mathbf {P}_{i,t|t-1}^{j,k} \boldsymbol {\Lambda }_{k}' [ \mathbf {F}_{\mathit{it}}^{j,k}]^{-1}\) is called the Kalman gain function; B k,it is the Jacobian matrix that consists of differentiations of the dynamic functions around the latent variable estimates, \(\boldsymbol {\eta }_{i,t-1|t-1}^{j}\), namely, \(\mathbf {B}_{k,it} = \frac {\partial \mathbf {b}_{k}( \boldsymbol {\eta }_{i,t-1|t-1}^{j}, \boldsymbol {x}_{\mathit{it}})}{\partial \boldsymbol {\eta }_{i,t-1|t-1}^{j}}\), with the time-varying covariates, x it , fixed at their observed values. The gth row and lth column of B k,it carries the partial derivative of the gth dynamic function characterizing regime k with respect to the lth latent variable, evaluated at subject i’s conditional latent variable estimates from time t−1, \(\boldsymbol {\eta }_{i,t-1|t-1}^{j}\), from the jth regime. The subject index in B k,it is used to indicate that the Jacobian matrix has different numerical values because it is evaluated at each person’s respective latent variable estimates, not that the dynamic functions are subject-dependent. Because our hypothesized measurement functions are linear, no linearization of the measurement functions is needed.

The EKF summarized in Equations (A.1A.6) works recursively (i.e., one time point at a time) from time 1 to T and i=1,…,n until \(\boldsymbol {\eta }_{i,t|t}^{j,k}\) and \(\mathbf {P}_{i,t|t}^{j,k}\), have been computed for all time points and people. To start the filter, the initial latent variable scores at time t = 0, η 0, are assumed to be distributed as η 0∼ MVN(η 0|0,P 0|0). Typically, η 0 is assumed to have a diffuse density, that is, η 0|0 is fixed to be a vector of constant values (e.g., a vector of zeros) and the diagonal elements of the covariance matrix P 0|0 are set to some arbitrarily large constants.

1.3 A.3 The Collapsing Process

At each t, the EKF procedures utilize only the marginal estimates, \(\boldsymbol {\eta }_{i,t-1|t-1}^{j}\) and \(\mathbf {P}_{i,t-1|t-1}^{j}\), from the previous time point. This is because to ease computational burden, a collapsing procedure is performed on \(\boldsymbol {\eta }_{i,t|t}^{j,k}\) and \(\mathbf {P}_{i,t|t}^{j,k}\) after each EKF step to yield \(\boldsymbol {\eta }_{i,t|t}^{k}\) and \(\mathbf {P}_{i,t|t}^{k}\). Given a total of M regimes, if no collapsing is used, the M sets of computations involving \(\boldsymbol {\eta }_{i,t-1|t-1}^{j}\) and \(\mathbf {P}_{i,t-1|t-1}^{j}\) in Equations (A.1A.2) would have to be performed using \(\boldsymbol {\eta}^{j,k}_{i,t-1|t-1}\) and \(\mathbf {P}^{j,k}_{i,t-1|t-1}\) for every possible value of j and k. As a result, the number of possible values of filtered estimates increases directly with time, leading to considerable computational and storage burden if T is large. To circumvent this computational issue, Kim and Nelson (1999) proposed collapsing the M×M sets of new \(\boldsymbol {\eta }_{i,t|t}^{j,k}\) and \(\mathbf {P}_{i,t|t}^{j,k}\) at each t as

(A.7)

where W it is called the weighting factor, the elements of which are computed using the Hamilton filter.

1.4 A.4 The Hamilton Filter

The Hamilton filter is also a recursive process and it can be expressed as:

(A.8)

where Pr[S it =k|S i,t−1=j] are elements of the transition probability matrix shown in Equation (8). f(y it |S it =k,S i,t−1=j,Y i,t−1) is a multivariate normal likelihood function expressed as

f(y it |Y i,t−1) in Equation (A.8) is often referred to as the prediction error decomposition function (Schweppe 1965). Taking the log of this value, log[f(y it |Y i,t−1)], and subsequently summing over t=1,…,T and then i=1,…,n yields the overall log-likelihood value, denoted herein as log[f(Y|θ)], that can then be maximized using an optimization procedure of choice (e.g., Newton–Raphson) to obtain estimates of θ. Standard errors associated with θ can be obtained by taking the square root of the diagonal elements of I −1 at the convergence point, where I is the observed information matrix, obtained by computing the negative numerical Hessian matrix of log[f(Y|θ)]. Information criterion measures such as the AIC (Akaike 1973) and BIC (Schwarz 1978) can be computed using log[f(y it |Y i,t−1)] as (see p. 80, Harvey 2001):

where q is the number of parameters in a model.

Since the prediction error decomposition function in Equation (A.8) is essentially a raw data likelihood function, missing values can be readily accommodated by using only the nonmissing observed elements of y it in computing the prediction errors, \(\boldsymbol {v}_{\mathit{it}}^{j,k}\) and their associated covariance matrix. To handle missing data in the EKF, we used the approach suggested by Hamaker and Grasman (2012), that is, to only update the estimates in \(\{ \boldsymbol {\eta }_{i,t|t}^{j,k},\ \mathbf {P}_{i,t|t}^{j,k},\ \boldsymbol {\eta }_{i,t|t}^{k},\ \mathbf {P}_{i,t|t}^{k},\ \mathit{Pr}[S_{i,t-1}=j,S_{\mathit{it}}=k| \mathbf {Y}_{\mathit{it}}],\ \mathit{Pr}[S_{\mathit{it}}=k| \mathbf {Y}_{\mathit{it}}]\}\) using nonmissing elements from each measurement occasion.

1.5 A.5 The Extended Kim Smoother (EKS)

Given estimates from the EKF, the extended Kim smoother (EKS) can be used to obtain more accurate latent variable estimates and regime probabilities based on all observed information from each individual’s entire time series. Using \(\boldsymbol {\eta }_{i,t|t-1}^{j,k}, \mathbf {P}_{i,t|t-1}^{j,k}, \boldsymbol {\eta }_{i,t|t}^{k}, \mathbf {P}_{i,t|t}^{k}\), Pr[S it =k|Y it ] and Pr[S it =k|Y i,t−1], the smoothing procedure can be implemented for t=T−1,…,1 and i=1,…,n as follows. First, smoothed estimates from regime j to regime k are obtained as

(A.9)

where \(\widetilde{ \mathbf {P}}_{t}^{k,h}= \mathbf {P}_{i,t|t}^{k}\mathbf {B}_{h,it}'[ \mathbf {P}_{i,t+1|t}^{k,h}]^{-1}\). Similar to the collapsing procedure used in the extended Kim filter, a collapsing process is implemented here as

(A.10)

Finally, smoothed latent variable estimates and their associated covariance matrix are obtained by summing over the M regimes in effect to yield

(A.11)

Equations (A.9A.11) yield three sets of estimates: η i,t|T , the smoothed latent variable estimates conditional on all observations, the smoothed covariance matrix, P i,t|T , and Pr[S it =k|Y iT ], the smoothed probability for person i to be in regime k at time t.

Rights and permissions

Reprints and permissions

About this article

Cite this article

Chow, SM., Zhang, G. Nonlinear Regime-Switching State-Space (RSSS) Models. Psychometrika 78, 740–768 (2013). https://doi.org/10.1007/s11336-013-9330-8

Download citation

  • Received:

  • Revised:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s11336-013-9330-8

Key words

Navigation