A time-dependent extension of the projected normal regression model for longitudinal circular data based on a hidden Markov heterogeneity structure

Maruotti, Antonello; Punzo, Antonio; Mastrantonio, Gianluca; Lagona, Francesco

doi:10.1007/s00477-015-1183-5

Original Paper
Published: 01 December 2015

Volume 30, pages 1725–1740, (2016)
Stochastic Environmental Research and Risk Assessment

Antonello Maruotti¹,², Antonio Punzo³, Gianluca Mastrantonio⁴ & Francesco Lagona⁵

Abstract

The modelling of animal movement is an important ecological and environmental issue. It is well known that animals change their movement patterns over time, according to observable and unobservable factors. To trace the dynamics of behaviors, and to identify both the factors influencing these dynamics and the unobserved characteristics driving intra-subject correlations, we introduce a time-dependent mixed effects projected normal regression model. A set of animal-specific parameters following a hidden Markov chain is introduced to deal with unobserved heterogeneity. For the maximum likelihood estimation of the model parameters, we outline an expectation–maximization algorithm. A large-scale simulation study provides evidence on model behavior. The data analysis approach based on the proposed model is finally illustrated by an application to a dataset on a population of Talitrus saltator from the beach of Castiglione della Pescaia (Italy).

References

Bacci S, Pandolfi A, Pennoni F (2014) A comparison of some criteria for states selection in the latent Markov model for longitudinal data. Adv Data Anal Classif 8:125–145
Bartolucci F, Farcomeni A (2009) A multivariate extension of the dynamic logit model for longitudinal data based on a latent Markov heterogeneity structure. J Am Stat Assoc 104:816–831
Bartolucci F, Farcomeni A, Pennoni F (2013) Latent Markov model for longitudinal data. CRC Press, Boca Raton
Baum LE, Petrie T, Soules G, Weiss N (1970) A maximization technique occurring in the statistical analysis of probabilistic functions of Markov chains. Ann Math Stat 41:164–171
Breed GA, Jonsen ID, Myers RA, Bowen WD, Leonard ML (2009) Sex-specific, seasonal foraging tactics of adult grey seals (Halichoerus grypus) revealed by state space analysis. Ecology 90:3209–3221
Bulla J, Lagona F, Maruotti A, Picone M (2012) A multivariate hidden Markov model for the identification of sea regimes from incomplete skewed and circular time series. J Agric Biol Environ Stat 17:544–567
Bulla J, Lagona F, Maruotti A, Picone M (2015) Environmental conditions in semi-enclosed basins: a dynamic latent class approach for mixed-type multivariate variables. Journal de la Société Française de Statistique 156:114–136
Carnicero JA, Ausin MC, Wiper MP (2013) Non-parametric copulas for circular-linear and circular-circular data: an application to wind directions. Stoch Environ Res Risk Assess 27:1991–2002
D’Elia A (2001) A statistical model for orientation mechanism. Stat Methods Appl 10:157–174
Farcomeni A (2015) Generalized linear mixed models based on latent Markov heterogeneity structures. Scand J Stat 42:1127–1135
Fisher NI, Lee AJ (1992) Regression models for angular response. Biometrics 48:665–677
Gill J, Hangartner D (2010) Circular data in political science and how to handle it. Polit Anal 18:316–336
Hanks EM, Hooten MB, Johnson DS, Sterling JT (2011) Velocity-based movement modeling for individual and population level inference. PLoS One 6:e22795
Heiss F (2008) Sequential numerical integration in nonlinear state space models for microeconometric panel data. J Appl Econ 23:373–389
Hokimoto T, Kiyofuji H (2014) Effect of regime switching on behavior of albacore under the influence of phytoplankton concentration. Stoch Environ Res Risk Assess 28:1099–1124
Holzmann H, Munk A, Suster M, Zucchini W (2006) Hidden Markov models for circular and linear-circular time series. Environ Ecol Stat 13:325–347
Hornik K, Grün B (2014) movMF: an R package for fitting mixtures of von Mises–Fisher distributions. J Stat Softw 58
Jammalamadaka RA, SenGupta A (2001) Topics in circular statistics. World Scientific, Singapore
Johnson RA, Wehrly TE (1978) Some angular-linear distributions and related regression models. J Am Stat Assoc 73:602–606
Lagona F (2015) Regression analysis of correlated circular data based on the multivariate von Mises distribution. Environ Ecol Stat. doi:10.1007/s10651-015-0330-y
Lagona F, Jdanov D, Shkolnikova M (2014) Latent time-varying factor in longitudinal analysis: a linear mixed hidden Markov model for heart rates. Stat Med 33:4116–4134
Lagona F, Picone M, Maruotti A, Cosoli S (2015) A hidden Markov approach to the analysis of space–time environmental data with linear and circular components. Stoch Environ Res Risk Assess 29:397–409
Langrock R, King R, Matthiopoulos J, Thomas L, Fortin D, Morales JM (2012) Flexible and practical modeling of animal telemetry data: hidden Markov models and extensions. Ecology 93:2336–2342
Langrock R, Hopcraft JGC, Blackwell PG, Goodall V, King R, Niu M, Patterson TA, Pedersen MW, Skarin A, Schick RS (2014a) Modelling group dynamic animal movement. Methods Ecol Evol 5:190–199
Langrock R, Marques TA, Baird RW, Thomas L (2014b) Modeling the diving behavior of whales: a latent-variable approach with feedback and semi-Markovian components. J Agric Biol Environ Stat 19:82–100
Lee A (2010) Circular data. Wiley Interdiscip Rev 2:477–486
Leroux BG, Puterman ML (1992) Maximum-penalized-likelihood estimation for independent and Markov-dependent mixture models. Biometrics 48:545–558
Maruotti A (2011) Mixed hidden Markov models for longitudinal data: an overview. Int Stat Rev 79:427–454
Maruotti A, Rocci R (2012) A mixed non-homogeneous hidden Markov model for categorical data, with application to alcohol consumption. Stat Med 9:871–886
Mastrantonio G, Jona-Lasinio G, Maruotti A (2015) Bayesian hidden Markov modelling using circular-linear general projected normal distribution. Environmetrics 26:145–158
McClintock BT, King R, Thomas L, Matthiopoulos J, McConnell BJ, Morales JM (2012) A general discrete-time modeling framework for animal movement using multi-state random walks. Ecol Monogr 82:335–349
McKellar AE, Langrock R, Walters JR, Kesler DC (2015) Using mixed hidden Markov models to examine behavioral states in a cooperatively breeding bird. Behav Ecol 26:148–157
McLellan CR, Worton BJ, Deasy W, Birch ANE (2015) Modelling larval movement data from individual bioassays. Biom J 57(3):485–501
Nathan R, Getz WM, Revilla E, Holyoak M, Kadmon R, Saltz D, Smouse PE (2008) A movement ecology paradigm for unifying organismal movement research. Proc Natl Acad Sci 105:19052–19059
Nunez-Antonio G, Gutierrez-Pena E (2014) A Bayesian model for longitudinal circular data based on the projected normal distribution. Comput Stat Data Anal 71:506–519
Patlak CS (1953a) A mathematical contribution to the study of orientation of organisms. Bull Math Biophys 15:431–476
Patlak CS (1953b) Random walk with persistence and external bias. Bull Math Biophys 15:311–338
Patterson TA, Thomas L, Wilcox C, Ovaskainen O, Matthiopoulos J (2008) State-space models of individual animal movement. Trends Ecol Evol 23:87–94
Presnell B, Morrison SP, Littell RC (1998) Projected multivariate linear model for directional data. J Am Stat Assoc 93:1068–1077
Song PXK (2007) Correlated data analysis. Springer, Berlin
Visser I, Raijmakers M, Molenaar P (2000) Confidence intervals for hidden Markov model parameters. Br J Math Stat Psychol 53:317–327
Visser I, Raijmakers MEJ, Molenaar PCM (2002) Fitting hidden Markov models to psychological data. Sci Program 10:185–199
Wang F, Gelfand A (2013) Directional data analysis under the general projected normal distribution. Stat Methodol 10:113–127
Wang F, Gelfand A (2014) Modeling space and space–time directional data using projected Gaussian processes. J Am Stat Assoc 109:1565–1580
Wang F, Gelfand A, Jona-Lasinio G (2015) Joint spatio-temporal analysis of a linear and a directional variable: space–time modeling of wave heights and wave directions in the Adriatic Sea. Stat Sin 25:25–39

Author information

Authors and Affiliations

Dipartimento di Scienze Economiche, Politiche e delle Lingue Moderne, Libera Università Maria Ss. Assunta, Roma, Italy
Antonello Maruotti
Centre for Innovation and Leadership in Health Sciences, University of Southampton, Southampton, UK
Antonello Maruotti
Dipartimento di Economia e Impresa, Università di Catania, Catania, Italy
Antonio Punzo
Dipartimento di Economia, Università di Roma Tre, Roma, Italy
Gianluca Mastrantonio
Dipartimento di Scienze Politiche, Università di Roma Tre, Roma, Italy
Francesco Lagona

Corresponding author

Correspondence to Antonello Maruotti.

Appendices

Appendix 1: The mixed projected normal model

The mixed projected normal model

$$\mu _{itj} = {\mathbf{x}}_{it}'{\varvec{\beta }}_j + b_{ij},\quad j=1,2,$$

relaxes the independence assumption on repeated measurements and may account for correlations between projections. Indeed, by assuming that projections are conditionally independent given the covariates and the random effects, for the i-th unit, the likelihood contribution would be

$$\begin{aligned} \int \int \prod _{t=1}^Tf({\mathbf{y}}_{it}\mid {\mathbf{x}}_{it},{\mathbf{b}}_{i})g({\mathbf{b}}_{i})d{\mathbf{b}}_{i}=\int \int \prod _{t=1}^Tf(y_{it1}\mid {\mathbf{x}}_{it}, b_{i1})f(y_{it2}\mid {\mathbf{x}}_{it}, b_{i2})g(b_{i1},b_{i2})db_{i1}db_{i2}, \end{aligned}$$

where $g(\cdot )$ is the random effects density function. Nevertheless, in practice, $g(b_{i1},b_{i2}) = g_1(b_{i1})g_2(b_{i2})$, i.e. the random effects are assumed independent and, accordingly, the projections can be modelled separately and are independent as well. Thus, although in theory the mixed projected normal model could allow for dependence between projections, this feature is not investigated or modelled in the literature. Formally, if $g(b_{i1},b_{i2}) = g_1(b_{i1})g_2(b_{i2})$, we have

$$\begin{aligned} \int \int \prod _{t=1}^Tf(y_{it1}\mid {\mathbf{x}}_{it}, b_{i1})f(y_{it2}\mid {\mathbf{x}}_{it},b_{i2})g(b_{i1},b_{i2})db_{i1}db_{i2}\\ = \int \prod _{t=1}^Tf(y_{it1}\mid {\mathbf{x}}_{it},b_{i1})g_1(b_{i1})db_{i1}\int \prod _{t=1}^Tf(y_{it2}\mid {\mathbf{x}}_{it},b_{i2})g_2(b_{i2})db_{i2} \end{aligned}$$

and, thus, the projections are independent.
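
The factorization can be checked numerically. The following Python sketch is an illustration only, not the authors' code: it assumes, purely for convenience, random effects with a two-point support (so the integrals become finite sums) and unit-variance normal densities for the two projections. The names `joint_lik` and `marginal_lik`, the linear predictors `eta1` and `eta2` (standing in for ${\mathbf{x}}_{it}'{\varvec{\beta }}_j$), and all numerical values are hypothetical.

```python
import math
from itertools import product

def normal_pdf(y, mean):
    # Unit-variance normal density (an assumption made for this sketch).
    return math.exp(-0.5 * (y - mean) ** 2) / math.sqrt(2.0 * math.pi)

def joint_lik(y1, y2, eta1, eta2, support, g):
    # Sum over (b1, b2) of prod_t f(y_t1 | b1) f(y_t2 | b2) g(b1, b2).
    total = 0.0
    for b1, b2 in product(support, repeat=2):
        lik = g[(b1, b2)]
        for t in range(len(y1)):
            lik *= normal_pdf(y1[t], eta1[t] + b1) * normal_pdf(y2[t], eta2[t] + b2)
        total += lik
    return total

def marginal_lik(y, eta, support, g_marg):
    # Sum over b of prod_t f(y_t | b) g_j(b) for a single projection.
    return sum(
        g_marg[b] * math.prod(normal_pdf(y[t], eta[t] + b) for t in range(len(y)))
        for b in support
    )

support = (-1.0, 1.0)
y1, eta1 = [0.2, -0.4, 1.1], [0.1, 0.0, 0.3]
y2, eta2 = [-0.7, 0.3, -0.2], [0.0, 0.2, -0.1]

# Independent random effects: g(b1, b2) = g1(b1) g2(b2).
g1 = {-1.0: 0.3, 1.0: 0.7}
g2 = {-1.0: 0.5, 1.0: 0.5}
g_indep = {(b1, b2): g1[b1] * g2[b2] for b1, b2 in product(support, repeat=2)}
lhs_indep = joint_lik(y1, y2, eta1, eta2, support, g_indep)
rhs_indep = marginal_lik(y1, eta1, support, g1) * marginal_lik(y2, eta2, support, g2)

# Perfectly dependent random effects with uniform marginals.
g_dep = {(-1.0, -1.0): 0.5, (1.0, 1.0): 0.5, (-1.0, 1.0): 0.0, (1.0, -1.0): 0.0}
u = {-1.0: 0.5, 1.0: 0.5}
lhs_dep = joint_lik(y1, y2, eta1, eta2, support, g_dep)
rhs_dep = marginal_lik(y1, eta1, support, u) * marginal_lik(y2, eta2, support, u)
```

Under these assumptions `lhs_indep` and `rhs_indep` agree up to rounding, whereas `lhs_dep` and `rhs_dep` differ: the likelihood contribution splits into two separate factors only when the random effects are independent.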

Appendix 2: Computational details

As it stands, the likelihood function of the hidden Markov projected normal model is of little computational use: for each unit i it involves a sum over $K^T$ terms, which quickly becomes infeasible to evaluate directly, even for small values of K, as T grows to moderate size. A more efficient procedure is therefore needed to compute the likelihood function. This issue may be addressed via the so-called forward variables (Baum et al. 1970). Let us start by defining

$$\alpha _{itk}= f({\mathbf{y}}_{i1},\ldots ,{\mathbf{y}}_{it},{\mathbf{b}}_{it}={\mathbf{b}}_k),\quad i =1,\ldots ,I, \ t = 1,\ldots ,T,$$

which represents the joint density of the partial sequence $({\mathbf{y}}_{i1},\ldots ,{\mathbf{y}}_{it})$ and of unit i being in state k at time t. We can compute $\alpha _{itk}$ recursively by

$$\alpha _{i1k} = \pi _k f({\mathbf{y}}_{i1}\mid {\mathbf{b}}_{i1}={\mathbf{b}}_k)$$

$$\alpha _{it+1k} = \sum _{h=1}^K\alpha _{ith}\pi _{k|h} f({\mathbf{y}}_{it+1}\mid {\mathbf{b}}_{it+1}={\mathbf{b}}_k).$$

As a by-product of the forward procedure, the log-likelihood can be written as

$$\ell ({\varvec{\lambda }}) = \sum _{i=1}^I\log \sum _{k=1}^K\alpha _{iTk}.$$
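
The forward recursion for a single unit can be sketched in Python as follows. This is a minimal illustration, not the authors' implementation: the state-dependent densities $f({\mathbf{y}}_{it}\mid {\mathbf{b}}_{it}={\mathbf{b}}_k)$ are assumed to be precomputed and passed in as the hypothetical table `dens`, the toy numbers are made up, and the per-step rescaling of the forward variables is a standard numerical safeguard against underflow that the text above does not mention. A brute-force sum over all $K^T$ state paths is included only to check the recursion on a tiny example.

```python
import math
from itertools import product

def forward_loglik(pi0, P, dens):
    """Log of sum_k alpha_{Tk} for one unit, via the scaled forward recursion.

    pi0[k]     : initial probability pi_k
    P[h][k]    : transition probability pi_{k|h} (from state h to state k)
    dens[t][k] : f(y_t | b_t = b_k), assumed precomputed
    """
    K = len(pi0)
    alpha = [pi0[k] * dens[0][k] for k in range(K)]
    scale = sum(alpha)
    loglik = math.log(scale)
    alpha = [a / scale for a in alpha]
    for t in range(1, len(dens)):
        alpha = [
            sum(alpha[h] * P[h][k] for h in range(K)) * dens[t][k]
            for k in range(K)
        ]
        scale = sum(alpha)          # rescale to avoid underflow
        loglik += math.log(scale)   # accumulate the log of the scale factors
        alpha = [a / scale for a in alpha]
    return loglik

def brute_force_loglik(pi0, P, dens):
    # Direct sum over all K^T state paths; feasible only for tiny K and T.
    K, T = len(pi0), len(dens)
    total = 0.0
    for path in product(range(K), repeat=T):
        p = pi0[path[0]] * dens[0][path[0]]
        for t in range(1, T):
            p *= P[path[t - 1]][path[t]] * dens[t][path[t]]
        total += p
    return math.log(total)

# Toy example with K = 2 states and T = 4 occasions (hypothetical values).
pi0 = [0.6, 0.4]
P = [[0.9, 0.1], [0.2, 0.8]]
dens = [[0.5, 0.1], [0.4, 0.3], [0.05, 0.6], [0.2, 0.7]]
ll_forward = forward_loglik(pi0, P, dens)
ll_brute = brute_force_loglik(pi0, P, dens)
```

The recursion needs on the order of $TK^2$ operations per unit instead of $K^T$; summing `forward_loglik` over the units gives $\ell ({\varvec{\lambda }})$.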

Let us further define

$$\tau _{itk} = f({\mathbf{y}}_{it+1},\ldots ,{\mathbf{y}}_{iT}\mid {\mathbf{b}}_{it}={\mathbf{b}}_k),$$

i.e. the density of the partial sequence $({\mathbf{y}}_{it+1},\ldots ,{\mathbf{y}}_{iT})$ given that the i-th unit is in state k at time t. The backward recursion is given by

$$\tau _{iTk}=1$$

$$\tau _{ith} = \sum _{k=1}^K\pi _{k|h} f({\mathbf{y}}_{it+1}\mid {\mathbf{b}}_{it+1}={\mathbf{b}}_k)\tau _{it+1k}.$$

We can express $\hat{\xi }_{itk}$ and $\hat{\zeta }_{ithk}$ in terms of forward and backward variables by

$$\hat{\xi }_{itk} = \frac{\alpha _{itk}\tau _{itk}}{\sum _{h=1}^K\alpha _{ith}\tau _{ith}},$$

$$\begin{aligned} \hat{\zeta }_{ithk} = \frac{\alpha _{it-1h}\pi _{k|h} f({\mathbf{y}}_{it}\mid {\mathbf{b}}_{it}={\mathbf{b}}_k)\tau _{itk}}{\sum _{h,k=1}^K\alpha _{it-1h}\pi _{k\mid h} f({\mathbf{y}}_{it}\mid {\mathbf{b}}_{it}={\mathbf{b}}_k)\tau _{itk}}. \end{aligned}$$
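
A compact Python sketch of the backward recursion and of the two posterior quantities is given below, for a single unit. It is illustrative rather than the authors' code: the densities are again assumed to be precomputed in a hypothetical table `dens`, the toy values are made up, and the recursions are left unscaled for readability, which is adequate only for short series.

```python
def forward(pi0, P, dens):
    # alpha[t][k] = f(y_1, ..., y_t, state_t = k)
    K = len(pi0)
    alpha = [[pi0[k] * dens[0][k] for k in range(K)]]
    for t in range(1, len(dens)):
        alpha.append([
            sum(alpha[t - 1][h] * P[h][k] for h in range(K)) * dens[t][k]
            for k in range(K)
        ])
    return alpha

def backward(P, dens):
    # tau[t][h] = f(y_{t+1}, ..., y_T | state_t = h), with tau[T-1][h] = 1
    K, T = len(P), len(dens)
    tau = [[1.0] * K for _ in range(T)]
    for t in range(T - 2, -1, -1):
        for h in range(K):
            tau[t][h] = sum(
                P[h][k] * dens[t + 1][k] * tau[t + 1][k] for k in range(K)
            )
    return tau

def posteriors(pi0, P, dens):
    K, T = len(pi0), len(dens)
    alpha, tau = forward(pi0, P, dens), backward(P, dens)
    xi = []  # xi[t][k]: posterior probability of state k at time t
    for t in range(T):
        w = [alpha[t][k] * tau[t][k] for k in range(K)]
        s = sum(w)
        xi.append([v / s for v in w])
    zeta = []  # zeta[t-1][h][k]: posterior probability of the move h -> k at t
    for t in range(1, T):
        w = [
            [alpha[t - 1][h] * P[h][k] * dens[t][k] * tau[t][k] for k in range(K)]
            for h in range(K)
        ]
        s = sum(sum(row) for row in w)
        zeta.append([[v / s for v in row] for row in w])
    return xi, zeta

# Toy example with K = 2 states and T = 4 occasions (hypothetical values).
pi0 = [0.6, 0.4]
P = [[0.9, 0.1], [0.2, 0.8]]
dens = [[0.5, 0.1], [0.4, 0.3], [0.05, 0.6], [0.2, 0.7]]
xi, zeta = posteriors(pi0, P, dens)
```

A useful sanity check is that summing `zeta[t-1][h][k]` over `h` reproduces `xi[t][k]`, and summing it over `k` reproduces `xi[t-1][h]`.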

Finally, we have

$$\begin{aligned} {\mathtt{E}}(r_{itk}\mid \cdot )=\hat{r}_{itk} = {\mathbf{u}}_{it}{\varvec{\mu }}_{itk}+\frac{{\varvec{{{\varPhi}}}} ({\mathbf{u}}_{it}{\varvec{\mu }}_{itk})}{\phi ({\mathbf{u}}_{it}{\varvec{\mu }}_{itk})+{\mathbf{u}}_{it}{\varvec{\mu }}_{itk}{\varvec{{{\varPhi}}}} ({\mathbf{u}}_{it}{\varvec{\mu }}_{itk})}, \end{aligned}$$

where ${\varvec{\mu }}_{itk} = (\mu _{it1k},\mu _{it2k})$ is the state-specific vector of projections’ means, and $\phi (\cdot )$ and ${\varvec{{{\varPhi}}}} (\cdot )$ are the pdf and cdf of the standard normal distribution, evaluated at the scalar ${\mathbf{u}}_{it}{\varvec{\mu }}_{itk}$.
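
The conditional expectation above is straightforward to evaluate. The Python sketch below is a hedged illustration: it assumes that ${\mathbf{u}}_{it}$ is the unit vector of the observed direction, that ${\mathbf{u}}_{it}{\varvec{\mu }}_{itk}$ denotes the scalar product, and that $\phi$ and ${\varvec{{{\varPhi}}}}$ are the univariate standard normal pdf and cdf applied to that scalar. The function names are hypothetical.

```python
import math

def std_normal_pdf(m):
    return math.exp(-0.5 * m * m) / math.sqrt(2.0 * math.pi)

def std_normal_cdf(m):
    return 0.5 * (1.0 + math.erf(m / math.sqrt(2.0)))

def expected_length(u, mu):
    """E(r | .) = m + Phi(m) / (phi(m) + m * Phi(m)), with m = u . mu.

    u  : unit vector of the observed direction (assumed)
    mu : state-specific mean vector (mu_1, mu_2)
    """
    m = u[0] * mu[0] + u[1] * mu[1]
    big_phi = std_normal_cdf(m)
    return m + big_phi / (std_normal_pdf(m) + m * big_phi)

r_zero_mean = expected_length((1.0, 0.0), (0.0, 0.0))
r_large_mean = expected_length((1.0, 0.0), (5.0, 0.0))
```

With a zero mean vector the expression reduces to $\sqrt{\pi /2}$, and for a large positive scalar product $m$ it approaches $m + 1/m$. For strongly negative $m$ the denominator becomes very small, so a production implementation would need a more careful evaluation than this direct formula.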

About this article

Cite this article

Maruotti, A., Punzo, A., Mastrantonio, G. et al. A time-dependent extension of the projected normal regression model for longitudinal circular data based on a hidden Markov heterogeneity structure. Stoch Environ Res Risk Assess 30, 1725–1740 (2016). https://doi.org/10.1007/s00477-015-1183-5

Issue Date: August 2016
DOI: https://doi.org/10.1007/s00477-015-1183-5
