Skip to main content
Log in

A Multivariate Hidden Markov Model for the Identification of Sea Regimes from Incomplete Skewed and Circular Time Series

  • Published:
Journal of Agricultural, Biological, and Environmental Statistics Aims and scope Submit manuscript

Abstract

The identification of sea regimes from environmental multivariate times series is complicated by the mixed linear–circular support of the data, by the occurrence of missing values, by the skewness of some variables, and by the temporal autocorrelation of the measurements. We address these issues simultaneously by a hidden Markov approach, and segment the data into pairs of toroidal and skew-elliptical clusters by means of the inferred sequence of latent states. Toroidal clusters are defined by a class of bivariate von Mises densities, while skew-elliptical clusters are defined by mixed linear models with positive random effects. The core of the classification procedure is an EM algorithm accounting for missing measurements, unknown cluster membership, and random effects as different sources of incomplete information. Moreover, standard simulation routines allow for the efficient computation of bootstrap standard errors. The proposed procedure is illustrated for a multivariate marine time series, and identifies a number of wintertime regimes in the Adriatic Sea.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Similar content being viewed by others

References

  • Ailliot, P., and Monbet, V. (2012), “Markov-Switching Autoregressive Models for Wind Time Series,” Environmental Modelling & Software, 30, 92–101.

    Article  Google Scholar 

  • Barbu, V., and Limnios, N. (2005), “Maximum Likelihood Estimation for Hidden Semi-Markov Models,” Comptes Rendus Mathematique, 342, 201–205.

    Article  MathSciNet  MATH  Google Scholar 

  • Baudry, J.-P., Raftery, A. E., Celeux, G., Lo, K., and Gottardo, R. (2010), “Combining Mixture Components for Clustering,” Journal of Computational and Graphical Statistics, 19, 332–353.

    Article  MathSciNet  Google Scholar 

  • Bertotti, L., and Cavalieri, L. (2009), “Wind and Wave Predictions in the Adriatic Sea,” Journal of Marine Systems, 78, S227–S234.

    Article  Google Scholar 

  • Biernacki, C., Celeux, G., and Govaert, G. (2000), “Assessing a Mixture Model for Clustering With the Integrated Completed Likelihood,” IEEE Transactions on Pattern Analysis and Machine Intelligence, 22, 719–725.

    Article  Google Scholar 

  • — (2003), “Choosing Starting Values for the EM Algorithm for Getting the Highest Likelihood in Multivariate Gaussian Mixture Models,” Computational Statistics & Data Analysis, 41, 561–575.

    Article  MathSciNet  Google Scholar 

  • Bulla, I., Bulla, J., and Nenadić, O. (2010), “hsmm—An R Package for Analyzing Hidden Semi-Markov Models,” Computational Statistics & Data Analysis, 54, 611–619.

    Article  MathSciNet  MATH  Google Scholar 

  • Bulla, J., and Berzel, A. (2008), “Computational Issues in Parameter Estimation for Stationary Hidden Markov Models,” Computational Statistics, 23, 1–18.

    Article  MathSciNet  MATH  Google Scholar 

  • Cabral, C. R. B., Lachos, V. H., and Prates, M. O. (2012), “Multivariate Mixture Modeling Using Skew-Normal Independent Distributions,” Computational Statistics & Data Analysis, 56, 126–142.

    Article  MathSciNet  MATH  Google Scholar 

  • Caires, S., and Sterl, A. (2005), “A New Non-Parametric Method to Correct Model Data: Application to Significant Wave Height From the ERA-40 Reanalysis,” Journal of Atmospheric and Oceanic Technology, 22, 443–459.

    Article  Google Scholar 

  • Cappé, O., Moulines, E., and Rydén, T. (2005), Inference in Hidden Markov Models, Berlin: Springer.

    MATH  Google Scholar 

  • Faltinsen, O. (1990), Sea Loads on Ships and Offshore Structures, Cambridge: Cambridge University Press.

    Google Scholar 

  • Hamilton, L. (2010), “Characterising Spectral Sea Wave Conditions With Statistical Clustering of Actual Spectra,” Applied Ocean Research, 32, 332–342.

    Article  Google Scholar 

  • Holzmann, H., Munk, A., Suster, M., and Zucchini, W. (2006), “Hidden Markov Models for Circular and Linear–Circular Time Series,” Environmental and Ecological Statistics, 13, 325–347. doi:10.1007/s10651-006-0015-7.

    Article  MathSciNet  Google Scholar 

  • Huang, G., Wing-Keung Law, A., and Huang, Z. (2011), “Wave-Induced Drift of Small Floating Objects in Regular Waves,” Ocean Engineering, 38, 712–718.

    Article  Google Scholar 

  • Ingrassia, S., and Rocci, R. (2011), “Degeneracy of the EM Algorithm for the MLE of Multivariate Gaussian Mixtures and Dynamic Constraints,” Computational Statistics & Data Analysis, 55, 1715–1725.

    Article  MathSciNet  Google Scholar 

  • Izquierdo, P., and Guedes Soares, C. (2005), “Analysis of Sea Waves and Wind From X-Band Radar,” Ocean Engineering, 32, 1404–1419.

    Article  Google Scholar 

  • Jin, K.-R., and Ji, Z.-G. (2004), “Case Study: Modeling of Sediment Transport and Wind–Wave Impact in Lake Okeechobee,” Journal of Hydraulic Engineering, 130, 1055–1067.

    Article  Google Scholar 

  • Kato, S., and Shimizu, K. (2008), “Dependent Models for Observations Which Include Angular Ones,” Journal of Statistical Planning and Inference, 138, 3538–3549. Special Issue in Honor of Junjiro Ogawa (1915–2000): Design of Experiments, Multivariate Analysis and Statistical Inference.

    Article  MathSciNet  MATH  Google Scholar 

  • Lagona, F., and Picone, M. (2011), “A Latent-Class Model for Clustering Incomplete Linear and Circular Data in Marine Studies,” Journal of Data Science, 9, 585–605

    MathSciNet  Google Scholar 

  • — (2012a), “Maximum Likelihood Estimation of Bivariate Circular Hidden Markov Models From Incomplete Data,” Journal of Statistical Computation and Simulation. Available online at http://www.tandfonline.com/doi/pdf/10.1080/00949655.2012.656642.

  • — (2012b), “Model-Based Clustering of Multivariate Skew Data With Circular Components and Missing Values,” Journal of Applied Statistics, 39, 927–945.

    Article  MathSciNet  Google Scholar 

  • Lange, K., and Weeks, D. E. (1989), “Efficient Computation of LOD Scores: Genotype Elimination, Genotype Redefinition, and Hybrid Maximum Likelihood Algorithms,” Annals of Human Genetics, 53, 67–83.

    Article  MathSciNet  MATH  Google Scholar 

  • Langrock, R., and Zucchini, W. (2011), “Hidden Markov Models With Arbitrary State Dwell-Time Distributions,” Computational Statistics & Data Analysis, 55, 715–724.

    Article  MathSciNet  MATH  Google Scholar 

  • Lin, T. I. (2009), “Maximum Likelihood Estimation for Multivariate Skew Normal Mixture Models,” Journal of Multivariate Analysis, 100, 257–265.

    Article  MathSciNet  MATH  Google Scholar 

  • Marchenko, Y. V., and Genton, M. G. (2010), “Multivariate Log-Skew-Elliptical Distributions With Applications to Precipitation Data,” Environmetrics, 21, 318–340.

    Article  MathSciNet  Google Scholar 

  • Mardia, K., Taylor, C., and Subramaniam, G. (2007), “Protein Bioinformatics and Mixtures of Bivariate von Mises Distributions for Angular Data,” Biometrics, 63, 505–512.

    Article  MathSciNet  MATH  Google Scholar 

  • Mardia, K. V., Hughes, G., Taylor, C. C., and Singh, H. (2008), “A Multivariate von Mises Distribution With Applications to Bioinformatics,” Canadian Journal of Statistics, 36, 99–109.

    Article  MathSciNet  MATH  Google Scholar 

  • Maruotti, A. (2011), “Mixed Hidden Markov Models for Longitudinal Data: An Overview,” International Statistical Review, 79, 427–454.

    Article  MATH  Google Scholar 

  • McLachlan, G., and Peel, D. (2000), Finite Mixture Models, New York: Wiley.

    MATH  Google Scholar 

  • Monbet, V., Ailliot, P., and Prevosto, M. (2007), “Survey of Stochastic Models for Wind and Sea-State Time Series,” Probabilistic Engineering Mechanics, 22, 113–126.

    Article  Google Scholar 

  • Pleskachevsky, A., Eppel, D., and Kapitza, H. (2009), “Interaction of Waves, Currents and Tides, and Wave-Energy Impact on the Beach Area of Sylt Island,” Ocean Dynamics, 59, 451–461.

    Article  Google Scholar 

  • Redner, R. A., and Walker, H. F. (1984), “Mixture Densities, Maximum Likelihood and the EM Algorithm,” SIAM Review, 26, 195–239.

    Article  MathSciNet  MATH  Google Scholar 

  • Reikard, G., and Rogers, W. E. (2011), “Forecasting Ocean Waves: Comparing a Physics-Based Model With Statistical Models,” Coastal Engineering, 58, 409–416.

    Article  Google Scholar 

  • Sahu, S., Dey, D., and Branco, M. (2003), “A New Class of Multivariate Skew Distributions With Applications to Bayesian Regression Models,” Canadian Journal of Statistics, 31, 129–150.

    Article  MathSciNet  MATH  Google Scholar 

  • Singh, H., Hnizdo, V., and Demchuk, E. (2002), “Probabilistic Model for Two Dependent Circular Variables,” Biometrika, 89 (3), 719–723.

    Article  MathSciNet  MATH  Google Scholar 

  • Teicher, H. (1967), “Identifiability of Mixtures of Product Measures,” Annals of Mathematical Statistics, 38, 1300–1302.

    Article  MathSciNet  MATH  Google Scholar 

  • Visser, I., Raijmakers, M., and Molenaar, P. (2000), “Confidence Intervals for Hidden Markov Model Parameters,” British Journal of Mathematical & Statistical Psychology, 53, 317–327.

    Article  Google Scholar 

  • Visser, I., Raijmakers, M. E. J., and Molenaar, P. C. M. (2002), “Fitting Hidden Markov Models to Psychological Data,” Scientific Programming, 10, 185–199.

    Google Scholar 

  • Wu, C. (1983), “On the Convergence Properties of the EM Algorithm,” Annals of Statistics, 11, 95–103.

    Article  MathSciNet  MATH  Google Scholar 

  • Yakowitz, S., and Spragins, J. (1968), “On the Identifiability of Finite Mixtures,” Annals of Mathematical Statistics, 39, 209–214.

    Article  MathSciNet  MATH  Google Scholar 

  • Zhang, Q., Snow Jones, A., Rijmen, F., and Ip, E. (2010), “Multivariate Discrete Hidden Markov Models for Domain-Based Measurements and Assessment of Risk Factors in Child Development,” Journal of Computational and Graphical Statistics, 19, 746–765.

    Article  MathSciNet  Google Scholar 

  • Zucchini, W., and Guttorp, P. (1991), “A Hidden Markov Model for Space-Time Precipitation,” Water Resources Research, 27, 1917–1923.

    Article  Google Scholar 

  • Zucchini, W., and MacDonald, I. (2009), Hiddden Markov Models for Time Series: An Introduction Using R, London: Chapman & Hall.

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to F. Lagona.

Rights and permissions

Reprints and permissions

About this article

Cite this article

Bulla, J., Lagona, F., Maruotti, A. et al. A Multivariate Hidden Markov Model for the Identification of Sea Regimes from Incomplete Skewed and Circular Time Series. JABES 17, 544–567 (2012). https://doi.org/10.1007/s13253-012-0110-1

Download citation

  • Received:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s13253-012-0110-1

Key Words

Navigation