Abstract
The identification of sea regimes from environmental multivariate times series is complicated by the mixed linear–circular support of the data, by the occurrence of missing values, by the skewness of some variables, and by the temporal autocorrelation of the measurements. We address these issues simultaneously by a hidden Markov approach, and segment the data into pairs of toroidal and skew-elliptical clusters by means of the inferred sequence of latent states. Toroidal clusters are defined by a class of bivariate von Mises densities, while skew-elliptical clusters are defined by mixed linear models with positive random effects. The core of the classification procedure is an EM algorithm accounting for missing measurements, unknown cluster membership, and random effects as different sources of incomplete information. Moreover, standard simulation routines allow for the efficient computation of bootstrap standard errors. The proposed procedure is illustrated for a multivariate marine time series, and identifies a number of wintertime regimes in the Adriatic Sea.
Similar content being viewed by others
References
Ailliot, P., and Monbet, V. (2012), “Markov-Switching Autoregressive Models for Wind Time Series,” Environmental Modelling & Software, 30, 92–101.
Barbu, V., and Limnios, N. (2005), “Maximum Likelihood Estimation for Hidden Semi-Markov Models,” Comptes Rendus Mathematique, 342, 201–205.
Baudry, J.-P., Raftery, A. E., Celeux, G., Lo, K., and Gottardo, R. (2010), “Combining Mixture Components for Clustering,” Journal of Computational and Graphical Statistics, 19, 332–353.
Bertotti, L., and Cavalieri, L. (2009), “Wind and Wave Predictions in the Adriatic Sea,” Journal of Marine Systems, 78, S227–S234.
Biernacki, C., Celeux, G., and Govaert, G. (2000), “Assessing a Mixture Model for Clustering With the Integrated Completed Likelihood,” IEEE Transactions on Pattern Analysis and Machine Intelligence, 22, 719–725.
— (2003), “Choosing Starting Values for the EM Algorithm for Getting the Highest Likelihood in Multivariate Gaussian Mixture Models,” Computational Statistics & Data Analysis, 41, 561–575.
Bulla, I., Bulla, J., and Nenadić, O. (2010), “hsmm—An R Package for Analyzing Hidden Semi-Markov Models,” Computational Statistics & Data Analysis, 54, 611–619.
Bulla, J., and Berzel, A. (2008), “Computational Issues in Parameter Estimation for Stationary Hidden Markov Models,” Computational Statistics, 23, 1–18.
Cabral, C. R. B., Lachos, V. H., and Prates, M. O. (2012), “Multivariate Mixture Modeling Using Skew-Normal Independent Distributions,” Computational Statistics & Data Analysis, 56, 126–142.
Caires, S., and Sterl, A. (2005), “A New Non-Parametric Method to Correct Model Data: Application to Significant Wave Height From the ERA-40 Reanalysis,” Journal of Atmospheric and Oceanic Technology, 22, 443–459.
Cappé, O., Moulines, E., and Rydén, T. (2005), Inference in Hidden Markov Models, Berlin: Springer.
Faltinsen, O. (1990), Sea Loads on Ships and Offshore Structures, Cambridge: Cambridge University Press.
Hamilton, L. (2010), “Characterising Spectral Sea Wave Conditions With Statistical Clustering of Actual Spectra,” Applied Ocean Research, 32, 332–342.
Holzmann, H., Munk, A., Suster, M., and Zucchini, W. (2006), “Hidden Markov Models for Circular and Linear–Circular Time Series,” Environmental and Ecological Statistics, 13, 325–347. doi:10.1007/s10651-006-0015-7.
Huang, G., Wing-Keung Law, A., and Huang, Z. (2011), “Wave-Induced Drift of Small Floating Objects in Regular Waves,” Ocean Engineering, 38, 712–718.
Ingrassia, S., and Rocci, R. (2011), “Degeneracy of the EM Algorithm for the MLE of Multivariate Gaussian Mixtures and Dynamic Constraints,” Computational Statistics & Data Analysis, 55, 1715–1725.
Izquierdo, P., and Guedes Soares, C. (2005), “Analysis of Sea Waves and Wind From X-Band Radar,” Ocean Engineering, 32, 1404–1419.
Jin, K.-R., and Ji, Z.-G. (2004), “Case Study: Modeling of Sediment Transport and Wind–Wave Impact in Lake Okeechobee,” Journal of Hydraulic Engineering, 130, 1055–1067.
Kato, S., and Shimizu, K. (2008), “Dependent Models for Observations Which Include Angular Ones,” Journal of Statistical Planning and Inference, 138, 3538–3549. Special Issue in Honor of Junjiro Ogawa (1915–2000): Design of Experiments, Multivariate Analysis and Statistical Inference.
Lagona, F., and Picone, M. (2011), “A Latent-Class Model for Clustering Incomplete Linear and Circular Data in Marine Studies,” Journal of Data Science, 9, 585–605
— (2012a), “Maximum Likelihood Estimation of Bivariate Circular Hidden Markov Models From Incomplete Data,” Journal of Statistical Computation and Simulation. Available online at http://www.tandfonline.com/doi/pdf/10.1080/00949655.2012.656642.
— (2012b), “Model-Based Clustering of Multivariate Skew Data With Circular Components and Missing Values,” Journal of Applied Statistics, 39, 927–945.
Lange, K., and Weeks, D. E. (1989), “Efficient Computation of LOD Scores: Genotype Elimination, Genotype Redefinition, and Hybrid Maximum Likelihood Algorithms,” Annals of Human Genetics, 53, 67–83.
Langrock, R., and Zucchini, W. (2011), “Hidden Markov Models With Arbitrary State Dwell-Time Distributions,” Computational Statistics & Data Analysis, 55, 715–724.
Lin, T. I. (2009), “Maximum Likelihood Estimation for Multivariate Skew Normal Mixture Models,” Journal of Multivariate Analysis, 100, 257–265.
Marchenko, Y. V., and Genton, M. G. (2010), “Multivariate Log-Skew-Elliptical Distributions With Applications to Precipitation Data,” Environmetrics, 21, 318–340.
Mardia, K., Taylor, C., and Subramaniam, G. (2007), “Protein Bioinformatics and Mixtures of Bivariate von Mises Distributions for Angular Data,” Biometrics, 63, 505–512.
Mardia, K. V., Hughes, G., Taylor, C. C., and Singh, H. (2008), “A Multivariate von Mises Distribution With Applications to Bioinformatics,” Canadian Journal of Statistics, 36, 99–109.
Maruotti, A. (2011), “Mixed Hidden Markov Models for Longitudinal Data: An Overview,” International Statistical Review, 79, 427–454.
McLachlan, G., and Peel, D. (2000), Finite Mixture Models, New York: Wiley.
Monbet, V., Ailliot, P., and Prevosto, M. (2007), “Survey of Stochastic Models for Wind and Sea-State Time Series,” Probabilistic Engineering Mechanics, 22, 113–126.
Pleskachevsky, A., Eppel, D., and Kapitza, H. (2009), “Interaction of Waves, Currents and Tides, and Wave-Energy Impact on the Beach Area of Sylt Island,” Ocean Dynamics, 59, 451–461.
Redner, R. A., and Walker, H. F. (1984), “Mixture Densities, Maximum Likelihood and the EM Algorithm,” SIAM Review, 26, 195–239.
Reikard, G., and Rogers, W. E. (2011), “Forecasting Ocean Waves: Comparing a Physics-Based Model With Statistical Models,” Coastal Engineering, 58, 409–416.
Sahu, S., Dey, D., and Branco, M. (2003), “A New Class of Multivariate Skew Distributions With Applications to Bayesian Regression Models,” Canadian Journal of Statistics, 31, 129–150.
Singh, H., Hnizdo, V., and Demchuk, E. (2002), “Probabilistic Model for Two Dependent Circular Variables,” Biometrika, 89 (3), 719–723.
Teicher, H. (1967), “Identifiability of Mixtures of Product Measures,” Annals of Mathematical Statistics, 38, 1300–1302.
Visser, I., Raijmakers, M., and Molenaar, P. (2000), “Confidence Intervals for Hidden Markov Model Parameters,” British Journal of Mathematical & Statistical Psychology, 53, 317–327.
Visser, I., Raijmakers, M. E. J., and Molenaar, P. C. M. (2002), “Fitting Hidden Markov Models to Psychological Data,” Scientific Programming, 10, 185–199.
Wu, C. (1983), “On the Convergence Properties of the EM Algorithm,” Annals of Statistics, 11, 95–103.
Yakowitz, S., and Spragins, J. (1968), “On the Identifiability of Finite Mixtures,” Annals of Mathematical Statistics, 39, 209–214.
Zhang, Q., Snow Jones, A., Rijmen, F., and Ip, E. (2010), “Multivariate Discrete Hidden Markov Models for Domain-Based Measurements and Assessment of Risk Factors in Child Development,” Journal of Computational and Graphical Statistics, 19, 746–765.
Zucchini, W., and Guttorp, P. (1991), “A Hidden Markov Model for Space-Time Precipitation,” Water Resources Research, 27, 1917–1923.
Zucchini, W., and MacDonald, I. (2009), Hiddden Markov Models for Time Series: An Introduction Using R, London: Chapman & Hall.
Author information
Authors and Affiliations
Corresponding author
Rights and permissions
About this article
Cite this article
Bulla, J., Lagona, F., Maruotti, A. et al. A Multivariate Hidden Markov Model for the Identification of Sea Regimes from Incomplete Skewed and Circular Time Series. JABES 17, 544–567 (2012). https://doi.org/10.1007/s13253-012-0110-1
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s13253-012-0110-1