Computational issues in parameter estimation for stationary hidden Markov models

Bulla, Jan; Berzel, Andreas

doi:10.1007/s00180-007-0063-y

Computational issues in parameter estimation for stationary hidden Markov models

Original Paper
Published: 13 July 2007

Volume 23, pages 1–18, (2008)
Cite this article

Computational Statistics Aims and scope Submit manuscript

Jan Bulla¹ &
Andreas Berzel¹

392 Accesses
52 Citations
Explore all metrics

Abstract

The parameters of a hidden Markov model (HMM) can be estimated by numerical maximization of the log-likelihood function or, more popularly, using the expectation–maximization (EM) algorithm. In its standard implementation the latter is unsuitable for fitting stationary hidden Markov models (HMMs). We show how it can be modified to achieve this. We propose a hybrid algorithm that is designed to combine the advantageous features of the two algorithms and compare the performance of the three algorithms using simulated data from a designed experiment, and a real data set. The properties investigated are speed of convergence, stability, dependence on initial values, different parameterizations. We also describe the results of an experiment to assess the true coverage probability of bootstrap-based confidence intervals for the parameters.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

References

Archer G, Titterington D (2002) Parameter estimation for hidden Markov chains. J Stat Plann Inference 108(1–2):365–390
Article MATH MathSciNet Google Scholar
Baum L, Petrie T, Soules G, Weiss N (1970) A maximization technique occurring in the statistical analysis of probabilistic functions of Markov chains. Ann Math Stat 41(1):164–171
Article MathSciNet Google Scholar
Böhning D (2000) Computer assisted analysis of mixtures and applications. Meta-analysis, disease mapping and others. In: Monographs on statistics and applied probability, vol 81. Chapman & Hall/ CRC, London
Campillo F, Le Gland F (1989) MLE for partially observed diffusions: direct maximization vs. the EM algorithm, Stoch Processes Appl 33(2):245–274
Article MATH MathSciNet Google Scholar
Dempster A, Laird N, Rubin D (1977) Maximum likelihood from incomplete data via the EM algorithm. Discussion. J R Stat Soc Ser B Stat Methodol 39(1):1–38
MATH MathSciNet Google Scholar
Dennis JE, Moré JJ (1977) Quasi-Newton methods, motivation and theory. SIAM Rev 19:46–89
Article MATH MathSciNet Google Scholar
Dunmur A, Titterington D (1998) The influence of initial conditions on maximum likelihood estimation of the parameters of a binary hidden Markov model, Stat Probab Lett 40(1):67–73
MATH MathSciNet Google Scholar
Efron B, Tibshirani RJ (1993) An introduction to the bootstrap. In: Monographs on statistics and applied probability, vol 57. Chapman & Hall, New York
Hathaway RJ (1986) A constrained EM algorithm for univariate normal mixtures. J Stat Comput Simul 23(3):211–230
Article Google Scholar
Jamshidian M, Jennrich RI (1997) Acceleration of the EM algorithm by using quasi-Newton methods. J R Stat Soc Ser B Stat Methodol 59(3):569–587
Article MATH MathSciNet Google Scholar
Lange K (1995) A quasi-Newton acceleration of the EM algorithm. Stat Sin 5(1):1–18
MATH Google Scholar
Lange K, Weeks D (1989) Efficient computation of lod scores: Genotype elimination, genotype redefinition, and hybrid maximum likelihood algorithms. Ann Hum Genet 53(1):67–83
Article MATH MathSciNet Google Scholar
Liporace LA (1982) Maximum likelihood estimation for multivariate observations of Markov sources. IEEE Trans Inf Theory 28(5):729–734
Article MATH MathSciNet Google Scholar
MacDonald IL, Zucchini W (1997) Hidden Markov and other models for discrete-valued time series. In: Monographs on statistics and applied probability, vol 70. Chapman & Hall, London
Matsumoto M, Nishimura T (1998) Mersenne twister: a 623-dimensionally equidistributed uniform pseudo-random number generator. ACM Trans Model Comput Simul 8(1):3–30
Article MATH Google Scholar
Nelder J, Mead R (1965) A simplex method for function minimization. Computer J. 7(4):308–313
Google Scholar
Nityasuddhi D, Böhning D (2003) Asymptotic properties of the EM algorithm estimate for normal mixture models with component specific variances. Comput Stat Data Anal 41(3–4):591–601
Article Google Scholar
R Development Core Team (2004) R: a language and environment for statistical computing, R Foundation for Statistical Computing, Vienna, Austria. URL: http://www.R-project.org
Rabiner L (1989) A tutorial on hidden Markov models and selected applications in speech recognition. IEEE Trans Inf Theory 77(2):257–284
MathSciNet Google Scholar
Redner RA, Walker HF (1984) Mixture densities, maximum likelihood and the EM algorithm. SIAM Rev 26(2):195–239
Article MATH MathSciNet Google Scholar
Robert CP, Mengersen KL (1996) Testing for mixtures: a bayesian entropic approach. Bayesian Statistics 5: Proceedings of the Fifth Valencia International Meeting, pp 255–276
Robert CP, Mengersen KL (1999) Reparameterisation issues in mixture modelling and their bearing on MCMC algorithms. Comput Stat Data Anal 29(3):325–343
Article MATH Google Scholar
Robert CP, Titterington DM (1998) Reparameterization strategies for hidden Markov models and Bayesian approaches to maximum likelihood estimation. Stat Comput 8(2):145–158
Article Google Scholar
Schnabel RB, Koontz JE, Weiss BE (1985) A modular system of algorithms for unconstrained minimization. ACM Trans Math Softw 11(4):419–440
Article MATH MathSciNet Google Scholar
Visser I, Raijmakers MEJ, Molenaar PCM (2000) Confidence intervals for hidden Markov model parameters. Br J Math Stat Psychol 53(2):17–327
Article Google Scholar
Wang P, Puterman ML (2001) Analysis of longitudinal data of epileptic seizure counts—a two-state hidden Markov regression approach. Biom J 43(8):941–962
Article MATH MathSciNet Google Scholar
Wu C (1983). On the convergence properties of the EM algorithm. Ann Stat 11(1): 95–103
Article MATH Google Scholar
Zucchini W, MacDonald IL (1998) Hidden Markov Time Series Models: Some Computational Issues. In: Weisenberg (ed) Computational Science and Statistics, vol 30, Interface foundation of America, pp. 157–163

Download references

Author information

Authors and Affiliations

Institute for Statistics and Econometrics/Center for Statistics, Georg-August-Universität Göttingen, Platz der Göttinger Sieben 5, 37073, Göttingen, Germany
Jan Bulla & Andreas Berzel

Authors

Jan Bulla
View author publications
You can also search for this author in PubMed Google Scholar
Andreas Berzel
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Jan Bulla.

Rights and permissions

Reprints and permissions

About this article

Cite this article

Bulla, J., Berzel, A. Computational issues in parameter estimation for stationary hidden Markov models. Computational Statistics 23, 1–18 (2008). https://doi.org/10.1007/s00180-007-0063-y

Download citation

Accepted: 20 September 2006
Published: 13 July 2007
Issue Date: January 2008
DOI: https://doi.org/10.1007/s00180-007-0063-y

Keywords

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Computational issues in parameter estimation for stationary hidden Markov models

Abstract

Access this article

Similar content being viewed by others

S-estimation of hidden Markov models

Parameter Estimation for Continuous Time Hidden Markov Processes

An expectation maximization algorithm for the hidden markov models with multiparameter student-t observations

References

Author information

Authors and Affiliations

Corresponding author

Rights and permissions

About this article

Cite this article

Keywords

Navigation

Computational issues in parameter estimation for stationary hidden Markov models

Abstract

Access this article

Similar content being viewed by others

S-estimation of hidden Markov models

Parameter Estimation for Continuous Time Hidden Markov Processes

An expectation maximization algorithm for the hidden markov models with multiparameter student-t observations

References

Author information

Authors and Affiliations

Corresponding author

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Search

Navigation