Skip to main content
Log in

Curve prediction and clustering with mixtures of Gaussian process functional regression models

  • Published:
Statistics and Computing Aims and scope Submit manuscript

Abstract

Shi, Wang, Murray-Smith and Titterington (Biometrics 63:714–723, 2007) proposed a Gaussian process functional regression (GPFR) model to model functional response curves with a set of functional covariates. Two main problems are addressed by their method: modelling nonlinear and nonparametric regression relationship and modelling covariance structure and mean structure simultaneously. The method gives very good results for curve fitting and prediction but side-steps the problem of heterogeneity. In this paper we present a new method for modelling functional data with ‘spatially’ indexed data, i.e., the heterogeneity is dependent on factors such as region and individual patient’s information. For data collected from different sources, we assume that the data corresponding to each curve (or batch) follows a Gaussian process functional regression model as a lower-level model, and introduce an allocation model for the latent indicator variables as a higher-level model. This higher-level model is dependent on the information related to each batch. This method takes advantage of both GPFR and mixture models and therefore improves the accuracy of predictions. The mixture model has also been used for curve clustering, but focusing on the problem of clustering functional relationships between response curve and covariates, i.e. the clustering is based on the surface shape of the functional response against the set of functional covariates. The model is examined on simulated data and real data.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Similar content being viewed by others

References

  • Cappé, O., Robert, C.P., Rydén, T.: Reversible jump, birth-and-death and more general continuous time Markov chain Monte Carlo samplers. J. Roy. Stat. Soc. Ser. B 65, 679–700 (2003)

    Article  MATH  Google Scholar 

  • Fan, J., Yao, Q., Cai, Z.: Adaptive varying-coefficient linear models. J. Roy. Stat. Soc. Ser. B 65, 57–80 (2003)

    Article  MathSciNet  MATH  Google Scholar 

  • Faraway, J.: Regression analysis for a functional response. Technometrics 39, 254–261 (1997)

    Article  MathSciNet  MATH  Google Scholar 

  • Faraway, J.: Modelling hand trajectories during reaching motions. Technical report # 383, Department of Statistics, University of Michigan (2001)

  • Fernandez, C., Green, P.: Modelling spatially correlated data via mixtures: a Bayesian approach. J. Roy. Stat. Soc. Ser. B 64, 805–826 (2002)

    Article  MathSciNet  MATH  Google Scholar 

  • Fraley, C., Raftery, A.E.: MCLUST version 3 for R: Normal mixture modeling and model-based clustering. Technical report #504, Department of Statistics, University of Washington (2006)

  • Gaffney, S., Smyth, P.: Curve clustering with random effects regression mixtures. In: Bishop, C., Frey, B. (eds.) Proc. Ninth Inter. Workshop on Artificial Intelligence and Statistics (2003)

  • Green, P.J.: Reversible jump MCMC computation and Bayesian model determination. Biometrika 82, 711–732 (1995)

    Article  MathSciNet  MATH  Google Scholar 

  • Green, P.J., Richardson, S.: Spatially correlated allocation models for count data. Technical report, University of Bristol (2000)

  • Hubert, L., Arabie, P.: Comparing partitions. J. Classif. 2, 193–218 (1985)

    Article  Google Scholar 

  • James, G., Sugar, C.: Clustering for sparsely sampled functional data. J. Am. Stat. Assoc. 98, 397–408 (2003)

    Article  MathSciNet  MATH  Google Scholar 

  • Kamnik, R., Bajd, T., Kralj, A.: Functional electrical stimulation and arm supported sit-to-stand transfer after paraplegia: a study of kinetic parameters. Artif. Organs 23, 413–417 (1999)

    Article  Google Scholar 

  • Kamnik, R., Shi, J.Q., Murray-Smith, R., Bajd, T.: Nonlinear modelling of FES-supported standing up in paraplegia for selection of feedback sensors. IEEE Trans. Neural Syst. Rehabil. Eng. 13, 40–52 (2005)

    Article  Google Scholar 

  • Kennedy, M.C., O’Hagan, A.: Bayesian calibration of computer models (with discussion). J. Roy. Stat. Soc. Ser. B 63, 425–464 (2001)

    Article  MathSciNet  MATH  Google Scholar 

  • Louis, T.A.: Finding the observed information matrix when using the EM algorithm. J. Roy. Stat. Soc. Ser. B 44, 226–233 (1982)

    MathSciNet  MATH  Google Scholar 

  • MacKay, D.J.C.: Introduction to Gaussian processes. Technical report, Cambridge University; available from http://wol.ra.phy.cam.ac.uk/mackay/GP/ (1999)

  • McLachlan, G.J., Peel, D.: Finite Mixture Models. Wiley, New York (2000)

    MATH  Google Scholar 

  • Müller, H.G.: Functional modelling and classification of longitudinal data. Scand. J. Stat. 32, 223–240 (2005)

    Article  MATH  Google Scholar 

  • Ramsay, J.O., Silverman, B.W.: Functional Data Analysis. Springer, New York (1997)

    MATH  Google Scholar 

  • Rand, W.M.: Objective criteria for the evaluation of clustering methods. J. Am. Stat. Assoc. 66, 846–850 (1971)

    Article  Google Scholar 

  • Rice, J.A., Silverman, B.W.: Estimating the mean and covariance nonparametrically when the data are curves. J. Roy. Stat. Soc. Ser. B 53, 233–243 (1991)

    MathSciNet  MATH  Google Scholar 

  • Schwarz, G.: Estimating the dimension of a model. Ann. Stat. 6, 461–464 (1978)

    Article  MATH  Google Scholar 

  • Shi, J.Q., Murray-Smith, R., Titterington, D.M.: Bayesian regression and classification using mixtures of Gaussian process. Int. J. Adapt. Control Signal Process. 17, 149–161 (2003)

    Article  MATH  Google Scholar 

  • Shi, J.Q., Murray-Smith, R., Titterington, D.M.: Hierarchical Gaussian process mixtures for regression. Stat. Comput. 15, 31–41 (2005)

    Article  MathSciNet  Google Scholar 

  • Shi, J.Q., Wang, B., Murray-Smith, R., Titterington, D.M.: Gaussian process functional regression modeling for batch data. Biometrics 63, 714–723 (2007)

    Article  MATH  MathSciNet  Google Scholar 

  • Stephens, M.: Bayesian analysis of mixture models with an unknown number of components–an alternative to reversible jump methods. Ann. Stat., 40–74 (2000)

  • Titterington, D.M., Smith, A.F.M., Makov, U.E.: Statistical Analysis of Finite Mixture Distributions. Wiley, New York (1985)

    MATH  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to J. Q. Shi.

Rights and permissions

Reprints and permissions

About this article

Cite this article

Shi, J.Q., Wang, B. Curve prediction and clustering with mixtures of Gaussian process functional regression models. Stat Comput 18, 267–283 (2008). https://doi.org/10.1007/s11222-008-9055-1

Download citation

  • Received:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s11222-008-9055-1

Keywords

Navigation