, Volume 46, Issue 1, pp 163–175 | Cite as

Multivariate functional clustering and its application to typhoon data

  • Toshihiro MisumiEmail author
  • Hidetoshi Matsui
  • Sadanori Konishi
Original Paper


We propose a multivariate nonlinear mixed effects model for clustering multiple longitudinal data. Advantages of the nonlinear mixed effects model are that it is easy to handle unbalanced data which highly occur in the longitudinal study, and it can take into account associations among longitudinal variables at a given time point. The joint modeling for multivariate longitudinal data, however, requires a high computational cost because numerous parameters are included in the model. To overcome this issue, we perform a pairwise fitting procedure based on a pseudo-likelihood function. Unknown parameters included in each bivariate model are estimated by the maximum likelihood method along with the EM algorithm, and then the number of basis functions included in the model is selected by model selection criteria. After estimating the model, a non-hierarchical clustering algorithm by self-organizing maps is applied to the predicted coefficient vectors of individual specific random effect functions. We present the results of the application of the proposed method to the analysis of data of typhoons that occurred between 2000 and 2017 in Asia.


Cluster analysis Functional data analysis Multivariate longitudinal data Nonlinear mixed effects model Self-organizing maps 


  1. Akaike H (1973) Information theory and an extension of the maximum likelihood principle. In: Petrov B, Csaki F (eds) Proceedings of 2nd international symposium information theory. Akademiai Kiado, Budapest, pp 267–281Google Scholar
  2. Akaike H (1974) A new look at the statistical model identification. IEEE Trans Auto Control 19:716–723MathSciNetCrossRefzbMATHGoogle Scholar
  3. Coffey N, Hinde J, Holian E (2014) Clustering longitudinal profiles using \(P\)-splines and mixed effects models applied to time-course gene expression data. Comput Stat Data Anal 71:14–19MathSciNetCrossRefzbMATHGoogle Scholar
  4. Fieuws S, Verbeke G (2006) Pairwise fitting of mixed models for the joint modeling of multivariate longitudinal profiles. Biometrics 62:424–431MathSciNetCrossRefzbMATHGoogle Scholar
  5. Giacofci M, Lambert-Lacroix S, Marot G, Picard F (2013) Wavelet-based clustering for mixed-effects functional models in high dimension. Biometrics 69:31–40MathSciNetCrossRefzbMATHGoogle Scholar
  6. Guo W (2002) Functional mixed effects models. Biometrics 58:121–128MathSciNetCrossRefzbMATHGoogle Scholar
  7. Hui FK, Müller S, Welsh AH (2017) Sparse pairwise likelihood estimation for multivariate longitudinal mixed models. J Am Stat Assoc. zbMATHGoogle Scholar
  8. Jacques J, Preda C (2014a) Model-based clustering for multivariate functional data. Comput Stat Data Anal 71:92–106MathSciNetCrossRefzbMATHGoogle Scholar
  9. Jacques J, Preda C (2014b) Functional data clustering: a survey. Adv Data Anal Classif 8:231–255MathSciNetCrossRefGoogle Scholar
  10. James G, Sugar C (2003) Clustering for sparsely sampled functional data. J Am Stat Assoc 98:397–408MathSciNetCrossRefzbMATHGoogle Scholar
  11. Kawano S, Konishi S (2007) Nonlinear regression modeling via regularized Gaussian basis functions. Bull Inform Cybern 39:83–96MathSciNetzbMATHGoogle Scholar
  12. Kayano M, Dozono K, Konishi S (2010) Functional cluster analysis via orthonormalized Gaussian basis expansions and its application. J Classif 230:211–230MathSciNetCrossRefzbMATHGoogle Scholar
  13. Matsui H, Misumi T, Yokomizo T, Konishi S (2010) Clustering for functional data via nonlinear mixed effects models. Jpn J Appl Stat 45:25–45CrossRefGoogle Scholar
  14. Kohonen T (2001) Self-organizing maps, 3rd edn. Springer Science & Business Media, New YorkCrossRefzbMATHGoogle Scholar
  15. Laird N, Ware J (1982) Random-effects models for longitudinal data. Biometrics 38:963–974CrossRefzbMATHGoogle Scholar
  16. Luan Y, Li H (2003) Clustering of time-course gene expression data using a mixed-effects model with \(B\)-splines. Bioinformatics 19:474–482CrossRefGoogle Scholar
  17. Park J, Ahn J (2017) Clustering multivariate functional data with phase variation. Biometrics 73:324–333MathSciNetCrossRefzbMATHGoogle Scholar
  18. Reithinger F, Jank W, Tutz G, Shmueli G (2008) Modelling price paths in on-line auctions: smoothing sparse and unevenly sampled curves by using semiparametric mixed models. J R Stat Soc C 57:127–148MathSciNetCrossRefzbMATHGoogle Scholar
  19. Rice J, Wu C (2001) Nonparametric mixed effects models for unequally sampled noisy curves. Biometrics 57:253–259MathSciNetCrossRefzbMATHGoogle Scholar
  20. Schwarz G (1978) Estimating the dimension of a model. Ann Stat 6:461–464MathSciNetCrossRefzbMATHGoogle Scholar
  21. Verbeke G, Molenberghs G (2000) Linear mixed models for longitudinal data. Springer, New YorkzbMATHGoogle Scholar

Copyright information

© The Behaviormetric Society 2018

Authors and Affiliations

  1. 1.Department of BiostatisticsYokohama City University School of MedicineYokohamaJapan
  2. 2.Faculty of Data ScienceShiga UniversityHikoneJapan
  3. 3.Department of MathematicsChuo UniversityTokyoJapan

Personalised recommendations