Crisp and fuzzy k-means clustering algorithms for multivariate functional data

  • Original Paper
Computational Statistics

Abstract

Functional data analysis, as proposed by Ramsay (Psychometrika 47:379–396, 1982), has recently attracted many researchers. The most popular approach in recent studies of functional data has been to extend statistical methods for conventional data to functional data (e.g., Ramsay and Silverman in Functional data analysis. Springer, Berlin Heidelberg New York, 1997, Applied functional data analysis: methods and case studies. Springer, Berlin Heidelberg New York, 2002; Mizuta in Proceedings of the tenth Japan and Korea Joint Conference of Statistics, pp 77–82, 2000; Shimokawa et al. in Jpn J Appl Stat 29:27–39, 2000). In addition, several methods for clustering functional data have been proposed (Abraham et al. in Scand J Stat 30:581–595, 2003; Gareth and Catherine in J Am Stat Assoc 98:397–408, 2003; Tarpey and Kinateder in J Classif 20:93–114, 2003; Rossi et al. in Proceedings of European Symposium on Artificial Neural Networks, pp 305–312, 2004). Furthermore, Tokushige et al. (J Jpn Soc Comput Stat 15:319–326, 2002) defined several dissimilarities between functions for the case of functional data. In this paper, we extend existing crisp and fuzzy k-means clustering algorithms to the analysis of multivariate functional data. In particular, we treat the dissimilarity between functions as itself a function. Furthermore, cluster centers and memberships, which are defined as functions, are determined at the minimum of a certain target function by using a calculus-of-variations approach.
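To make the idea of function-valued centers and memberships concrete, here is a minimal sketch, not the authors' algorithm. It assumes a target function of the standard fuzzy k-means form integrated over the argument, J = ∫ Σ_i Σ_c u_ic(t)^m ‖x_i(t) − v_c(t)‖² dt with Σ_c u_ic(t) = 1, which the abstract does not spell out. Because this assumed integrand contains no derivatives of u or v, the variational stationarity conditions hold pointwise in t, so on a discretized grid the problem reduces to ordinary fuzzy k-means updates at each grid point. All names (`functional_fcm`, `curves`, the fuzzifier `m`) are hypothetical, and warm-starting each grid point from the previous centers is only a heuristic to keep cluster labels aligned across t.

```python
def sq_dist(a, b):
    """Squared Euclidean distance between two p-dimensional points."""
    return sum((ai - bi) ** 2 for ai, bi in zip(a, b))


def memberships(points, centers, m):
    """Fuzzy k-means membership update at a single grid point."""
    u = []
    for x in points:
        d = [sq_dist(x, c) for c in centers]
        if min(d) == 0.0:
            # The point coincides with at least one center: share the
            # membership equally among the coinciding centers.
            hits = [dj == 0.0 for dj in d]
            u.append([1.0 / sum(hits) if h else 0.0 for h in hits])
        else:
            inv = [dj ** (-1.0 / (m - 1.0)) for dj in d]
            total = sum(inv)
            u.append([v / total for v in inv])
    return u


def update_centers(points, u, m, old_centers):
    """Weighted-mean center update with weights u_ic ** m."""
    p = len(points[0])
    centers = []
    for c, old in enumerate(old_centers):
        w = [ui[c] ** m for ui in u]
        total = sum(w)
        if total == 0.0:
            centers.append(list(old))  # empty cluster: keep previous center
        else:
            centers.append(
                [sum(wi * x[j] for wi, x in zip(w, points)) / total
                 for j in range(p)]
            )
    return centers


def fcm_at_time(points, init_centers, m=2.0, iters=100, tol=1e-12):
    """Alternate the two updates until the centers stop moving."""
    centers = [list(c) for c in init_centers]
    for _ in range(iters):
        u = memberships(points, centers, m)
        new = update_centers(points, u, m, centers)
        shift = max(sq_dist(a, b) for a, b in zip(centers, new))
        centers = new
        if shift < tol:
            break
    return memberships(points, centers, m), centers


def functional_fcm(curves, k, m=2.0):
    """curves[i][t] is the p-dimensional value of curve i at grid point t.

    Returns U and C with U[t][i][c] = membership of curve i in cluster c
    at grid point t, and C[t][c] = center of cluster c at grid point t.
    """
    # Assumption: the first k curves differ at the first grid point,
    # so they can serve as initial centers.
    centers = [list(curves[i][0]) for i in range(k)]
    U, C = [], []
    for t in range(len(curves[0])):
        points = [curve[t] for curve in curves]
        u, centers = fcm_at_time(points, centers, m)  # warm start from t-1
        U.append(u)
        C.append(centers)
    return U, C


# Toy data (made up for illustration): four bivariate curves on three grid points.
curves = [
    [(0.0, 0.0), (1.0, 0.0), (2.0, 0.0)],
    [(5.0, 5.0), (6.0, 5.0), (7.0, 5.0)],
    [(0.2, 0.1), (1.1, 0.2), (2.0, 0.3)],
    [(5.1, 4.9), (6.0, 5.2), (7.2, 5.0)],
]
U, C = functional_fcm(curves, k=2)
for t, u_t in enumerate(U):
    print(t, [round(v, 3) for v in u_t[0]])  # memberships of curve 0 over t
```

Replacing the membership update with a hard assignment to the nearest center gives the crisp counterpart under the same assumptions. A target function that penalizes roughness of the centers or memberships would couple neighboring grid points, and this pointwise reduction would no longer apply.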

References

  • Abraham C, Cornillon PA, Matzner-Lober E, Molinari N (2003) Unsupervised curve clustering using B-splines. Scand J Statist 30:581–595

  • D’Urso P (2004) Fuzzy C-means clustering models for multivariate time-varying data: different approaches. Int J Uncertain Fuzziness Knowl Based Syst 12(3):287–326

  • Gareth MJ, Catherine AS (2003) Clustering for sparsely sampled functional data. J Am Statist Assoc 98:397–408

  • MacQueen J (1967) Some methods for classification and analysis of multivariate observations. In: Proceedings of the 5th Berkeley symposium on mathematical statistics and probability, vol. 1, pp. 281–297, University of California Press, Berkeley

  • Mizuta M (2000) Functional multidimensional scaling. In: Proceedings of the Tenth Japan and Korea joint conference of statistics, pp. 77–82

  • Ramsay JO (1982) When the data are functions. Psychometrika 47:379–396

  • Ramsay JO, Silverman BW (1997) Functional data analysis. Springer, Berlin Heidelberg New York

  • Ramsay JO, Silverman BW (2002) Applied functional data analysis: methods and case studies. Springer, Berlin Heidelberg New York

  • Rossi F, Conan-Guez B, El Golli A (2004) Clustering functional data with the SOM algorithm. In: Proceedings of European symposium on artificial neural networks, pp. 305–312

  • Ruspini EH (1969) Hierarchical grouping to optimize an objective function. J Am Statist Assoc 58:236–244

  • Shimokawa M, Mizuta M, Sato Y (2000) An expansion of functional regression analysis (in Japanese). Jpn J Appl Statist 29:27–39

  • Simonoff JS (1998) Smoothing methods in statistics. Springer, Berlin Heidelberg New York

  • Tarpey T, Kinateder KKJ (2003) Clustering functional data. J Classif 20:93–114

  • Tokushige S, Inada K, Yadohisa H (2002) Dissimilarity and related methods for functional data. J Jpn Soc Comput Statist 15:319–326

  • Tuddenham RD, Snyder MM (1954) Physical growth of California boys and girls from birth to eighteen years. University of California Press, Berkeley, Los Angeles

Author information

Correspondence to Shuichi Tokushige.

About this article

Cite this article

Tokushige, S., Yadohisa, H. & Inada, K. Crisp and fuzzy k-means clustering algorithms for multivariate functional data. Computational Statistics 22, 1–16 (2007). https://doi.org/10.1007/s00180-006-0013-0

  • DOI: https://doi.org/10.1007/s00180-006-0013-0
