Abstract
In this work a robust clustering algorithm for stationary time series is proposed. The algorithm is based on the use of estimated spectral densities, which are considered as functional data, as the basic characteristic of stationary time series for clustering purposes. A robust algorithm for functional data is then applied to the set of spectral densities. Trimming techniques and restrictions on the scatter within groups reduce the effect of noise in the data and help to prevent the identification of spurious clusters. The procedure is tested in a simulation study, and is also applied to a real data set.
Keywords
This is a preview of subscription content, log in via an institution.
Buying options
Tax calculation will be finalised at checkout
Purchases are for personal use only
Learn about institutional subscriptionsReferences
Aghabozorgi, S., Shirkhorshidi, A.S., Wah, T.Y.: Time-series clustering - a decade review. Inf. Syst. 53, 16–38 (2015)
Alvarez-Esteban, P.C., Euán, C., Ortega, J.: Statistical analysis of stationary intervals for random waves. In: Proceedings of International Society of Offshore and Polar Engineering Conference, vol. 3, pp. 305–311 (2016)
Alvarez-Esteban, P.C., Euán, C., Ortega, J.: Time series clustering using the total variation distance with applications in oceanography. Environmetrics 27(6), 355–369 (2016)
Bahadori, M.T., Kale, D.C., Fan, Y., Liu, Y.: Functional subspace clustering with application to time series. In: Proceedings of 32nd International Conference on Machine Learning, pp. 228–237 (2015)
Bouveyron, C., Jacques, J.: Model-based clustering of time series in group-specific functional subspaces. Adv. Data Anal. Classif. 5(4), 281–300 (2011)
Caiado, J., Crato, N., Peña, D.: A periodogram-based metric for time series classification. Comput. Stat. Data Anal. 50(10), 2668–2684 (2006)
Caiado, J., Crato, N., Peña, N.: Comparison of times series with unequal length in the frequency domain. Commun. Stat. Simul. Comput. 38(3), 527–540 (2009)
Caiado, J., Maharaj, E.A., D’Urso, P.: Time Series Clustering. Chapman & Hall/CRC Handbooks of Modern Statistical Methods. Taylor & Francis, London (2015). Chap. 12
Cuesta-Albertos, J.A., Fraiman, R.: Impartial trimmed \(k\)-means for functional data. Comput. Stat. Data Anal. 51(10), 4864–4877 (2007)
Delaigle, A., Hall, P.: Defining probability density for a distribution of random functions. Ann. Stat. 38(2), 1171–1193 (2010)
D’Urso, P., De Giovanni, L., Massari, R.: Time series clustering by a robust autoregressive metric with application to air pollution. Chemometr. Intell. Lab. Syst. 141, 107–124 (2015)
D’Urso, P., De Giovanni, L., Massari, R.: GARCH-based robust clustering of time series. Fuzzy Sets Syst. 305, 1–28 (2016)
Euán, C., Ombao, H., Ortega, J.: The hierarchical spectral merger algorithm: a new time series clustering procedure. J. Classif. (2017, accepted)
Fritz, H., García-Escudero, L.A., Mayo-Iscar, A.: A fast algorithm for robust constrained clustering. Comput. Stat. Data Anal. 61, 124–136 (2013)
García-Escudero, L.A., Gordaliza, A.: A proposal for robust curve clustering. J. Classif. 22(2), 185–201 (2005)
Hubert, L., Arabie, P.: Comparing partitions. J. Classif. 2, 193–218 (1985)
Jacques, J., Preda, C.: Funclust: a curves clustering method using functional random variables density approximation. Neurocomputing 112, 164–171 (2013)
James, G.M., Sugar, C.A.: Clustering for sparsely sampled functional data. J. Am. Stat. Assoc. 98(462), 397–408 (2003)
Liao, T.W.: Clustering of time series data - a survey. Pattern Recogn. 38, 1857–1874 (2005)
Maharaj, E.A., D’Urso, P.: Fuzzy clustering of time series in the frequency domain. Inf. Sci. 181(7), 1187–1211 (2011)
Montero, P., Vilar, J.: TSclust: an R package for time series clustering. J. Stat. Softw. 62(1), 43 (2014)
Ramsay, J.O., Silverman, B.W.: Functional Data Analysis. Springer Series in Statistics, 2nd edn. Springer, New York (2005)
Rand, W.M.: Objective criteria for the evaluation of clustering methods. J. Am. Stat. Assoc. 66, 846–850 (1971)
Rivera-García, D., García-Escudero, L.A., Mayo-Iscar, A., Ortega, J.: Robust clustering for functional data based on trimming and constraints. arXiv:1701.03267 (2017)
Rousseeuw, P.J., Van Driessen, K.: A fast algorithm for the minimum covariance determinant estimator. Technometrics 41, 212–223 (1999)
Soueidatt, M.: Funclustering: A package for functional data clustering. R package version 1.0.1 (2014)
Acknowledgements
Data for station 160 were furnished by the Coastal Data Information Program (CDIP), Integrative Oceanographic Division, operated by the Scripps Institution of Oceanography (http://cdip.ucsd.edu/). Research by DRG and JO was partially supported by Conacyt, Mexico Proyecto 169175 Análisis Estadístico de Olas Marinas, Fase II. Research by LA G-E and A M-I was partially supported by the Spanish Ministerio de Economía y Competitividad y fondos FEDER, grant MTM2014-56235-C2-1-P, and by Consejería de Educación de la Junta de Castilla y León, grant VA212U13.
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2017 Springer International Publishing AG
About this paper
Cite this paper
Rivera-García, D., García-Escudero, L.A., Mayo-Iscar, A., Ortega, J. (2017). Robust Clustering for Time Series Using Spectral Densities and Functional Data Analysis. In: Rojas, I., Joya, G., Catala, A. (eds) Advances in Computational Intelligence. IWANN 2017. Lecture Notes in Computer Science(), vol 10306. Springer, Cham. https://doi.org/10.1007/978-3-319-59147-6_13
Download citation
DOI: https://doi.org/10.1007/978-3-319-59147-6_13
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-59146-9
Online ISBN: 978-3-319-59147-6
eBook Packages: Computer ScienceComputer Science (R0)