Abstract
Several non-parametric statistics originally designed to test the equality of the log-spectra of two stochastic processes are proposed as dissimilarity measures between two time series. Their behavior in time series clustering is analyzed throughout a simulation study, and compared with the performance of several model-free and model-based dissimilarity measures. Up to three different classification settings are considered: (1) to distinguish between stationary and non-stationary time series, (2) to classify different ARMA processes and (3) to classify several non-linear time series models. As it was expected, the performance of a particular dissimilarity metric strongly depended on the type of processes subjected to clustering. Among all the measures studied, the non-parametric distances showed the most robust behaviour.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Caiado, J., Crato, N., & Peña, D. (2006). A periodogram-based metric for time series classification. Computational Statististics & Data Analysis, 50, 2668–2684.
Fan, J., & Kreutzberger, E. (1998). Automatic local smoothing for spectral density estimation. Scandinavian Journal of Statistics, 25, 359–369.
Fan, J., & Zhang, W. (2004). Generalised likelihood ratio tests for spectral density. Biometrika, 91, 195–209.
Galeano, P., & Peña, D. (2000). Multivariate analysis in vector time series. Resenhas, 4, 383–403.
Gavrilov, M., Anguelov, D., Indyk, P., & Motwani, R. (2000). Mining the stock market (extended abstract): which measure is best? Proceedings of the 6th Internatinal Conference on Knowledge Discovery and Data Mining (KDD). August 20–23, pp. 487–496, Boston, MA, USA.
Kakizawa, Y., Shumway, R. H., & Taniguchi, M. (1998). Discrimination and clustering for multivariate time series. Journal of the American Statistical Association, 93, 328–340.
Liao, T. W. (2005). Clustering of time series data: a survey. Pattern Recognition, 38, 1857–1874.
Maharaj, E. A. (1996). A significance test for classifying ARMA models. Journal of Statistical Computation and Simulation, 54, 305–331.
Maharaj, E. A. (2000). Clusters of time series. Journal of Classification, 17, 297–314.
Piccolo, D. (1990). A distance measure for classifying arima models. Journal of Time Series Analysis, 11, 153–164.
Tong, H., & Yeung, I. (1991). On tests for self-exciting threshold autoregressive type non-linearity in partially observed time series. Applied Statistics, 40, 43–62.
Vilar, J. A., & Pértega, S. (2004). Discriminant and cluster analysis for Gaussian stationary processes: Local linear fitting approach. Journal of Nonparametric Statistics 16, 443–462.
Vilar, J. M., Vilar, J. A., & Pértega, S. (2009) Classifying time series data: A nonparametric approach. Journal of Classification, 2009 (to appear).
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2010 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Díaz, S.P., Vilar, J.A. (2010). A Comparative Study of Several Parametric and Semiparametric Approaches for Time Series Classification. In: Locarek-Junge, H., Weihs, C. (eds) Classification as a Tool for Research. Studies in Classification, Data Analysis, and Knowledge Organization. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-10745-0_15
Download citation
DOI: https://doi.org/10.1007/978-3-642-10745-0_15
Published:
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-10744-3
Online ISBN: 978-3-642-10745-0
eBook Packages: Mathematics and StatisticsMathematics and Statistics (R0)