Abstract
Large datasets are often time series data, and such datasets present challenging problems that arise from the passage of time reflected in the datasets. A problem of current interest is clustering and classification of multiple time series. When various time series are fitted to models, the different time series can be grouped into clusters based on the fitted models. If there are different identifiable classes of time series, the fitted models can be used to classify new time series.
For massive time series datasets, any assumption of stationarity is not likely to be met. Any useful time series model that extends over a lengthy time period must either be very weak, that is, a model in which the signal-to-noise ratio is relatively small, or else must be very complex with many parameters. Hence, a common approach to model building in time series is to break the series into separate regimes and to identify an adequate local model within each regime. In this case, the problem of clustering or classification can be addressed by use of sequential patterns of the models for the separate regimes.
In this chapter, we discuss methods for identifying changepoints in a univariate time series. We will emphasize a technique called alternate trends smoothing.
After identification of changepoints, we briefly discuss the problem of defining patterns. The objectives of defining and identifying patterns are twofold: to cluster and/or to classify sets of time series, and to predict future values or trends in a time series.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Badhiye SS, Hatwar KS, Chatur PN (2015) Trend based approach for time series representation. Int J Comput Appl 113:10–13
Bao D, Yang Z (2008) Intelligent stock trading system by turning point confirming and probabilistic reasoning. Expert Syst Appl 34:620–627
Dorsey TJ (2007) Point and figure charting: the essential application for forecasting and tracking market prices, 3rd edn. Wiley, Hoboken
Fink E, Gandhi HS (2011) Compression of time series by extracting major extrema. J Exp Theor Artif Intell 23:255–270
Fu T (2011) A review on time series data mining. Eng Appl Artif Intell 24:164–181
Fu T-C, Chung KF-L, Luk RWP, man Ng C (2008) Representing financial time series based on data point importance. Eng Appl Artif Intell 21:277–300. https://doi.org/10.1016/j.engappai.2007.04.009
Gentle JE (2012) Mining for patterns in financial time series. In: JSM 2012 proceedings. American Statistical Association, Alexandria, pp 2978–2988
Gentle JE, Härdle WK (2012) Modeling asset prices. In: Handbook of computational finance. Springer, Heidelberg, pp 15–33
He X, Shao C, Xiong Y (2014) A non-parametric symbolic approximate representation for long time series. Pattern Anal Appl 15. https://doi.org/10.1007/s10044-014-0395-5
Lin J, Li Y (2009) Finding structural similarity in time series data using bag-of-patterns representation. In: Statistical and scientific database management, pp 461–477. https://doi.org/10.1007/978-3-642-02279-1_33
Lin J, Keogh E, Wei L, Lonardi S (2007) Experiencing SAX: a novel symbolic representation of time series. Data Min Knowl Discov 15:107–144. https://doi.org/10.1007/s10618-007-0064-z
Wilson SJ (2016) Statistical learning in financial time series data. Dissertation, George Mason University
Zhou J, Ye G, Yu D (2012) A new method for piecewise linear representation of time series data. Phys. Procedia 25:1097–1103
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2018 Springer International Publishing AG, part of Springer Nature
About this chapter
Cite this chapter
Gentle, J.E., Wilson, S.J. (2018). Finding Patterns in Time Series. In: Härdle, W., Lu, HS., Shen, X. (eds) Handbook of Big Data Analytics. Springer Handbooks of Computational Statistics. Springer, Cham. https://doi.org/10.1007/978-3-319-18284-1_6
Download citation
DOI: https://doi.org/10.1007/978-3-319-18284-1_6
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-18283-4
Online ISBN: 978-3-319-18284-1
eBook Packages: Mathematics and StatisticsMathematics and Statistics (R0)