Skip to main content
Log in

Abstract

Time-series discord is widely used in data mining applications to characterize anomalous subsequences in time series. Compared to some other discord search algorithms, the direct search algorithm based on the recurrence plot shows the advantage of being fast and parameter free. The direct search algorithm, however, relies on quasi-periodicity in input time series, an assumption that limits the algorithm's applicability. In this paper, we eliminate the periodicity assumption from the direct search algorithm by proposing a reference function for subsequences and a new sampling strategy based on the reference function. These measures result in a new algorithm with improved efficiency and robustness, as evidenced by our empirical evaluation.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Similar content being viewed by others

References

  1. Shoeb A, Edwards H, Connolly J et al (2004) Patient-specific seizure onset detection. Epilepsy & Behavior 5(4):483–498

    Article  Google Scholar 

  2. Febrero M, Galeano P, González-Manteiga W (2007) A functional analysis of NOx levels: Location and scale estimation and outlier detection. Computational Statistics 22(3):411–427

    Article  MathSciNet  MATH  Google Scholar 

  3. van Wijk J, Van Selow E. Cluster and calendar based visualization of time series data. In Proc. the 1999 IEEE Symposium on Information Visualization, Oct. 1999, pp.4–9.

  4. Rebbapragada U, Protopapas P, Brodley C, Alcock C (2009) Finding anomalous periodic time series. Machine learning 74(3):281–313

    Article  Google Scholar 

  5. Rousseeuw P, Leroy A. Robust Regression and Outlier Detection. Wiley Online Library, 1987.

  6. Dasgupta D, Forrest S. Novelty detection in time series data using ideas from immunology. In Proc. the International Conference on Intelligent Systems, Dec. 1996, pp.82–87.

  7. Hyndman R, Ullah S (2007) Robust forecasting of mortality and fertility rates: A functional data approach. Computational Statistics & Data Analysis 51(10):4942–4956

    Article  MathSciNet  MATH  Google Scholar 

  8. Febrero M, Galeano P, González-Manteiga W (2008) Outlier detection in functional data by depth measures, with application to identify abnormal NOX levels. Environmetrics 19(4):331–345

    Article  MathSciNet  Google Scholar 

  9. Keogh E, Lin J, Fu A. HOT SAX: Efficiently finding the most unusual time series subsequence. In Proc. the 5th IEEE International Conference on Data Mining, Nov. 2005, pp.226–233.

  10. Lin J, Keogh E, Fu A, Van Herle H. Approximations to magic: Finding unusual medical time series. In Proc. the 18th IEEE Symposium on Computer-Based Medical Systems, Jun. 2005, pp.329–334.

  11. Bu Y, Leung T, Fu A, Keogh E, Pei J, Meshkin S. WAT: Finding top-K discords in time series database. In Proc. the 7th SIAM International Conference on Data Mining, Apr. 2007.

  12. Yankov D, Keogh E, Rebbapragada U (2008) Disk aware discord discovery: Finding unusual time series in terabyte sized datasets. Knowledge and Information Systems 17(2):241–262

    Article  Google Scholar 

  13. Goldberger A, Amaral L, Glass L et al (2000) Physiobank, physiotoolkit, and physionet: Components of a new research resource for complex physiologic signals. Circulation 101(23):e215–e220

    Article  Google Scholar 

  14. Shoeb A, Guttag J. Application of machine learning to epileptic seizure detection. In Proc. the 27th International Conference on Machine Learning, Jun. 2010, pp.975–982.

  15. Pham N, Le Q, Dang T. HOT aSAX: A novel adaptive symbolic representation for time series discords discovery. In Proc. the 2nd Intelligent Information and Database Systems, Mar. 2010, pp.113–121.

  16. Fu A, Leung O, Keogh E, Lin J. Finding time series discords based on Haar transform. In Proc. the 2nd Advanced Data Mining and Applications, Aug. 2006, pp.31–41.

  17. Shieh J, Keogh E. iSAX: Indexing and mining terabyte sized time series. In Proc. the 14th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, Aug. 2008, pp.623–631.

  18. Son M, Anh D. EWAT+: Finding time series discords based on new discord measure functions. In Proc. IEEE International Conference on Computing & Communication Technologies, Research, Innovation, and Vision for the Future, Nov. 2010, pp.1–4.

  19. Luo W, Gallagher M. Faster and parameter-free discord search in quasi-periodic time series. In Proc. PAKDD, May 2011, pp.135–148.

  20. Eckmann J, Kamphorst S, Ruelle D (1987) Recurrence plots of dynamical systems. Europhysics Letters 4(9):973–977

    Article  Google Scholar 

  21. Iwanski J, Bradley E (1998) Recurrence plots of experimental data: To embed or not to embed? Chaos: An Interdisciplinary Journal of Nonlinear Science 8(4):861–871

    Article  Google Scholar 

  22. Webber Jr C, Zbilut J. Recurrence quantification analysis of nonlinear dynamical systems. Tutorials in Contemporary Nonlinear Methods for the Behavioral Sciences, 2005, pp.26–94.

  23. Marwan N, Carmen Romano MC, Thiel M, Kurths J (2007) Recurrence plots for the analysis of complex systems Physics Reports 438(5/6):237–329

    Google Scholar 

  24. Marwan N (2008) A historical review of recurrence plots. The European Physical Journal-Special Topics 164(1):3–12

    Article  MathSciNet  Google Scholar 

  25. Chatfield C. The Analysis of Time Series: An Introduction, (6th edition). Chapman & Hall/CRC, 2004.

  26. Keogh E, Lin J, Fu A. The UCR time series discords. http://www.cs.ucr.edu/~eamonn/discords/, April 2010.

  27. Hyndman R. Time series data library. http://data.is/TSDLdemo, April 2010.

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Wei Luo.

Additional information

Support by Australian Research Council Linkage Grant No. LP 0776417.

Electronic Supplementary Material

Below is the link to the electronic supplementary material.

(DOC 28.0 KB)

Rights and permissions

Reprints and permissions

About this article

Cite this article

Luo, W., Gallagher, M. & Wiles, J. Parameter-Free Search of Time-Series Discord. J. Comput. Sci. Technol. 28, 300–310 (2013). https://doi.org/10.1007/s11390-013-1330-8

Download citation

  • Received:

  • Revised:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s11390-013-1330-8

Keywords

Navigation