Advertisement

GeoInformatica

, Volume 15, Issue 1, pp 75–109 | Cite as

TIDES—a new descriptor for time series oscillation behavior

  • Leonardo E. Mariote
  • Claudia Bauzer MedeirosEmail author
  • Ricardo da Silva Torres
  • Lucas M. Bueno
Article

Abstract

Sensor networks have increased the amount and variety of temporal data available, requiring the definition of new techniques for data mining. Related research typically addresses the problems of indexing, clustering, classification, summarization, and anomaly detection. There is a wide range of techniques to describe and compare time series, but they focus on series’ values. This paper concentrates on a new aspect—that of describing oscillation patterns. It presents a technique for time series similarity search, and multiple temporal scales, defining a descriptor that uses the angular coefficients from a linear segmentation of the curve that represents the evolution of the analyzed series. This technique is generalized to handle co-evolution, in which several phenomena vary at the same time. Preliminary experiments with real datasets showed that our approach correctly characterizes the oscillation of single time series, for multiple time scales, and is able to compute the similarity among sets of co-evolving series.

Keywords

Time series similarity computation Time series descriptor Oscillation of series Series co-evolution 

Notes

Acknowledgements

This work was partially funded by CPqD Foundation, CAPES, FAPESP, CNPq grants and CNPq projects WebMAPS and RPG. It is also being partially funded by the Microsoft Research-FAPESP Virtual institute, under the eFarms project. We thank Jeferson Lobato Fernandes for providing us with experimental data.

References

  1. 1.
    Agrawal R, Lin KI, Sawhney HS, Shim K (1995) Fast similarity search in the presence of noise, scaling, and translation in time-series databases. In: 21st VLSB conference, pp 490–501Google Scholar
  2. 2.
    Akyildiz IF, Su W, Sankarasubramaniam Y, Cayirci E (2002) Wireless sensor networks: a survey. Comput Netw 38(4):393–422CrossRefGoogle Scholar
  3. 3.
    Babcock B, Babu S, Datar M, Motwani R, Widom J (2003) Models and issues in data stream systems. Technical report, Department of Computer Science, Stanford UniversityGoogle Scholar
  4. 4.
    Cai Y, Ng R (2004) Indexinials. In: Proc ACM SIGMOD conference spatio-temporal trajectories with Chebyshev polynomGoogle Scholar
  5. 5.
    Deshpande A, Guestrin C, Madden S (2004) Model-driven data acquisition in sensor networks. In: Proc 30th VLDB conferenceGoogle Scholar
  6. 6.
    Deshpande A, Madden S (2006) MauveDB: supporting model-based user views in database systems. In: Proc of the 2006 ACM SIGMOD conference, pp 73–84Google Scholar
  7. 7.
    Ding H, Trajcevski G, Scheuermann P, Wang X, Keogh E (2008) Querying and mining of time series data: experimental comparison of representations and distance measures. In: Proc VLDB conferenceGoogle Scholar
  8. 8.
    Etien A, Salinesi C (2005) Managing requirements in a co-evolution context. In: RE05—Proceedings of the 13th international conference on requirements engineeringGoogle Scholar
  9. 9.
    Faloutsos C (2002) Tutorial: sensor data mining: similarity search and pattern analysis. In: 28th VLDB conferenceGoogle Scholar
  10. 10.
    Faloutsos C, Ranganathan M, Manolopoulos Y (1994) Fast subsequence matching in time-series databases. In: Proceedings 1994 ACM SIGMOD conference, Minneapolis, MN, pp 419–429Google Scholar
  11. 11.
    Fu L, Soh L, Samal A (2008) Techniques for computing fitness of use (FoU) for time series datasets with applications in the geospatial domain. GeoInfo 12(1):91–93CrossRefGoogle Scholar
  12. 12.
    Golab L, Oszu M (2003) Issues in data stream management. CM SIGMOD Rec 32:5–14CrossRefGoogle Scholar
  13. 13.
    Han J, Kamber M (2002) Data mining: concepts and techniques. In: ACM SIGMOD, vol 31Google Scholar
  14. 14.
    Han J, Pei J, Mortazavi-Asl B, Chen Q, Dayal U, Hsu M (2000) Freespan: frequent pattern-projected sequential pattern mining. In: KDD ’00: Proceedings of the sixth ACM SIGKDD international conference on knowledge discovery and data mining, pp 355–359Google Scholar
  15. 15.
    Hugueney B (2003) Representations symboliques de longues series temporelles (Symbolic representations of long temporal series). PhD thesis, University Paris 6Google Scholar
  16. 16.
    Joliveau M, De Vuyst F (2007) Space-Time summarization of multisensor time series. Case of missing data. In: Int workshop on spatial and spatio-temporal data mining—SSTDMGoogle Scholar
  17. 17.
    Junkui L, Yuanzhen W (2007) APCAS: an approximate approach to adaptively segment time series streams. In: Advances in data and web management, vol 4505. Springer, Berlin, pp 554–565CrossRefGoogle Scholar
  18. 18.
    Keogh E, Chakrabarti K, Pazzani M, Mehrotra S (2001) Locally adaptive dimensionality reduction for indexing large time series databases. In: Proc ACM SIGMOD conference, pp 151–162Google Scholar
  19. 19.
    Keogh E, Pazzani M (1998) An enhanced representation of time series which allows fast and accurate classification, clustering and relevance feedback. In Agrawal R, Stolorz P, Piatetsky-Shapiro G (eds) Fourth international conference on knowledge discovery and data mining (KDD’98). ACM, New York, pp 239–241Google Scholar
  20. 20.
    Keogh E, Ratanamahatana CA (2005) Exact indexing of dynamic time warping. Knowl Inf Syst 7(3):358–386CrossRefGoogle Scholar
  21. 21.
    Keogh E, Smyth P (1997) A probabilistic approach to fast pattern matching in time series databases. In Heckerman D, Mannila H, Pregibon D, Uthurusamy R (eds) Third international conference on knowledge discovery and data mining. Newport Beach, CA, USA. AAAI, Menlo Park, pp 24–30Google Scholar
  22. 22.
    Keogh E, Xi X, Wei L, Ratanamahatana CA (2006) The UCR time series classification/clustering homepage. www.cs.ucr.edu/~eamonn/timeseriesdata/
  23. 23.
    Keogh EJ, Chu S, Hart D, Pazzani MJ (2001) An online algorithm for segmenting time series. In: ICDM ’01: Proceedings of the 2001 IEEE international conference on data mining. IEEE Computer Society, Washington, DC, pp 289–296CrossRefGoogle Scholar
  24. 24.
    Keogh EJ, Pazzani MJ (2000) A simple dimensionality reduction technique for fast similarity search in large time series databases. In: Knowledge discovery and data mining, current issues and new applications, 4th Pacific-Asia conference, PAKDD 2000, vol 1805. Springer, Berlin, pp 122–133CrossRefGoogle Scholar
  25. 25.
    Korth H, Jagadish H, Faloutsos C (1997) Efficiently supporting ad hoc queries in large data sets of time sequences. In: Proc ACM SIGMOD conferenceGoogle Scholar
  26. 26.
    Lin J, Keogh E, Lonardi S, Chiu B (2003) A symbolic representation of time series, with implications for streaming algorithms. In: DMKD ’03: Proceedings of the 8th ACM SIGMOD workshop on Research issues in data mining and knowledge discovery. ACM, New York, pp 2–11CrossRefGoogle Scholar
  27. 27.
    Lin J, Keogh E, Wei L, Lonardi S (2007) Experiencing SAX: a novel symbolic representation of time series. Data Min Knowl Discov 15:107–144CrossRefGoogle Scholar
  28. 28.
    Mainwaring A, Culler D, Polastre J, Szewczyk R, Anderson J (2002) Wireless sensor networks for habitat monitoring. In: WSNA ’02: Proceedings of the 1st ACM international workshop on Wireless sensor networks and applications. ACM, New York, pp 88–97CrossRefGoogle Scholar
  29. 29.
    Mariote L, Medeiros CB, Torres R (2007) Diagnosing similarity of oscillation trends in time series. In: International Workshop on spatial and spatio-temporal data mining—SSTDM. LNCS, pp 643–648Google Scholar
  30. 30.
    Mirmomeni M, Lucas C, Araabi B, Moshiri B (2007) Forecasting solar activity using co-evolution of models and tests. In: Proc 7th international conference on intelligent systems design and applicationsGoogle Scholar
  31. 31.
    Navarro G (2001) A guided tour to approximate string matching. ACM Comput Surv 33(1):31–88CrossRefGoogle Scholar
  32. 32.
    Park S, Lee D, Chu WW (1999) Fast retrieval of similar subsequences in long sequence databases. In: KDEX ’99: Proceedings of the 1999 workshop on knowledge and data engineering exchange. IEEE Computer Society, Washington, DC, p 60Google Scholar
  33. 33.
    Patt-Shamir B (2007) A note on efficient aggregate queries in sensor networks. Theor Comp Sci 370(1–3):254–264CrossRefGoogle Scholar
  34. 34.
    Rafiei D, Mendelzon A (1997) Similarity-based queries for time series data. In: SIGMOD ’97: Proceedings of the 1997 ACM SIGMOD international conference on management of data, pp 13–25Google Scholar
  35. 35.
    Sacchi L, Larizza C, Combi C, Bellazzi R (2007) Data mining with temporal abstractions: learning rules from time series. Data Min Knowl Discov 15(2):217–247CrossRefGoogle Scholar
  36. 36.
    Shoshani A, Kawagoe K (1986) Temporal data management. In: Twelfth international conference on very large data bases table of contents, pp 79–88Google Scholar
  37. 37.
    Smeulders AWM, Worring M, Santini S, Gupta A, Jain R (2000) Content-based image retrieval at the end of the early years. IEEE Trans Pattern Anal Mach Intell 22:1349–1380CrossRefGoogle Scholar
  38. 38.
    Szewczyk R, Polastre J, Mainwaring A, Culler D (2004) Lessons from a sensor network expedition. In: Proceedings of the first European workshop on sensor networks (EWSN)Google Scholar
  39. 39.
    Torres RS, Falcao AX, Costa LF (2002) Shape description by image foresting transform. In: Digital signal processing, 2002. DSP 2002. 2002 14th International conference on, vol 2, pp 1089–1092Google Scholar
  40. 40.
    Wu H, Salzberg B, Sharp GC, Jiang SB, Shirato H, Kaeli D (2005) Subsequence matching on structured time series data. In: SIGMOD ’05: Proceedings of the 2005 ACM SIGMOD international conference on management of data. ACM, New York, pp 682–693CrossRefGoogle Scholar
  41. 41.
    Yi B, Sidiropoulos ND, Johnson T, Jagadish HV, Faloutsos C, Biliris A (2000) Online data mining for co-evolving time sequences. In: ICDE ’00: Proceedings of the 16th international conference on data engineering. IEEE Computer Society, Washington, DC, p 13Google Scholar

Copyright information

© Springer Science+Business Media, LLC 2010

Authors and Affiliations

  • Leonardo E. Mariote
    • 1
  • Claudia Bauzer Medeiros
    • 1
    Email author
  • Ricardo da Silva Torres
    • 1
  • Lucas M. Bueno
    • 1
  1. 1.Institute of ComputingUniversity of Campinas—CP6176CampinasBrazil

Personalised recommendations