Skip to main content

Temporal Constraints and Sub-Dimensional Clustering for Fast Similarity Search over Time Series Data. Application to Information Retrieval Tasks.

  • Conference paper
  • 3094 Accesses

Part of the book series: Advances in Intelligent Systems and Computing ((AISC,volume 366))

Abstract

In this paper we introduce a novel approach for fast retrieval of temporal patterns from time series data. This method constructs firstly an index over key subsequences, using subdimensional clustering. Then, during the querying process, rather than scanning the whole database, to extract relevant answers for a given query, our method suggests the traversal of the index represented as centroids of clusters, and search for similar subsequences.

This is a preview of subscription content, log in via an institution.

Buying options

Chapter
USD   29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD   259.00
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Hardcover Book
USD   329.99
Price excludes VAT (USA)
  • Durable hardcover edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Learn about institutional subscriptions

References

  1. Gaber Mohamed et al. Mining data streams: a review. SIGMOD Rec., 34(2):18–26, 2005.

    Google Scholar 

  2. Ralanamahatana et al. Mining time series data. In DMKD’05, pages 1069–1103.

    Google Scholar 

  3. Lin et al. A symbolic representation of time series, with implications for streaming algorithms. In ACM SIGMOD RIDMKD Workshop 03, pages 2–11.

    Google Scholar 

  4. Wei et al. Efficient query filtering for streaming times series. ICDM ’05, pages 490–497.

    Google Scholar 

  5. Capitani et al. Warping the time on data streams. DKE, 62(3):438–458, September 2007.

    Google Scholar 

  6. Rafiei et al. Similarity-based queries for time series data. SIGMOD’97, pages 13–25.

    Google Scholar 

  7. Chakrabarti et al. Locally adaptive dimensionality reduction for indexing large time series databases. ACM TDS., 27(2):188–228, June 2002.

    Google Scholar 

  8. Lin Jessica et al. Rotation-invariant similarity in time series using bag-of-patterns representation. JIIS, 39(2):287–315, 2012.

    Google Scholar 

  9. Dhiral et al. Distance measures for effective clustering of arima time-series. In ICDM’01.

    Google Scholar 

  10. Guralnik et al. Event detection from time series data. KDD ’99, pages 33–42.

    Google Scholar 

  11. Pong et al. Efficient time series matching by wavelets. In ICDE99, pages 126–133.

    Google Scholar 

  12. Qiang et al. A multiresolution symbolic representation of ts. In ICDE05, pages 668–679.

    Google Scholar 

  13. Malinowsky Simon et al. 1d-sax; a novel symbolic representation for time series. In IDA’13.

    Google Scholar 

  14. Wang et al. Time series analysis with multiple resolutions. Inf. Syst., 35(1):56–74, 2010.

    Google Scholar 

  15. Aho et al. Efficient string matching: an aid to bibliographic search. com. ACM, 18(6), 1975.

    Google Scholar 

  16. Benabderrahmane et al. Intelligo: a new vector-based semantic similarity measure including annotation origin. BMC Bioinformatics, 11(1):588, 2010.

    Google Scholar 

  17. Salton et al. Introduction to Modern Information Retrieval. McGraw-Hill, NY, USA, 1983.

    Google Scholar 

  18. Cohen, J. (1968). Weighted kappa: Nominal scale agreement provision for scaled disagreement or partial credit. Psychological Bulletin.

    Google Scholar 

  19. Raymond et al. On the marriage of lp-norms and edit distance. In VLDB 04.

    Google Scholar 

  20. Vlachos et al. Indexing ts with support for multiple distance measures. KDD ’03.

    Google Scholar 

  21. Ucr database. http://www.cs.ucr.edu/eamonn/timeseriesdata.

  22. Kelley et al. An automated approach for clustering an ensemble of NMR-derived protein structures into conformationally related subfamilies. Protein eng., 9(11):1063–1065, 1996.

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Sidahmed Benabderrahmane .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2015 Springer International Publishing Switzerland

About this paper

Cite this paper

Benabderrahmane, S. (2015). Temporal Constraints and Sub-Dimensional Clustering for Fast Similarity Search over Time Series Data. Application to Information Retrieval Tasks.. In: Selvaraj, H., Zydek, D., Chmaj, G. (eds) Progress in Systems Engineering. Advances in Intelligent Systems and Computing, vol 366. Springer, Cham. https://doi.org/10.1007/978-3-319-08422-0_40

Download citation

  • DOI: https://doi.org/10.1007/978-3-319-08422-0_40

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-319-08421-3

  • Online ISBN: 978-3-319-08422-0

  • eBook Packages: EngineeringEngineering (R0)

Publish with us

Policies and ethics