Advertisement

Fast Retrieval of Time Series Using a Multi-resolution Filter with Multiple Reduced Spaces

  • Muhammad Marwan Muhammad Fuad
  • Pierre-François Marteau
Part of the Lecture Notes in Computer Science book series (LNCS, volume 6440)

Abstract

Fast retrieval of time series that are similar to a given pattern in large databases is a problem which has received a lot of attention in the last decade. The high dimensionality and large size of time series databases make sequential scanning inefficient to handle the similarity search problem. Several dimensionality reduction techniques have been proposed to reduce the complexity of the similarity search. Multi-resolution techniques are methods that speed-up the similarity search problem by economizing distance computations. In this paper we revisit two of previously proposed methods and present an improved algorithm that combine the advantages of these two methods. We conduct extensive experiments that show the show the superior performance of the improved algorithm over the previously proposed techniques.

Keywords

Time Series Databases Similarity Search Dimensionality Reduction Techniques Sequential Scanning MIR MIR_X Tight-MIR 

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. 1.
    Agrawal, R., Faloutsos, C., Swami, A.: Efficient Similarity Search in Sequence Databases. In: Lomet, D.B. (ed.) FODO 1993. LNCS, vol. 730. Springer, Heidelberg (1993)Google Scholar
  2. 2.
    Agrawal, R., Lin, K.I., Sawhney, H.S., Shim, K.: Fast Similarity Search in the Presence of Noise, Scaling, and Translation in Time-series Databases. In: Proceedings of the 21st Int’l Conference on Very Large Databases, Zurich, Switzerland, pp. 490–501 (1995)Google Scholar
  3. 3.
    Cai, Y., Ng, R.: Indexing Spatio-temporal Trajectories with Chebyshev Polynomials. In: SIGMOD (2004)Google Scholar
  4. 4.
    Chan, K., Fu, A.W.: Efficient Time Series Matching by Wavelets. In: Proc. of the 15th IEEE Int’l Conf. on Data Engineering, Sydney, Australia, March 23-26, pp. 126–133 (1999)Google Scholar
  5. 5.
    Ding, H., Trajcevski, G., Scheuermann, P., Wang, X., Keogh, E.: Querying and Mining of Time Series Data: Experimental Comparison of Representations and Distance Measures. In: Proc of the 34th VLDB (2008)Google Scholar
  6. 6.
    Faloutsos, C., Ranganathan, M., Manolopoulos, Y.: Fast Subsequence Matching in Time-series Databases. In: Proc. ACM SIGMOD Conf., Minneapolis (1994)Google Scholar
  7. 7.
    Lin, J., Keogh, E.J., Lonardi, S., Chiu, B.Y.-c.: A Symbolic Representation of Time Series, with Implications for Streaming Algorithms. In: DMKD (2003)Google Scholar
  8. 8.
    Keogh, E.,, Chakrabarti, K.,, Pazzani, M., Mehrotra: Dimensionality Reduction for Fast Similarity Search in Large Time Series Databases. J. of Know. and Inform. Sys. (2000)Google Scholar
  9. 9.
    Keogh, E., Chakrabarti, K,. Pazzani, M., Mehrotra: Locally Adaptive Dimensionality Reduction for Similarity Search in Large Time Series Databases. In: SIGMOD (2001)Google Scholar
  10. 10.
    Korn, F., Jagadish, H., Faloutsos, C.: Efficiently Supporting ad hoc Queries in Large Datasets of Time Sequences. In: Proceedings of SIGMOD 1997, Tucson, AZ, pp. 289–300 (1997)Google Scholar
  11. 11.
    Morinaka, Y., Yoshikawa, M., Amagasa, T., Uemura, S.: The L-index: An indexing Structure for Efficient Subsequence Matching in Time Sequence Databases. In: Cheung, D., Williams, G.J., Li, Q. (eds.) PAKDD 2001. LNCS (LNAI), vol. 2035, pp. 51–60. Springer, Heidelberg (2001)Google Scholar
  12. 12.
    Muhammad Fuad, M.M., Marteau, P.F.: Multi-resolution Approach to Time Series Retrieval. In: Fourteenth International Database Engineering & Applications Symposium– IDEAS 2010, Montreal, QC, Canada (2010)Google Scholar
  13. 13.
    Muhammad Fuad, M.M. , Marteau P.F.: Speeding-up the Similarity Search in Time Series Databases by Coupling Dimensionality Reduction Techniques with a Fast-and-dirty Filter. In: Fourth IEEE International Conference on Semantic Computing– ICSC 2010.Carnegie Mellon University, Pittsburgh (2010)Google Scholar
  14. 14.
  15. 15.
    Schulte, M.J., Lindberg, M., Laxminarain, A.: Performance Evaluation of Decimal Floating-point Arithmetic in IBM Austin Center for Advanced Studies Conference (February 2005)Google Scholar
  16. 16.
    SISTA’s Identification Database, http://www.esat.kuleuven.ac.be/~tokka/daisydata.html
  17. 17.
    StatLib - Datasets Archive, http://lib.stat.cmu.edu/datasets/
  18. 18.
    Yi, B.K., Faloutsos, C.: Fast Time Sequence Indexing for Arbitrary Lp norms. In: Proceedings of the 26st International Conference on Very Large Databases, Cairo, Egypt (2000)Google Scholar
  19. 19.

Copyright information

© Springer-Verlag Berlin Heidelberg 2010

Authors and Affiliations

  • Muhammad Marwan Muhammad Fuad
    • 1
  • Pierre-François Marteau
    • 1
  1. 1.VALORIAUniversité de Bretagne Sud, Université Européenne de BretagneVannesFrance

Personalised recommendations