Skip to main content
Log in

Indexing Multidimensional Time-Series

  • Regular Paper
  • Published:
The VLDB Journal Aims and scope Submit manuscript

Abstract

While most time series data mining research has concentrated on providing solutions for a single distance function, in this work we motivate the need for an index structure that can support multiple distance measures. Our specific area of interest is the efficient retrieval and analysis of similar trajectories. Trajectory datasets are very common in environmental applications, mobility experiments, and video surveillance and are especially important for the discovery of certain biological patterns. Our primary similarity measure is based on the longest common subsequence (LCSS) model that offers enhanced robustness, particularly for noisy data, which are encountered very often in real-world applications. However, our index is able to accommodate other distance measures as well, including the ubiquitous Euclidean distance and the increasingly popular dynamic time warping (DTW). While other researchers have advocated one or other of these similarity measures, a major contribution of our work is the ability to support all these measures without the need to restructure the index. Our framework guarantees no false dismissals and can also be tailored to provide much faster response time at the expense of slightly reduced precision/recall. The experimental results demonstrate that our index can help speed up the computation of expensive similarity measures such as the LCSS and the DTW.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Similar content being viewed by others

References

  1. Aach, J., Church, G.: Aligning gene expression time series with time warping algorithms. Bioinformatics 17, 495–508 (2001)

    Article  Google Scholar 

  2. Agrawal, R, Faloutsos, C, Swami, A: Efficient similarity search in sequence databases. In: Proc. of the 4th FODO, pp. 69–84 (October 1993)

  3. Agrawal, R., Lin, K., Sawhney, H.S., Shim, K.: Fast similarity search in the presence of noise, scaling and translation in time-series databases. In: Proc. of VLDB, pp. 490–501 (September 1995)

  4. Arikan, O., Forsyth, D.: Interactive motion generation from examples. In: Proc. of ACM SIGGRAPH (2002)

  5. Bollobás, B., Das, G., Gunopulos, D., Mannila, H.: Time-series similarity problems and well-separated geometric sets. In: Proc. of the 13th SCG (1997)

  6. Bar-Joseph, Z., Gerber, G., Gifford, D., Jaakkola, T., Simon, I.: A new approach to analyzing gene expression time series data. In: Proc. of 6th RECOMB, pp. 39–48 (2002)

  7. Barbara, D.: Mobile computing and databases—a survey. In: IEEE TKDE, pp. 108–117 (January 1999)

  8. Berndt, D., Clifford, J.: Using dynamic time warping to find patterns in time series. In: Proc. of AAAI-94 Workshop of SIGKDD (1994)

  9. Betke, M., Gips, J., Fleming, P.: The camera mouse: visual tracking of body features to provide computer access for people with severe disabilities. IEEE Trans. Neural Syst. Rehabil. Eng. 10(1) (2002)

  10. Bozkaya, T., Yazdani, N., Ozsoyoglu, M.: Matching and indexing sequences of different lengths. In: Proc. of the CIKM (1997)

  11. Cardle, M., Vlachos, M., Brooks, S., Keogh, E., Gunopulos, D.: Fast motion capture matching with replicated motion editing. In: 29th ACM SIGGRAPH, Sketches and Applications (2003)

  12. Chu, K., Wong, M.: Fast time-series searching with scaling and shifting. ACM Principles of Database Systems, pp. 237–248 (June 1999)

  13. Chudova, D., Gaffney, S., Mjolsness, E., Smyth, P.: Translation-invariant mixture models for curve clustering. In: Proc. of 9th SIGKDD, pp. 79–88 (2003)

  14. Das, G., Gunopulos, D., Mannila, H.: Finding similar time series. In: Proc. of the 1st PKDD Symposium, pp. 88–100 (1997)

  15. de Boor, C.: A Practical Guide to Splines. Springer, Berlin Heidelberg New York (1978)

    Google Scholar 

  16. Faloutsos, C., Jagadish, H., Mendelzon, A., Milo, T.: Signature technique for similarity-based queries. In: SEQUENCES 97 (1997)

  17. Faloutsos, C., Lin, K.-I.: FastMap: a fast algorithm for indexing, data-mining and visualization of traditional and multimedia datasets. In: Proc. of ACM SIGMOD, pp. 163–174 (May 1995)

  18. Faloutsos, C., Ranganathan, M., Manolopoulos, I.: Fast subsequence matching in time series databases. In: Proc. of ACM SIGMOD, pp. 419–429 (1994)

  19. Gaffney, S., Smyth, P.: Trajectory clustering with mixtures of regression models. In: Proc. of ACM SIGKDD, pp. 63–72 (1999)

  20. Gavrila, D., Davis, L.: Towards 3-d model-based tracking and recognition of human movement: a multi-view approach. In: Int. Workshop on Face and Gesture Recognition (1995)

  21. Ge, X., Smyth, P.: Deformable markov model templates for time-series pattern matching. In: Proc. of ACM SIGKDD (2000)

  22. Goldin, D., Kanellakis, P.: On similarity queries for time-series data. In: Proc. of Principles and Practice of Constraint Programming (September 1995)

  23. Gollmer, K., Posten, C.: Detection of distorted pattern using dynamic time warping algorithm and application for supervision of bioprocesses. On-Line Fault Detection and Supervision in Chemical Process Industries (1995)

  24. Grumbach, S., Rigaux, P., Segoufin, L.: Manipulating interpolated data is easier than you thought. In: Proc. of VLDB, pp. 156–165 (2000)

  25. Guttman, A.: R-trees: a dynamic index structure for spatial searching. In: Proc. of ACM SIGMOD, pp. 47–57 (1984)

  26. Hadjieleftheriou, M., Kollios, G., Tsotras, V., Gunopulos, D.: Efficient indexing of spatiotemporal objects. In: Proc. of 8th EDBT, pp. 251–268 (2002)

  27. Jagadish, H.V., Mendelzon, A.O., Milo, T.: Similarity-based queries. In: Proc. of the 14th ACM PODS, pp. 36–45 (1995)

  28. Kahveci, T., Singh, A., Gürel, A.: Similarity searching for multi-attribute sequences. In: Proc. of SSDBM (2002)

  29. Kahveci, T., Singh, A.K.: Variable length queries for time series data. In: Proc. of IEEE ICDE, pp. 273–282 (2001)

  30. Keogh, E.: Exact indexing of dynamic time warping. In: Proc. of VLDB, pp. 406–417 (2002)

  31. Keogh, E., Chakrabarti, K., Mehrotra, S., Pazzani, M.: Locally adaptive dimensionality reduction for indexing large time series databases. In: Proc. of ACM SIGMOD, pp. 151–162 (2001)

  32. Keogh, E., Kasetty, S.: On the need for time series data mining benchmarks: a survey and empirical demonstration. In: Proc. of SIGKDD, pp. 102–111 (2002)

  33. Kim, S., Park, S., Chu, W.: An index-based approach for similarity search supporting time warping in large sequence databases. In: Proc. of IEEE ICDE, pp. 607–614 (2001)

  34. Kovács-Vajna, Z.: A fingerprint verification system based on triangular matching and dynamic time warping. IEEE Trans Pattern Anal Mach Intell, 22(11), (2000)

  35. Lee, S.-L., Chun, S.-J., Kim, D.-H., Lee, J.-H., Chung, C.-W.: Similarity search for multidimensional data sequences. In: Proc. of IEEE ICDE, pp. 599–608 (2000)

  36. Levenshtein, V.: Binary codes capable of correcting deletions, insertions, and reversals. Soviet Physics—Doklady 10(10), 707–710 (1966)

    MathSciNet  Google Scholar 

  37. Munich, M., Perona, P.: Continuous dynamic time warping for translation-invariant curve alignment with applications to signature verification. In: 7th International Conference on Computer Vision, pp. 108–115 (1999)

  38. Park, S., Chu, W., Yoon, J., Hsu, C.: Efficient searches for similar subsequences of different lengths in sequence databases. In: Proc. of IEEE ICDE, pp. 23–32 (2000)

  39. Perng, S., Wang, H., Zhang, S., Parker, D.S.: Landmarks: A new model for similarity-based pattern querying in time series databases. In: Proc. of IEEE ICDE, pp. 33–42 (2000)

  40. Pfoser, D., Jensen, C.S.: Capturing the uncertainty of moving-object representations. Lecture Notes in Computer Science, vol 1651 (1999)

  41. Pratt, K.B.: Locating patterns in discrete time-series. Master’s thesis (2001)

  42. Qu, Y., Wang, C., Wang, X.: Supporting fast search in time series for movement patterns in multiple scales. In: Proc. of ACM CIKM, pp. 251–258 (1998)

  43. Rafiei, D., Mendelzon, A.: Querying time series data based on similarity. In: IEEE Trans. Knowl. Data Eng., 12(5), 675–693 (2000)

    Article  Google Scholar 

  44. Ratanamahatana, C.A., Keogh, E.: Everything you know about dynamic time warping is wrong. In: 3rd Workshop on Mining Temporal and Sequential Data (SIGKDD), 2004

  45. Ratanamahatana, C.A., Keogh, E.: Making time-series classification more accurate using learned constraints. In: Proc. of SIAM International Conference on Data Mining (SDM) (2004)

  46. Rath, T., Manmatha, R.: Word image matching using dynamic time warping. Tec Report MM-38. Center for Intelligent Information Retrieval, University of Massachusetts Amherst (2002)

  47. Roddick, J.F., Hornsby, K.: Temporal, spatial and spatio-temporal data mining. TSDM (2000)

  48. Roussopoulos, N., Kelley, S., Vincent, F.: Nearest neighbor queries. In: Proc. of ACM SIGMOD (1995)

  49. Shimada, M., Uehara, K.: Discovery of correlation from multi-stream of human motion. Discovery Sci., pp. 290–294 (2000)

  50. Strik, H., Boves, L.: Averaging physiological signals with the use of a dtw algorithm. In: SPEECH’88, 7th FASE Symposium, Book 3, pp. 883–890 (1988)

  51. Valdes-Perez, R.E., Stone, C.A.: Systematic detection of subtle spatio-temporal patterns in time-lapse imaging: II. Particle migrations. Bioimaging 6(2), 71–78 (1998)

    Article  Google Scholar 

  52. Vlachos, M., Kollios, G., Gunopulos, D.: Discovering similar multidimensional trajectories. In: Proc. of IEEE ICDE, pp. 673–684 (2002)

  53. Yi, B.-K., Faloutsos, C.: Fast time sequence indexing for arbitrary Lp norms. In: Proc. of VLDB, pp. 385–394 (2000)

  54. Yi, B.-K., Jagadish, H.V., Faloutsos, C.: Efficient retrieval of similar time sequences under time warping. In: Proc. of IEEE ICDE, pp. 201–208 (1998)

  55. Zhu, Y., Shasha, D.: Warping indexes with envelope transforms for query by humming. In: Proc. of ACM SIGMOD, pp. 181–192 (2003)

Download references

Author information

Authors and Affiliations

Authors

Additional information

Edited by B. Ooi

Rights and permissions

Reprints and permissions

About this article

Cite this article

Vlachos, M., Hadjieleftheriou, M., Gunopulos, D. et al. Indexing Multidimensional Time-Series. The VLDB Journal 15, 1–20 (2006). https://doi.org/10.1007/s00778-004-0144-2

Download citation

  • Received:

  • Revised:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s00778-004-0144-2

Keywords

Navigation