Skip to main content

Correlation Analysis of Spatial Time Series Datasets: A Filter-and-Refine Approach

  • Conference paper
  • First Online:
Advances in Knowledge Discovery and Data Mining (PAKDD 2003)

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 2637))

Included in the following conference series:

Abstract

A spatial time series dataset is a collection of time series, each referencing a location in a common spatial framework. Correlation analysis is often used to identify pairs of potentially interacting elements from the cross product of two spatial time series datasets. However, the computational cost of correlation analysis is very high when the dimension of the time series and the number of locations in the spatial frameworks are large. The key contribution of this paper is the use of spatial autocorrelation among spatial neighboring time series to reduce computational cost. A filter-and-refine algorithm based on coning, i.e. grouping of locations, is proposed to reduce the cost of correlation analysis over a pair of spatial time series datasets. Cone-level correlation computation can be used to eliminate (filter out) a large number of element pairs whose correlation is clearly below (or above) a given threshold. Element pair correlation needs to be computed for remaining pairs. Using experimental studies with Earth science datasets, we show that the filter-and-refine approach can save a large fraction of the computational cost, particularly when the minimal correlation threshold is high.

This work was partially supported by NASA grant No. NCC 2 1231 and by Army High Performance Computing Research Center contract number DAAD19-01-2-0014. The content of this work does not necessarily reflect the position or policy of the government and no official endorsement should be inferred. AHPCRC and Minnesota Supercomputer Institute provided access to computing facilities.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 84.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 109.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. R. Agrawal, C. Faloutsos, and A. Swami. Efficient Similarity Search In Sequence Databases. In Proc. of the 4th Int’l Conference of Foundations of Data Organization and Algorithms, 1993.

    Google Scholar 

  2. G. Box, G. Jenkins, and G. Reinsel. Time Series Analysis: Forecasting and Control. Prentice Hall, 1994.

    Google Scholar 

  3. B. W. Lindgren. Statistical Theory (Fourth Edition). Chapman-Hall, 1998.

    Google Scholar 

  4. K. Chan and A. W. Fu. Efficient Time Series Matching by Wavelets. In Proc. of the 15th ICDE, 1999.

    Google Scholar 

  5. N. Cressie. Statistics for Spatial Data. John Wiley and Sons, 1991.

    Google Scholar 

  6. Christos Faloutsos. Searching Multimedia Databases By Content. Kluwer Academic Publishers, 1996.

    Google Scholar 

  7. R. Grossman, C. Kamath, P. Kegelmeyer, V. Kumar, and R. Namburu, editors. Data Mining for Scientific and Engineering Applications. Kluwer Academic Publishers, 2001.

    Google Scholar 

  8. D. Gunopulos and G. Das. Time Series Similarity Measures and Time Series Indexing. SIGMOD Record, 30(2):624–624, 2001.

    Article  Google Scholar 

  9. J. Han and M. Kamber. Data Mining: Concepts and Techniques. Morgan Kaufmann Publishers, 2000.

    Google Scholar 

  10. E. Keogh and M. Pazzani. An Indexing Scheme for Fast Similarity Search in Large Time Series Databases. In Proc. of 11th Int’l Conference on Scientific and Statistical Database Management, 1999.

    Google Scholar 

  11. Y. Moon, K. Whang, and W. Han. A Subsequence Matching Method in Time-Series Databases Based on Generalized Windows. In Proc. of ACM SIGMOD, Madison, WI, 2002.

    Google Scholar 

  12. C. Potter, S. Klooster, and V. Brooks. Inter-annual Variability in Terrestrial Net Primary Production: Exploration of Trends and Controls on Regional to Global Scales. Ecosystems, 2(1):36–48, 1999.

    Article  Google Scholar 

  13. J. Roddick and K. Hornsby. Temporal, Spatial, and Spatio-Temporal Data Mining. In First Int’l Workshop on Temporal, Spatial and Spatio-Temporal Data Mining, 2000.

    Google Scholar 

  14. S. Shekhar and S. Chawla. Spatial Databases: A Tour. Prentice Hall, 2002.

    Google Scholar 

  15. S. Shekhar, S. Chawla, S. Ravada, A. Fetterer, X. Liu, and C.T. Lu. Spatial databases: Accomplishments and research needs. IEEE Transactions on Knowledge and Data Engineering, 11(1):45–55, 1999.

    Article  Google Scholar 

  16. M. Steinbach, P. Tan, V. Kumar, C. Potter, S. Klooster, and A. Torregrosa. Data Mining for the Discovery of Ocean Climate Indices. In Proc of the Fifth Workshop on Scientific Data Mining, 2002.

    Google Scholar 

  17. P. Tan, M. Steinbach, V. Kumar, C. Potter, S. Klooster, and A. Torregrosa. Finding Spatio-Temporal Patterns in Earth Science Data. In KDD 2001 Workshop on Temporal Data Mining, 2001.

    Google Scholar 

  18. G. H. Taylor. Impacts of the El Niño/Southern Oscillation on the Pacific Northwest. http://www.ocs.orst.edu/reports/enso_pnw.html.

  19. Michael F. Worboys. GIS — A Computing Perspective. Taylor and Francis, 1995.

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2003 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Zhang, P., Huang, Y., Shekhar, S., Kumar, V. (2003). Correlation Analysis of Spatial Time Series Datasets: A Filter-and-Refine Approach. In: Whang, KY., Jeon, J., Shim, K., Srivastava, J. (eds) Advances in Knowledge Discovery and Data Mining. PAKDD 2003. Lecture Notes in Computer Science(), vol 2637. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-36175-8_53

Download citation

  • DOI: https://doi.org/10.1007/3-540-36175-8_53

  • Published:

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-04760-5

  • Online ISBN: 978-3-540-36175-6

  • eBook Packages: Springer Book Archive

Publish with us

Policies and ethics