Skip to main content

Scalable 2-Pass Data Mining Technique for Large Scale Spatio-temporal Datasets

  • Conference paper
Knowledge-Based Intelligent Information and Engineering Systems (KES 2007)

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 4693))

  • 1486 Accesses

Abstract

In this paper we present a system for mining very large spatio-temporal datasets. The system comprises two main layers: the mining layer and the visualization layer. The mining layer implements a new approach based on a 2-pass strategy to efficiently support the data-mining process, address the spatial and temporal dimensions of the dataset, and visualize and interpret results. In the first pass, the data objects are grouped according to their close similarity. In the second pass these groups are clustered to produce new models or patterns. The main reason for this 2-pass strategy is that the datasets are too large for traditional mining and cannot support the interactivity required by the visualization layer.

An Erratum to this chapter can be found at http://dx.doi.org/10.1007/978-3-540-74827-4_171

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 169.00
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 219.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Zhang, T., Ramakrishnan, R., Livny, M.: BIRCH: An efficient data clustering method for very large databases. In: Proc. of ACM SIGMOD (1996)

    Google Scholar 

  2. Guha, S., Rastogi, R., Shim, K.: CURE: An efficient clustering algorithm for large databases. In: Proc. of ACM SIGMOD (1998)

    Google Scholar 

  3. Nanopoulos, A., Theodoridis, Y., Manolopoulos, Y.: C2P: Clustering based on closest pairs. In: Proc. of VLDB (2001)

    Google Scholar 

  4. Martin, E., Kriegel, H.P., Sander, J., Xu, X.: A density-based algorithm for discovering clusters in large spatial databases with noise. In: Proc. of KDD (1996)

    Google Scholar 

  5. Ankerst, M., Breunig, M., Kriegel, H.P., Sander, J.: OPTICS: Ordering points to identify the clustering structure. In: Proc. of ACM SIGMOD, pp. 49–60 (1999)

    Google Scholar 

  6. Vlachos, M., Kollios, G., Gunopulos, D.: Discovering similar multidimensional trajectories. In: Proc. of ICDE, pp. 673–684 (2002)

    Google Scholar 

  7. Compieta, P., Di Martino, S., Bertolotto, M., Ferrucci, F., Kechadi, T.: Exploratory Spatio-Temporal Data Mining and Visualization. Journal of Elsevier Science, Special Issue on Human-GIS Interaction, 18(3) (June 2007)

    Google Scholar 

  8. Bertolotto, M., Di Martino, S., Ferrucci, F., Kechadi, T.: A Visualization System for Collaborative Spatio-Temporal Data Mining. International Journal of Geographical Information Science 21(7) (July 2007)

    Google Scholar 

  9. Spenke, M., Beilken, C.: Visual, Interactive Data Mining with InfoZoom – The Financial Data Set. In: Żytkow, J.M., Rauch, J. (eds.) Principles of Data Mining and Knowledge Discovery. LNCS (LNAI), vol. 1704, pp. 15–18. Springer, Heidelberg (1999)

    Google Scholar 

  10. Keim, D.A., Panse, C., Sips, M.: Visual Data Mining in Large Geospatial Poin Sets. IEEE Computer Graphics and Applications 24(5), 36–44 (2004)

    Article  Google Scholar 

  11. Roddick, J.F., Lees, B.G.: Paradigms for spatial and spatio-temporal data mining. In: Miller, H.G., Han, J. (eds.) Geographic Data Mining and Knowledge Discovery, Taylor & Francis, London (2001)

    Google Scholar 

  12. National Hurricane Center, Tropical Cyclone Report: Hurricane Isabel (2003), http://www.tpc.ncep.noaa.gov/2003isabel.shtml

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2007 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Kechadi, T., Bertolotto, M. (2007). Scalable 2-Pass Data Mining Technique for Large Scale Spatio-temporal Datasets. In: Apolloni, B., Howlett, R.J., Jain, L. (eds) Knowledge-Based Intelligent Information and Engineering Systems. KES 2007. Lecture Notes in Computer Science(), vol 4693. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-74827-4_99

Download citation

  • DOI: https://doi.org/10.1007/978-3-540-74827-4_99

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-74826-7

  • Online ISBN: 978-3-540-74827-4

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics