Scalable 2-Pass Data Mining Technique for Large Scale Spatio-temporal Datasets

Kechadi, Tahar; Bertolotto, Michela

doi:10.1007/978-3-540-74827-4_99

Tahar Kechadi⁴ &
Michela Bertolotto⁵

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 4693))

Included in the following conference series:

International Conference on Knowledge-Based and Intelligent Information and Engineering Systems

1486 Accesses

Abstract

In this paper we present a system for mining very large spatio-temporal datasets. The system comprises two main layers: the mining layer and the visualization layer. The mining layer implements a new approach based on a 2-pass strategy to efficiently support the data-mining process, address the spatial and temporal dimensions of the dataset, and visualize and interpret results. In the first pass, the data objects are grouped according to their close similarity. In the second pass these groups are clustered to produce new models or patterns. The main reason for this 2-pass strategy is that the datasets are too large for traditional mining and cannot support the interactivity required by the visualization layer.

An Erratum to this chapter can be found at http://dx.doi.org/10.1007/978-3-540-74827-4_171

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 169.00; Price excludes VAT (USA)

Softcover Book: USD 219.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

References

Zhang, T., Ramakrishnan, R., Livny, M.: BIRCH: An efficient data clustering method for very large databases. In: Proc. of ACM SIGMOD (1996)
Google Scholar
Guha, S., Rastogi, R., Shim, K.: CURE: An efficient clustering algorithm for large databases. In: Proc. of ACM SIGMOD (1998)
Google Scholar
Nanopoulos, A., Theodoridis, Y., Manolopoulos, Y.: C2P: Clustering based on closest pairs. In: Proc. of VLDB (2001)
Google Scholar
Martin, E., Kriegel, H.P., Sander, J., Xu, X.: A density-based algorithm for discovering clusters in large spatial databases with noise. In: Proc. of KDD (1996)
Google Scholar
Ankerst, M., Breunig, M., Kriegel, H.P., Sander, J.: OPTICS: Ordering points to identify the clustering structure. In: Proc. of ACM SIGMOD, pp. 49–60 (1999)
Google Scholar
Vlachos, M., Kollios, G., Gunopulos, D.: Discovering similar multidimensional trajectories. In: Proc. of ICDE, pp. 673–684 (2002)
Google Scholar
Compieta, P., Di Martino, S., Bertolotto, M., Ferrucci, F., Kechadi, T.: Exploratory Spatio-Temporal Data Mining and Visualization. Journal of Elsevier Science, Special Issue on Human-GIS Interaction, 18(3) (June 2007)
Google Scholar
Bertolotto, M., Di Martino, S., Ferrucci, F., Kechadi, T.: A Visualization System for Collaborative Spatio-Temporal Data Mining. International Journal of Geographical Information Science 21(7) (July 2007)
Google Scholar
Spenke, M., Beilken, C.: Visual, Interactive Data Mining with InfoZoom – The Financial Data Set. In: Żytkow, J.M., Rauch, J. (eds.) Principles of Data Mining and Knowledge Discovery. LNCS (LNAI), vol. 1704, pp. 15–18. Springer, Heidelberg (1999)
Google Scholar
Keim, D.A., Panse, C., Sips, M.: Visual Data Mining in Large Geospatial Poin Sets. IEEE Computer Graphics and Applications 24(5), 36–44 (2004)
Article Google Scholar
Roddick, J.F., Lees, B.G.: Paradigms for spatial and spatio-temporal data mining. In: Miller, H.G., Han, J. (eds.) Geographic Data Mining and Knowledge Discovery, Taylor & Francis, London (2001)
Google Scholar
National Hurricane Center, Tropical Cyclone Report: Hurricane Isabel (2003), http://www.tpc.ncep.noaa.gov/2003isabel.shtml

Download references

Author information

Authors and Affiliations

University College Dublin, Belfield, Dublin 4, Ireland
Tahar Kechadi
Sergio Di Martino, Filomena Ferrucci, Dip. di Matematica e Informatica, Università degli Studi di Salerno, Email: sdimartino,fferrucci@unisa.it, Italy
Michela Bertolotto

Authors

Tahar Kechadi
View author publications
You can also search for this author in PubMed Google Scholar
Michela Bertolotto
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

Dipartimento di Scienze dell’Informazione, Università degli Studi di Milano, Via Comelico 39/41, 20135, Milano, Italy
Bruno Apolloni
Centre for SMART Systems, School of Engineering, University of Brighton, BN2 4GJ, Brighton, UK
Robert J. Howlett
Knowledge-Based Intelligent Engineering Systems Centre, University of South Australia, Mawson Lakes, SA 5095, Adelaide, Australia
Lakhmi Jain

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Kechadi, T., Bertolotto, M. (2007). Scalable 2-Pass Data Mining Technique for Large Scale Spatio-temporal Datasets. In: Apolloni, B., Howlett, R.J., Jain, L. (eds) Knowledge-Based Intelligent Information and Engineering Systems. KES 2007. Lecture Notes in Computer Science(), vol 4693. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-74827-4_99

Download citation

DOI: https://doi.org/10.1007/978-3-540-74827-4_99
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-74826-7
Online ISBN: 978-3-540-74827-4
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics