Abstract
In this paper we present a system for mining very large spatio-temporal datasets. The system comprises two main layers: the mining layer and the visualization layer. The mining layer implements a new approach based on a 2-pass strategy to efficiently support the data-mining process, address the spatial and temporal dimensions of the dataset, and visualize and interpret results. In the first pass, the data objects are grouped according to their close similarity. In the second pass these groups are clustered to produce new models or patterns. The main reason for this 2-pass strategy is that the datasets are too large for traditional mining and cannot support the interactivity required by the visualization layer.
An Erratum to this chapter can be found at http://dx.doi.org/10.1007/978-3-540-74827-4_171
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Zhang, T., Ramakrishnan, R., Livny, M.: BIRCH: An efficient data clustering method for very large databases. In: Proc. of ACM SIGMOD (1996)
Guha, S., Rastogi, R., Shim, K.: CURE: An efficient clustering algorithm for large databases. In: Proc. of ACM SIGMOD (1998)
Nanopoulos, A., Theodoridis, Y., Manolopoulos, Y.: C2P: Clustering based on closest pairs. In: Proc. of VLDB (2001)
Martin, E., Kriegel, H.P., Sander, J., Xu, X.: A density-based algorithm for discovering clusters in large spatial databases with noise. In: Proc. of KDD (1996)
Ankerst, M., Breunig, M., Kriegel, H.P., Sander, J.: OPTICS: Ordering points to identify the clustering structure. In: Proc. of ACM SIGMOD, pp. 49–60 (1999)
Vlachos, M., Kollios, G., Gunopulos, D.: Discovering similar multidimensional trajectories. In: Proc. of ICDE, pp. 673–684 (2002)
Compieta, P., Di Martino, S., Bertolotto, M., Ferrucci, F., Kechadi, T.: Exploratory Spatio-Temporal Data Mining and Visualization. Journal of Elsevier Science, Special Issue on Human-GIS Interaction, 18(3) (June 2007)
Bertolotto, M., Di Martino, S., Ferrucci, F., Kechadi, T.: A Visualization System for Collaborative Spatio-Temporal Data Mining. International Journal of Geographical Information Science 21(7) (July 2007)
Spenke, M., Beilken, C.: Visual, Interactive Data Mining with InfoZoom – The Financial Data Set. In: Żytkow, J.M., Rauch, J. (eds.) Principles of Data Mining and Knowledge Discovery. LNCS (LNAI), vol. 1704, pp. 15–18. Springer, Heidelberg (1999)
Keim, D.A., Panse, C., Sips, M.: Visual Data Mining in Large Geospatial Poin Sets. IEEE Computer Graphics and Applications 24(5), 36–44 (2004)
Roddick, J.F., Lees, B.G.: Paradigms for spatial and spatio-temporal data mining. In: Miller, H.G., Han, J. (eds.) Geographic Data Mining and Knowledge Discovery, Taylor & Francis, London (2001)
National Hurricane Center, Tropical Cyclone Report: Hurricane Isabel (2003), http://www.tpc.ncep.noaa.gov/2003isabel.shtml
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2007 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Kechadi, T., Bertolotto, M. (2007). Scalable 2-Pass Data Mining Technique for Large Scale Spatio-temporal Datasets. In: Apolloni, B., Howlett, R.J., Jain, L. (eds) Knowledge-Based Intelligent Information and Engineering Systems. KES 2007. Lecture Notes in Computer Science(), vol 4693. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-74827-4_99
Download citation
DOI: https://doi.org/10.1007/978-3-540-74827-4_99
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-74826-7
Online ISBN: 978-3-540-74827-4
eBook Packages: Computer ScienceComputer Science (R0)