Abstract
Abnormal events in earth science have great influence on both the natural environment and the human society. Finding association patterns among these events has great significance. Because data in earth science has characteristics of mass, high dimension, spatial autocorrelation and time delay, existing mining technologies cannot be directly used on it. We propose a RSNN (range-based searching nearest neighbors) spatial clustering algorithm to reduce the data size and auto-correlation. Based on the clustered data, we propose a GEAM (geographic episode association pattern mining) algorithm which can deal with events time lags and find interesting patterns with specific constraints, to mine the association patterns. We carried out experiments on global climate datasets and found many interesting association patterns. Some of the patterns are coincident with known knowledge in climate science, which indicates the correctness and feasibilities of our methods, and the others are unknown to us before, which will give new information to this research field.
Similar content being viewed by others
References
Wold S. Principal component analysis. Chemom Intell Lab Syst, 1987, 2: 37–52
Klema V, Laub A. The singular value decomposition: Its computation and some applications. IEEE Trans Autom Contr, 1980, 25(2): 164–176
Mannila H, Toivonen H, Verkamo A. Discovering Frequent Episodes in Sequences. New York: AAAI Press, 1995. 210–215
Mannila H, Toivonen H, Verkamo A. Discovery of frequent episodes in event sequences. Data Min Knowl Disc, 1997, 1(3): 259–289
Garriga G C. Discovering Unbounded Episodes in Sequential Data. Berlin: Springer, 2003. 83–93
Meger N, Rigotti C. Constraint-Based Mining of Episode Rules and Optimal Window Sizes. Berlin: Springer, 2004. 313–324
Tobler W. A computer movie simulating urban growth in the detroit region. Econ Geogr, 1970, 46: 234–240
Ertz L, Steinbach M, Kumar V. Finding Topics in Collections of Documents: A Shared Nearest Neighbor Approach. Boston: Kluwer Academic Publishers, 2001. 84–90
Steinbach M, Tan P, Kumar V, et al. Discovery of Climate Indices Using Clustering. New York: ACM Press, 2003. 446–455
Steinbach M, Tan P, Kumar V, et al. Temporal Data Mining for the Discovery and Analysis of Ocean Climate Indices. New York: ACM Press, 2002. 1–10
Tan P, Steinbach M, Kumar V et al. Finding Spatio-Temporal Patterns in Earth Science Data. New York: ACM Press, 2001. 1–12
Su F, Zhou C, Lyne V, et al. A data-mining approach to determine the spatio-temporal relationship between environmental factors and fish distribution. Ecol Model, 2004, 174: 421–431
Harms S K, Deogun J S, Tadesse T. Discovering Sequential Association Rules with Constraints and Time Lags in Multiple Sequences. Berlin: Springer, 2002, 2366: 432–442
Mannila H, Toivonen H. Discovering Generalized Episodes Using Minimal Occurrences. New York: ACM Press, 1996. 146–151
Harms S K, Deogun J S. Sequential Association Rule Mining with Time Lags. Berlin: Springer, 2004, 22(1): 7–22
Xue J B, Zhong W, et al. Relationship between the droughts, floods and climatic changes in Guangdong during the history period. Geo Geo-Info Sci (in Chinese), 2005, 5: 75–79
Oldenborgh G J, Burgers G, Tank A K, et al. On the El-Nino teleconnection to spring precipitation in Europe. Int J Clim, 2000, 20: 565–574
Liu Q, Liu Z Y, Pan A J. Conceptual model about the interaction between El Niño/Southern Oscillation and Quasi-Biennial Oscillation in far west equatorial Pacific. Sci China Ser D-Earth Sci, 2006, 49(8): 889–896
Author information
Authors and Affiliations
Corresponding author
Additional information
Supported by the National Hi-Tech Research and Development Program of China (Grand No. 2006AA12Z217) and the National Natural Science Foundation of China (Grant No. 60703066)
Rights and permissions
About this article
Cite this article
Wu, T., Song, G., Ma, X. et al. Mining geographic episode association patterns of abnormal events in global earth science data. Sci. China Ser. E-Technol. Sci. 51 (Suppl 1), 155–164 (2008). https://doi.org/10.1007/s11431-008-5008-3
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s11431-008-5008-3