Spatio-Temporal Data Mining and Knowledge Discovery: Issues Overview
Data mining or knowledge discovery refers to a variety of techniques having the intent of uncovering useful patterns and associations from large databases. The initial steps of data mining are concerned with preparation of data, including data cleaning intended to resolve errors and missing data and integration of data from multiple heterogeneous sources. Next are the steps needed to prepare for actual data mining including the selection of the specific data relevant to the task and the transformation of this data into a format required by the data mining approach. Finally, specific data mining algorithms such as class description, association rules and classification clustering are applied. There are specific characteristics of spatial and temporal data, as found in GIS and multi-media data, that make knowledge discovery in this domain more complex than in mining ordinary data such as found in typical business sales applications. Here we provide a survey of work in spatio-temporal data mining emphasizing the special characteristics. An overview is given of different sources and types of geospatial, oceanographie and meteorological data and the associated issues inherent in their use in knowledge discovery.
Key wordsData Mining spatio-temporal data data preparation
Unable to display preview. Download preview PDF.
- ARGUS, http://cil-www.oce.orst.edu:8080 .
- Burrough P and Frank A (eds.) Geographic Objects with Indeterminate Boundaries, GISDATA Series Vol. 2, London, UK, Taylor and Francis, 1996.Google Scholar
- Chawla S Shekar S Wu W and Ozesmi U Predicting Locations using Map Similarity (Plums): A Framework for Spatial Data Mining. In Proceedings 6 th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining. New York, NY, ACM Press: 243–251, 2000.Google Scholar
- Chung M Wilson R Ladner R Lovitt T Cobb M Abdelguerfi M and Shaw K The Geospatial Information Distribution System (GIDS). In Chaudhri A and Zicari R (eds) Succeeding with Object Databases. New York, NY, Wiley and Sons: 357–378, 2001.Google Scholar
- Ernst I 3D City -Adaptive Capture and Visualization of Cityscapes, GMD First, German National Research Center for Information Technology Institute for Computer Architecture and Software Technology, http://www.first.gmd.de/vista/3dcity/, January 2000.Google Scholar
- FNMOC, http://www.fnmoc.navy.mil , April 2002.
- Geometrix, Inc., http://www.geometrixinc.com , January 2000.
- Han J and Kamber M. Data Mining: Concepts and Techniques. San Diego, CA Academic Press, 2000.Google Scholar
- Hand D Mannila H and Smyth P Principles of Data Mining. Cambridge, MA MIT Press, 2001.Google Scholar
- Irvin, R. Bruce, and David M. McKeown, Jr., Methods for Exploiting the Relationship Between Buildings and Their Shadows in Aerial Imagery, IEEE Transactions on Systems, Man, and Cybernetics, 19, No. 6, 1989.Google Scholar
- Koperski K and Han J Discovery of Spatial Association Rules in Geographic InformationGoogle Scholar
- Databases. In Proceedings of 4th International Symposium on Large Spatial Databases. Berlin, GD, Springer-Verlag: 47–66, 1995.Google Scholar
- Ladner R Petry F Cobb M Fuzzy Set Approaches to Spatial Data Mining of Association Rules. To appear in Transactions in Geographic Information Systems, 2003.Google Scholar
- Lu W Han J and Ooi B Discovery of general knowledge in large spatial databases. In Proceedings of Far East Workshop Geographic Information Systems. Singapore, World Scientific Press: 275–289, 1993.Google Scholar
- MEL, Master Environment Library, Defense Modeling & Simulation Office, http://mel.dmso.mil, January 2000.
- Ng R. and Han J Efficient and effective clustering method for spatial data mining. In Proceedings of 1994 International Conference on Very Large Database. San Francisco, CA, Morgan Kaufmann: 144–155, 1994.Google Scholar
- NGDC, Federal Geographic Data Committee (FGDC) National Geospatial Data Clearinghouse (NGDC), http://www.fgdc.govhttp://22.214.171.124/gateways.html, January 2000.
- NIMA, Digitizing the Future, National Imagery and Mapping Agency.Google Scholar
- NIM A 2000, National Imagery and Mapping Agency, Technical Report 8350.2, January 3, 2000.Google Scholar
- NOAA, National Oceanographie and Atmospheric Administration, http://www.esdim.noaa.gov/noaaserver-bin/NOAAServer, January 2000.
- Roux, Michel, and David M. McKeown, Feature Matching for Building Extraction from Multiple Views, Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition: 46–53, 1994.Google Scholar
- Snyder J Map Projections — A Working Manual, U.S. Geological Survey Professional Paper 1395, U.S. Government Printing Office, Washington, D.C., 1987.Google Scholar
- TMPO, Terrain Resource Repository, Terrain Modeling Project Office, http://www.tmpo.nima.mil/mel, January 2000.
- TOWAN, Tactical Oceanography Wide Area Network, Naval Research Laboratory, Stennis Space Center, http://www7180.nrlssc.navy.mil/homepages/TOWAN/TOWAN.htm, January 2000.
- Trott K Analysis of Digital Topographic Data Issues in Support of Synthetic Environment Terrain Data Base Generation, TEC-0091, U.S. Army Corps of Engineers, Topographic Engineering Center, November 1996.Google Scholar
- USAS, U.S. Army Simulation, Training, and Instrumentation Command, Orlando, Florida,Google Scholar
- SEDRIS and The Synthetic Environment Domain, Volume 1 of the SEDRIS Document SET, 12350 Research Parkway, Orlando, FL, March 28, 1998.Google Scholar
- USGS, Map Scales, Fact Sheet, 015-02, U.S. Geological Survey, February 2002.Google Scholar
- VPF, Departmentment of Defense, Interface Standard for Vector Product Format, MIL-STD 2407, 28 June 1996.Google Scholar