Discovery of Interesting Regions in Spatial Data Sets Using Supervised Clustering

  • Christoph F. Eick
  • Banafsheh Vaezian
  • Dan Jiang
  • Jing Wang
Conference paper
Part of the Lecture Notes in Computer Science book series (LNCS, volume 4213)

Abstract

The discovery of interesting regions in spatial datasets is an important data mining task. In particular, we are interested in identifying disjoint, contiguous regions that are unusual with respect to the distribution of a given class; i.e. a region that contains an unusually low or high number of instances of a particular class. This paper centers on the discussion of techniques, methodologies, and algorithms to discover such regions. A measure of interestingness and a supervised clustering framework are introduced for this purpose. Moreover, three supervised clustering algorithms are proposed in the paper: an agglomerative hierarchical supervised clustering named SCAH, an agglomerative, grid-based clustering method named SCHG, and lastly an algorithm named SCMRG which searches a multi-resolution grid structure top down for interesting regions. Finally, experimental results of applying the proposed framework and algorithms to the problem of identifying hotspots in spatial datasets are discussed.

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. 1.
    Basu, S., Bilenko, M., Mooney, R.: Comparing and Unifying Search-based and Similarity-Based Approaches to Semi-Supervised Clustering. In: Proc. ICML 2003 Workshop on The Continuum from Labeled to Unlabeled Data in Machine Learning and Data Mining, Washington, DC, pp. 42–29 (2003)Google Scholar
  2. 2.
    Brimicombe, A.J.: Cluster Detection in Point Event Data Having Tendency Towards Spatially Repetitive Events. In: 8th International Conference on GeoComputation, Ann Arbor, Michigan (2005)Google Scholar
  3. 3.
    Chou, Y.: Exploring Spatial Analysis in Geographic Information System. Onward Press (1997), ISBN: 1-56690-119-7Google Scholar
  4. 4.
    Cressie, N.: In Statistics for Spatial Data. Wiley-Interscience, Hoboken (1993)Google Scholar
  5. 5.
    Demiriz, A., Benett, K.-P., Embrechts, M.J.: Semi-supervised Clustering using Genetic Algorithms. In: Proc. ANNIE, St. Louis, Missouri (1999)Google Scholar
  6. 6.
    Eick, C., Zeidat, N.: Using Supervised Clustering to Enhance Classifiers. In: Proc. 15th International Symposium on Methodologies for Intelligent Systems, Saratoga Springs, New York (2005)Google Scholar
  7. 7.
    Eick, C., Zeidat, N., Zhao, Z.: Supervised Clustering - Algorithms and Benefits, UH-Technical Report UH-CS-05-10; In: Proc. International Conference on Tools with AI (ICTAI), Boca Raton, Florida, pp. 774–776 (2004) (short version appeared)Google Scholar
  8. 8.
    Klösgen, W., May, M.: Spatial Subgroup Mining Integrated in an Object-Relational Spatial Database. In: Elomaa, T., Mannila, H., Toivonen, H. (eds.) PKDD 2002. LNCS, vol. 2431, pp. 275–286. Springer, Heidelberg (2002)CrossRefGoogle Scholar
  9. 9.
    Koperski, K., Han, J.: Discovery of Spatial Association Rules in Geographic Spatial Data bases. In: Proc. 4th Int. Symp. Advances in Spatial Databases, Maine, pp. 47–66 (1995)Google Scholar
  10. 10.
    Shekhar, S., Huang, Y.: Discovering Spatial Co-location Patterns: A Summary of Results. In: Jensen, C.S., Schneider, M., Seeger, B., Tsotras, V.J. (eds.) SSTD 2001. LNCS, vol. 2121, p. 236. Springer, Heidelberg (2001)CrossRefGoogle Scholar
  11. 11.
    Tay, S.C., Lim, K.H., Yap, L.C.: Spatial Data Mining: Clustering of Hot Spots and Pattern Recognition. IEEE, Los Alamitos (2003)Google Scholar
  12. 12.
    UH Data Mining & Machine Learning Group, http://www.tlc2.uh.edu/dmmlg/Datasets
  13. 13.
    Wang, W., Yang, J., Muntz, R.: STING: A Statistical Information Grid Approach to Spatial Data Mining. In: Proceedings of the 23rd VLDB Conference, Athens, Greece (1997)Google Scholar
  14. 14.
    Williams, G.J.: Evolutionary Hot Spots Data Mining, An Architecture for Exploring for Interesting Discoveries. In: Pacific Asia Conference on Knowledge Discovery and Data Mining, Beijing, China (1999)Google Scholar

Copyright information

© Springer-Verlag Berlin Heidelberg 2006

Authors and Affiliations

  • Christoph F. Eick
    • 1
  • Banafsheh Vaezian
    • 1
  • Dan Jiang
    • 1
  • Jing Wang
    • 1
  1. 1.Department of Computer ScienceUniversity of HoustonHoustonU.S.A

Personalised recommendations