Abstract
In spatial clustering, spatial objects are grouped into clusters according to their similarities. In terms of learning or pattern recognition, it belongs to the identification of structures/classes through an unsupervised process. In terms of data mining, it is the discovery of intrinsic classes, particularly new classes, in spatial data. It formulates class structures and determines the number of classes. I have examined in Chap. 2 the importance of clustering as a means for unraveling interesting, useful and natural patterns in spatial data. The process generally does not involve how to separate predetermined classes, or how to determine whether classes are significantly different from each other, or how to assign new objects to given classes. Another fundamental issue of spatial knowledge discovery involves spatial classification. It essentially deals with the separation of pre-specified classes and the assignment of new spatial objects to these classes on the basis of some measurements (with respect to selected features) about them. In terms of learning or pattern recognition, it is actually a supervised learning process which searches for the decision surface separating appropriately various classes. In terms of data mining, it often involves the discovery of classification rules from the training/learning data set that can separate distinct/genuine classes of spatial objects and the assignment of new spatial objects to these labeled classes. Whether the pre-specified classes are significantly different is usually not the main concern in classification. It can be determined by procedures such as the analysis of variance in statistics.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Aha DW, Kibler D, Albert MK (1991) Instance-based learning algorithms. Mach Learn 6:37–66
Allenby GM, Rossi PE (1994) Modeling household purchase behavior with logistic normal regression. J Am Stat Assoc 89:1218–1231
Anderson JA (1982) Logistic discrimination. In: Krishnaiah PR, Kanal L (eds) Hand book of statistics, vol 2. North-Holland, Amsterdam, pp 169–191
Angulo C, Catala A (2000) K_SVCR. A Multi-class support vector machine. In: Lopez de Mantaras R, Plaza E (eds) ECML 2000, LNAI 1810. Springer, Berlin, pp. 31–38
Berkson J (1944) Application of the logistic function to bio-assay. J Am Stat Assoc 39:357–365
Brown M, Lewis HG, Gunn SR (1999) Support vector machines for spectral unmixing. IGRASS’99 2:1363–1365
Burges C, Scholkopf B (1997) Improving the accuracy and speed of support vector machines. In: Mozer M, Jordan M, Petsche T (eds) Neural information processing systems. MIT, Cambridge
Burges CJC (1998) A tutorial on support vector machines for pattern recognition. Data Min Knowl Discov 2:121–167
Carpenter GA, Grossberg S, Reynolds JH (1991) ARTMAP: supervised real time learning and classification of nonstationary data by a self-organising neural network. Neural Networks 4:565–588
Collett D (1991) Modelling binary data. Chapman and Hall, London
Das Gupta S (1980) Discriminant analysis. In: Krishnaiah PR, Kanal L, Fisher RA (eds) An appreciation. Springer, New York, pp 161–170
Efron B (1975) The efficiency of logistic regression compared to normal discriminant analysis. J Am Stat Assoc 70(352):892–898
Fisher RA (1936) The use of multiple measurements in taxonomic problems. Ann Eugen 7:179–188
Fitzpatrick DB (1976) An analysis of bank credit card profit. J Bank Res 7:199–205
Fix E, Hodges JL (1951) Discriminatory analysis – nonparametric discrimination: consistency properties. Report No. 4, Project no. 21-29-004. USAF School of Aviation Medicine, Randolph Field, Texas. Reprinted in Int Stat Rev 57(1989):238–247
Geisser S (1982) Beyesian discrimination. In: Krishnaiah PR, Kanal L (eds) Hand book of statistics, vol 2. North-Holland, Amsterdam, pp 101–120
Guadagni P, Little J (1983) A logit model of brand choice calibrated on scanner data. Market Sci 2:203–238
Hand DJ (1982) Kernel discriminant analysis. Research Studies Press, Letchworth
Hand DJ, Henley WE (1997) Statistical classification methods in consumer credit scoring. J Roy Stat Soc Series A 160:523–541
Hart PE (1968) The condensed nearest neighbour rule. IEEE Trans Inform Theor 14:515–516
Hastie TJ, Tibshirani RJ (1996) Discriminant analysis by Gaussian mixtures. J R Stat Soc B 58:155–176
Hearst MA, Scholkopf B, Dumais S (1998) Trends and controversies-support vector machines. IEEE Intell Syst 13(4):18–28
Hermes L, Frieauff D, Puzicha J, Buhmann JM (1999) Support vector machines for land usage classification in Landsat TM imagery. In: Proceeding of the IEEE international geoscience and remote sensing symposium, vol. 1. Hamburg, pp. 348–350
Hosmer DW, Lemeshow S (1989) Applied logistic regression. Wiley, New York
John GH, Langley P (1995) Estimating continuous distributions in Bayesian classifiers In Proceedings of the 11th conference on uncertainty in artificial intelligence. Morgan Kaufmann, San Mateo, CA
Keerthi SS, Shevade SK, Bhattacharya U, Murthy KRK (2000) A fast iterative nearest point algorithm for support vector machine classifier design. IEEE Trans Neural Network 11(1):124–136
Kohavi R (1996) Scaling up the accuracy of naive-Bayes classifiers: a deceision-tree hybrid. In: Proceedings of the second international conference on knowledge discovery and data mining. Morgan Kaufman, San Mateo CA
Langley P, Sage S (1994) Induction of selective Bayesian classifers. In: Proceedings of the 10th conference on uncertainty in artificial intelligence. Morgan Kaufmann, Seattle, WA,
Leung Y, Leung KS, Ma JH (2003a) Data mining for bank databases (unpublished paper)
Leung Y, Leung KS, Mei CL (2003b) Data mining for credit card promotion in the banking sector (unpublished paper)
Leung Y, Luo JC, Zhou CH, Ma JH (2002b) Support vector machine for spatial feature extraction and classification of high resolution remote sensing images (unpublished paper)
McLachlan GJ (1992) Discriminant analysis and statistical pattern recognition. Wiley, New York
Menard SW (1995) Applied logistic regression analysis. Sage Publication, Thousand Oaks, CA
Pearl J (1988) Probabilistic reasoning in intelligent systems: networks of plausible inference. Morgan Kaufmann, San Mateo, CA
Piramuthu S (1999) Feature selection for financial credit-risk evaluation. Inform J Comput 11(3):258–266
Powell MJD (1987) Radial basis functions for multivariable interpolation: a review. In: Mason JC, Cox MG (eds) Algorithms for Approximation of Functions and Data. Oxford University Press, Oxford, pp 143–167
Ramoni M, Sebastiani P (1996) Robust learning with missing data. Technical Report Kmi-TR-28, Knowledge Media Institute, The Open University
Rubinstein YD, Hastie TJ (1997) Discriminative vs informative learning. In: Proceedings of 3rd international conference on knowledge discovery and data mining. AAAI Press, Menlo Park, CA, pp. 49–53
Scholkopf B, Burges CJC, Smola AJ (1999) Advances in kernel methods: support vector learning. MIT, Cambridge
Scholkopf B, Sung KK, Burges CJC, Girosi F, Niyogi P, Poggio T, Vapnik VN (1997) Comparing support vector machines with Gaussian Kernels to radial basis function classifiers. IEEE Trans Signal Process 45(11):2758–2765
Smith CAB (1947) Some examples of discrimination. Ann Eugen 13:272–282
Stanfill C, Waltz D (1986) Toward memory-based reasoning. Commun ACM 29:1213–1228
Thomas LC, Crook JN, Edelman DB (eds) (1992) Credit scoring and credit control. Clarendon Press, Oxford
Vapnik VN (1995) The nature of statistical learning theory. Springer, New York
Wand MP, Jones MC (1995) Kernel smoothing. Chapman and Hall, London
Xu ZB, Leung Y (2004) How neural networks can be made more effective and efficient: a view of learning theory (unpublished paper)
Author information
Authors and Affiliations
Corresponding author
Rights and permissions
Copyright information
© 2010 Springer-Verlag Berlin Heidelberg
About this chapter
Cite this chapter
Leung, Y. (2010). Statistical Approach to the Identification of Separation Surface for Spatial Data. In: Knowledge Discovery in Spatial Data. Advances in Spatial Science. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-02664-5_3
Download citation
DOI: https://doi.org/10.1007/978-3-642-02664-5_3
Published:
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-02663-8
Online ISBN: 978-3-642-02664-5
eBook Packages: Humanities, Social Sciences and LawLiterature, Cultural and Media Studies (R0)