Abstract
Spatial data mining is a process used to discover interesting but not explicitly available, highly usable patterns embedded in both spatial and non-spatial data, which are possibly stored in a spatial database. An important application of spatial data mining methods is the extraction of knowledge from a Geographic Information System (GIS). INGENS (INductive GEographic iNformation System) is a prototype GIS which integrates data mining tools to assist users in their task of topographic map interpretation. The spatial data mining process is aimed at a user who controls the parameters of the process by means of a query written in a mining query language. In this paper, we present SDMOQL (Spatial Data Mining Object Query Language), a spatial data mining query language used in INGENS, whose design is based on the standard OQL (Object Query Language). Currently, SDMOQL supports two data mining tasks: inducing classification rules and discovering association rules. For both tasks the language permits the specification of the task-relevant data, the kind of knowledge to be mined, the background knowledge and the hierarchies, the interestingness measures and the visualization for discovered patterns. Some constraints on the query language are identified by the particular mining task. The syntax of the query language is described and the application to a real repository of maps is briefly reported.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
DeMers, M.N.: Fundamentals of Geographic Information Systems, 2nd edn. John Wiley & Sons, Chichester (2000)
Dzeroski, S., Lavrac, N. (eds.): Relational Data Mining. Springer, Berlin, Germany (2001)
Egenhofer, M.J.: Reasoning about Binary Topological Relations. In: Proceedings of the Second Symposium on Large Spatial Databases, Zurich, Switzerland, pp. 143–160 (1991)
Elfeky, M.G., Saad, A.A., Fouad, S.A.: ODMQL: Object Data Mining Query Language. In: Proceedings of Symposium on Objects and Databases, Sophia Antipolis, France (2001)
Ester, M., Frommelt, A., Kriegel, H.P., Sander, J.: Algorithms for characterization and trend detection in spatial databases. In: Proceedings of the 4th Int. Conf. on Knowledge Discovery and Data Mining, New York City, NY, pp. 44–50 (1998)
Ester, M., Gundlach, S., Kriegel, H.P., Sander, J.: Database primitives for spatial data mining. In: Proceedings of Int. Conf. on Database in Office, Engineering and Science (BTW 1999), Freiburg, Germany (1999)
Güting, R.H.: An introduction to spatial database systems. VLDB Journal (3,4), 357–399 (1994)
Han, J., Fu, Y., Wang, W., Koperski, K., Zaïane, O.R.: DMQL: a data mining query language for relational databases. In: Proceedings of the Workshop on Research Issues on Data Mining and Knowledge Discovery, Montreal, QB, pp. 27–34 (1996)
Han, J., Koperski, K., Stefanovic, N.: GeoMiner: A System Prototype for Spatial Data Mining. In: Peckham, J. (ed.) SIGMOD 1997, Proceedings of the ACM-SIGMOD International Conference on Management of Data. SIGMOD Record, vol. 26(2), pp. 553–556 (1997)
Han, J., Kamber, M.: Data mining. Morgan Kaufmann Publishers, San Francisco (2000)
Han, J., Kamber, M., Tung, A.K.H.: Spatial clustering methods in data mining. In: Miller, H.J., Han, J. (eds.) Geographic Data Mining and Knowledge Discovery, Taylor and Francis, London, UK, pp. 188–217 (2001)
Imielinski, T., Virmani, A.: MSQL: A query language for database mining. Data Mining and Knowledge Discovery 3(4), 373–408 (1999)
Klosgen, W., May, M.: Spatial Subgroup Mining Integrated in an Object-Relational Spatial Database. In: Elomaa, T., Mannila, H., Toivonen, H. (eds.) PKDD 2002. LNCS (LNAI), vol. 2431, pp. 275–286. Springer, Heidelberg (2002)
Koperski, K., Han, J.: Discovery of spatial association rules in geographic information database. In: Egenhofer, M.J., Herring, J.R. (eds.) SSD 1995. LNCS, vol. 951, pp. 47–66. Springer, Heidelberg (1995)
Koperski, K., Adhikary, J., Han, J.: Knowledge discovery in spatial databases: progress and challenges. In: Proc. SIGMOD Workshop on Research Issues in Data Mining and Knowledge Discovery (1996)
Koperski, K.: A progressive refinement approach to spatial data mining. Ph.D. thesis, Computing Science, Simon Fraser University (1999)
Lanza, A., Malerba, D., Lisi, L.F., Appice, A., Ceci, M.: Generating Logic Descriptions for the Automated Interpretation of Topographic Maps. In: Blostein, D., Kwon, Y.-B. (eds.) GREC 2001. LNCS, vol. 2390, pp. 200–210. Springer, Heidelberg (2002)
Malerba, D., Esposito, F., Lisi, F.A.: Learning recursive theories with ATRE. In: Prade, H. (ed.) Proc. 13th European Conference on Artificial Intelligence, pp. 435–439. John Wiley & Sons, Chichester, England (1998)
Malerba, D., Esposito, F., Lanza, A., Lisi, F.A., Appice, A.: Empowering a GIS with Inductive Learning Capabilities: The Case of INGENS. Journal of Computers, Environment and Urban Systems (in press)
Malerba, D., Esposito, F., Lanza, A., Lisi, F.A.: Machine learning for information extraction from topographic maps. In: Miller, H.J., Han, J. (eds.) Geographic Data Mining and Knowledge Discovery, pp. 291–314. Taylor and Francis, London (2001)
Malerba, D., Lisi, F.A.: An ILP method for spatial association rule mining. In: Working notes of the First Workshop on Multi-Relational Data Mining, Freiburg, Germany, pp. 18–29 (2001)
Preparata, F., Shamos, M.: Computational Geometry: An Introduction. Springer, New York (1985)
Sagonas, K.F., Swift, T., Warren, D.S.: XSB as an Efficient Deductive Database Engine. In: Snodgrass, R.T., Winslett, M. (eds.) Proceedings of the 1994 ACM SIGMOD International Conference on Management of Data, Minneapolis, Minnesota, pp. 442–453 (1994); SIGMOD Record 23(2)
Sander, J., Ester, M., Kriegel, H.-P., Xu, X.: Density-Based Clustering in Spatial Databases: A New Algorithm and its Applications. In: Data Mining and Knowledge Discovery, vol. 2(2), pp. 169–194. Kluwer Academic Publishers, Dordrecht (1998)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2004 Springer-Verlag Berlin Heidelberg
About this chapter
Cite this chapter
Malerba, D., Appice, A., Ceci, M. (2004). A Data Mining Query Language for Knowledge Discovery in a Geographical Information System. In: Meo, R., Lanzi, P.L., Klemettinen, M. (eds) Database Support for Data Mining Applications. Lecture Notes in Computer Science(), vol 2682. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-44497-8_5
Download citation
DOI: https://doi.org/10.1007/978-3-540-44497-8_5
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-22479-2
Online ISBN: 978-3-540-44497-8
eBook Packages: Springer Book Archive