Abstract
The paper describes the SimUrb tool, which was used to calculate the similarity between municipalities (or other territorial units) according to several characteristics (attributes). The SimUrb tool allows similar groups between ordered sequences to be found. The tool was designed to work with hundreds of records. In order to find similar groups, the tool employs graph theory, in which similar groups are represented as cliques on a simple graph. The Bron-Kerbosch algorithm was used to search for them. The degree of similarity was determined from a metric based on the Euclidean distance between strings. Finding all non-trivial cliques would have been difficult or impossible for such data to be feasible in this study. Therefore, the SimUrb tool was used to find similar groups as the largest disjoint cliques in the respective graph. A case study is introduced in the second part of the paper to illustrate SimUrb’s functionality. The results of standard grouping methods (two methods offered by ArcGIS software) and groups defined under official planning documents were compared to the results from the SimUrb software. We concluded that SimUrb can be used in many applications where the user needs to define groups of objects with the same degree of similarity.
Similar content being viewed by others
Data Availability
SimUrb was created in PHP, HTML and C# programming languages, and graphical visualization was done with the D3.js and jQuery libraries. The tool is located on the server of Palacký University, Olomouc (web-n.upol.cz). For public users, the tool is accessible at http://eyetracking.upol.cz/simurb. The server runs on Linux OS with an Apache 2.0 web server, PHP version 5.3.17 and MySQL database version 5.0.8. Complete configuration of the server is available at http://eyetracking.upol.cz/info.php.
Preparation of the geographic data for SimUrb was done in open source software QGIS version 2.18. All tools were tested for use mainly in the Google Chrome web browser. The SimUrb source code is available at: https://github.com/swenney/simurb.
References
Bron C, Kerbosch J (1973) Algorithm 457: finding all cliques of an undirected graph. Commun ACM 16:575–577. https://doi.org/10.1145/362342.362367
Burian J, Pászto V (2016a) Land sustainability analysis of Olomouc region [Hodnocení rozboru udržitelného rozvoje území obcí Olomouckého kraje - Vyhodnocení vyváženosti vztahu územních podmínek]. Olomouc
Burian J, Pászto V (2016b) Methodology for Land sustainability analysis of municipalities in Olomouc region [Metodika hodnocení rozboru udržitelného rozvoje území obcí Olomouckého kraje]. Olomouc
Burian J, Brychtová A, Vávra A, Hladišová B (2016) Analytical material for planning in Olomouc, Czech Republic. J Maps 12:649–654
D’Urso P (2015) Fuzzy clustering. Handbook of Cluster Analysis
De Brito DM, Maracaja-Coutinho V, De Farias ST et al (2016) A novel method to predict genomic islands based on mean shift clustering algorithm. PLoS One 11:e0146352. https://doi.org/10.1371/journal.pone.0146352
Doležalová J, Popelka S (2016) ScanGraph: A novel scanpath comparison method using graph cliques visualization. J Eye Mov Res 9:1–13. https://doi.org/10.16910/jemr.9.4.5
Du KL (2010) Clustering: a neural network approach. Neural Netw 23:89–107. https://doi.org/10.1016/j.neunet.2009.08.007
Esri (2018a) How grouping analysis works—ArcGIS Pro | ArcGIS desktop. https://pro.arcgis.com/en/pro-app/tool-reference/spatial-statistics/how-grouping-analysis-works.htm#ESRI_SECTION1_103DAE5B22924FEFA7F8B9A79D9614F7. Accessed 23 Feb 2019
Esri (2018b) How Multivariate Clustering works—ArcGIS Pro | ArcGIS Desktop. https://pro.arcgis.com/en/pro-app/tool-reference/spatial-statistics/how-multivariate-clustering-works.htm. Accessed 23 Feb 2019
Jain AK (2010) Data clustering: 50 years beyond K-means. Pattern Recogn Lett 31:651–666. https://doi.org/10.1016/j.patrec.2009.09.011
Kong F, Yin H, Nakagoshi N, Zong Y (2010) Urban green space network development for biodiversity conservation: identification based on graph theory and gravity modeling. Landsc Urban Plan 95:16–27. https://doi.org/10.1016/J.LANDURBPLAN.2009.11.001
Kubíček P, Konečný M, Stachoň Z, Shen J, Herman L, Řezník T, Staněk K, Štampach R, Leitgeb Š (2018) Population distribution modelling at fine spatio-temporal scale based on mobile phone data. Int J Digital Earth:1–22
Lata S, Ayyub M (2014) Fuzzy inference system. Int J Appl Eng Res 9(7):805–813
McArdle G, Demšar U, van der Spek S, McLoone S (2014) Classifying pedestrian movement behaviour from GPS trajectories using visualization and clustering. Ann GIS 20:85–98. https://doi.org/10.1080/19475683.2014.904560
Mertel A, Ondrejka P, Šabatová K (2018) Spatial predictive modeling of prehistoric sites in the bohemian-Moravian highlands based on graph similarity analysis. Open Geosciences 10(1):261–274
Nehéz M (2016) Analysis of the randomized algorithm for clique problems. Paper presented at the 15th conference on applied mathematics APLIMAT 2016, Bratislava
Nurmi L, Meskanen T (2018) Political areas of competition in Southwest Finland: a cluster analysis of parliamentary election data from three levels of aggregation. Res Finn Soc 11:24–43
Opach T, Popelka S, Dolezalova J, Rød JK (2018) Star and polyline glyphs in a grid plot and on a map display: which perform better? Cartogr Geogr Inf Sci 45:400–419. https://doi.org/10.1080/15230406.2017.1364169
Pászto V, Brychtová A, Tuček P, Marek L, Burian J (2015) Using a fuzzy inference system to delimit rural and urban municipalities in the Czech republic in 2010. J Maps 11(2):231–239
QGIS (2019) QGIS Plugins. https://plugins.qgis.org/. Accessed 23 Feb 2019
Selvi HZ, Caglar B (2018) Using cluster analysis methods for multivariate mapping of traffic accidents. Open Geosciences 10(1):772–781
Treml EA, Halpin PN, Urban DL, Pratson LF (2008) Modeling population connectivity by ocean currents, a graph-theoretic approach for marine conservation. Landsc Ecol 23:19–36. https://doi.org/10.1007/s10980-007-9138-y
Zetterberg A, Mörtberg UM, Balfors B (2010) Making graph theory operational for landscape ecological assessments, planning, and design. Landsc Urban Plan 95:181–191. https://doi.org/10.1016/J.LANDURBPLAN.2010.01.002
Zhong C, Zeng S, Tu W, Yoshida M (2018) Profiling the spatial structure of London: from individual tweets to aggregated functional zones. ISPRS Int J Geo Inf 7(10):386
Zhu X, Guo D (2014) Mapping large spatial flow data with hierarchical clustering. Trans GIS 18:421–435. https://doi.org/10.1111/tgis.12100
Acknowledgements
This research was supported by the Czech Science Foundation, project no. 19-14506S; and by the NAKI II research programme, project no. DG18P02OVV017. We would like to thank Ondřej Štrubl, who did the programming part of the work.
Author information
Authors and Affiliations
Corresponding author
Additional information
Communicated by: H. Babaie
Publisher’s note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
About this article
Cite this article
Doležalová, J., Burian, J. & Popelka, S. SimUrb – software for identifying similar municipalities by comparing Urban indices using a graph algorithm. Earth Sci Inform 12, 699–714 (2019). https://doi.org/10.1007/s12145-019-00399-8
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s12145-019-00399-8