Abstract
The growing demand for Geographic Information Systems (GIS) calls for high computation reliability to handle vast and complex spatial data processing tasks. A better parallel computing scheme should ensure balanced workload at different data processors to ensure optimal use of computing resources and minimise execution times, which poses more challenges with spatial data due to the nature of having spatial correlations and uneven distributions. In this paper, we propose a spatial clustering approach for workload balance, by using an evolutionary computation method that considers the nature of spatial data, to increase the computation performance for processing GIS polygon-based maps with massive number of vertices and complex shapes. To evaluate our proposed approach, We proposed two different experimental approaches for comparing our results: (i) Non–merging based experiment, and (ii) merging based experiment. The results demonstrated the advantage of the proposed spatial clustering approach in real GIS map based partitioning scenarios. The advantages and limitations of the proposed approach are discussed and further research directions are highlighted toward a development work by the research community.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
GIS (Geographic Information System) Overview. https://www.esri.com/en-us/what-is-gis/overview
Aji, A., et al.: Hadoop-GIS: a high performance spatial data warehousing system over MapReduce. In: The 39th International Conference on Very Large Data Bases, vol. 6, pp. 1009–1020 (2013)
Araujo Neto, A.C., Coelho da Silva, T.L., de Farias, V.A.E., Macêdo, J.A.F., de Castro Machado, J.: G2P: a partitioning approach for processing DBSCAN with MapReduce. In: Web and Wireless Geographical Information Systems, pp. 191-d-202. Springer, Cham (2015)
Bação, F., Lobo, V., Painho, M.: Applying genetic algorithms to zone design. Soft. Comput. 9(5), 341–348 (2005)
Barua, H.B., Das, D.K., Sarmah, S.: A density based clustering technique for large spatial data using polygon approach. J. Comput. Eng. 3(6), 1–9 (2012)
Boobalan, M.P., Lopez, D., Gao, X.: Graph clustering using k-neighbourhood attribute structural similarity. Appl. Soft Comput. 47, 216–223 (2016)
Cao, Z., Wang, S., Forestier, G., Puissant, A., Eick, C.F.: Analyzing the composition of cities using spatial clustering. In: Proceedings of the 2nd ACM SIGKDD International Workshop on Urban Computing, pp. 141–148 (2013)
Dean, J., Ghemawat, S.: MapReduce: simplified data processing on large clusters. In: the 6th Conference on Symposium on Opearting Systems Design and Implementation, pp. 1–13. Google, Inc. (2004)
Eldawy, A., Alarabi, L., Mokbel, M.F.: Spatial partitioning techniques in SpatialHadoop. Proc. VLDB Endow. 8(12), 1602–1605 (2015)
Eldawy, A., Mokbel, M.F.: SpatialHadoop: a MapReduce framework for spatial data. In: The 31st International Conference on Data Engineering, pp. 1352–1363 (2015)
Ericsson, A., WCDMA, R.: Clustering and polygon merging algorithms for fingerprinting positioning in LTE. In: 5th International Conference on Signal Processing and Communication Systems (ICSPCS), pp. 1–10 (2011)
ESRI: ESRI shapefile technical description. Technical report, Environmental Systems Research Institute Inc, 380 New York Street, Redlands, CA 92373–8100, USA (1998)
Fu, Y.X., Zhao, W.Z., Ma, H.F.: Research on parallel DBSCAN algorithm design based on MapReduce. In: Advanced Measurement and Test, Advanced Materials Research, vol. 301, pp. 1133–1138. Trans Tech Publications (2011)
Gu, X., Angelov, P.P., Príncipe, J.C.: A method for autonomous data partitioning. Inf. Sci. 460–461, 65–82 (2018)
Guest, O., Kanayet, F.J., Love, B.C.: Gerrymandering and computational redistricting. J. Comput. Soc. Sci. 2(2), 119–131 (2019). https://doi.org/10.1007/s42001-019-00053-9
Gufler, B., Augsten, N., Reiser, A., Kemper, A.: The partition cost model for load balancing in MapReduce, chap. 5, pp. 371–387. Springer, New York (2012)
Jasim, M., Asadi, T.A.: New graph mining algorithm for vector GIS systems. In: 8th International Conference on Computing Technology and Information Management (NCM and ICNIT), vol. 1, pp. 335–338 (2012)
Ji, G., Zhang, L.: A spatial polygon objects clustering algorithm based on topological relations for GML data. In: 2009 International Conference on Information Engineering and Computer Science, pp. 1–4 (2009)
Joshi, D., Samal, A., Soh, L.K.: A dissimilarity function for clustering geospatial polygons. In: Proceedings of the 17th ACM SIGSPATIAL International Conference on Advances in Geographic Information Systems, pp. 384–387. ACM, New York (2009)
Joshi, D., Samal, A.K., Soh, L.K.: Density-based clustering of polygons. In: 2009 IEEE Symposium on Computational Intelligence and Data Mining, pp. 171–178 (2009)
Joshi, D., Soh, L.K., Samal, A.: Redistricting using heuristic-based polygonal clustering. In: 2009 Ninth IEEE International Conference on Data Mining, pp. 830–835 (2009)
Joshi, D., Soh, L.K., Samal, A.: Redistricting using constrained polygonal clustering. IEEE Trans. Knowl. Data Eng. 24(11), 2065–2079 (2012)
Kisore, N.R., Koteswaraiah, C.B.: Improving ATM coverage area using density based clustering algorithm and voronoi diagrams. Inf. Sci. 376, 1–20 (2017)
Levin, H.A., Friedler, S.A.: Automated congressional redistricting. J. Exp. Algorithmics 24, 1–24 (2019)
Li, X., Li, W., Anselin, L., Rey, S., Koschinsky, J.: A MapReduce algorithm to create contiguity weights for spatial analysis of big data. In: Proceedings of the 3rd ACM SIGSPATIAL International Workshop on Analytics for Big Geospatial Data, pp. 50–53. ACM, New York (2014)
Liu, R., Wang, H., Yu, X.: Shared-nearest-neighbor-based clustering by fast search and find of density peaks. Inf. Sci. 450, 200–226 (2018)
Longley, P.A., Goodchild, M., Maguire, D.J., Rhind, D.W.: Geographic Information Systems and Science, 3rd edn. Wiley, Hoboken (2011)
Photis, Y.N.: Redefinition of the Greek electoral districts through the application of a region-building algorithm. MPRA Paper 42398, University Library of Munich, Germany (2012)
Puri, S., Agarwal, D., He, X., Prasad, S.K.: MapReduce algorithms for GIS polygonal overlay processing. In: 2013 IEEE International Symposium on Parallel Distributed Processing, Workshops and PHD Forum, pp. 1009–1016 (2013)
Qiu, Q., Yao, X., Chen, C., Liu, Y., Fang, J.: A spatial data partitioning and merging method for parallel vector spatial analysis. In: 2015 23rd International Conference on Geoinformatics, pp. 1–5 (2015)
Schutzman, Z.: Trade-offs in fair redistricting. In: AAAI/ACM Conference on AI, Ethics, and Society, pp. 159–165 (2020)
Shuliang, W., Gangyi, D., Ming, Z.: Big spatial data mining. In: IEEE International Conference on Big Data, pp. 13–21 (2013)
Wang, S., Chen, C.S., Rinsurongkawong, V., Akdag, F., Eick, C.F.: A polygon-based methodology for mining related spatial datasets. In: Proceedings of the 1st ACM SIGSPATIAL International Workshop on Data Mining for Geoinformatics, pp. 1–8 (2010)
Wang, S., Eick, C.F.: A polygon-based clustering and analysis framework for mining spatial datasets. GeoInformatica 18(3), 569–594 (2013). https://doi.org/10.1007/s10707-013-0190-2
Wang, W., Du, S., Guo, Z., Luo, L.: Polygonal clustering analysis using multilevel graph-partition. Trans. GIS 19(5), 716–736 (2015)
Wei, H., et al.: A kd tree-based algorithm to parallelize kriging interpolation of big spatial data. GISci. Remote Sens. 52(1), 40–57 (2015)
Zhang, J., Samal, A., Soh, L.: Polygon-based spatial clustering. In: The 8th International Conference on GeoComputation, pp. 1–5 (2005)
Zhang, X., Huang, B., Tay, R.: Estimating spatial logistic model: a deterministic approach or a heuristic approach? Inf. Sci. 330, 358–369 (2016). SI Visual Info Communication
Zhao, L., Chen, L., Ranjan, R., Choo, K.-K.R., He, J.: Geographical information system parallelization for spatial big data processing: a review. Clust. Comput. 19(1), 139–152 (2015). https://doi.org/10.1007/s10586-015-0512-2
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2021 The Author(s), under exclusive license to Springer Nature Switzerland AG
About this paper
Cite this paper
Abubahia, A., Bader-El-Den, M., Haig, E. (2021). Evolutionary Computation Approach for Spatial Workload Balancing. In: Arai, K. (eds) Intelligent Computing. Lecture Notes in Networks and Systems, vol 284. Springer, Cham. https://doi.org/10.1007/978-3-030-80126-7_38
Download citation
DOI: https://doi.org/10.1007/978-3-030-80126-7_38
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-80125-0
Online ISBN: 978-3-030-80126-7
eBook Packages: Intelligent Technologies and RoboticsIntelligent Technologies and Robotics (R0)