Advertisement

Journal of Intelligent Information Systems

, Volume 42, Issue 3, pp 485–505 | Cite as

Mining regional co-location patterns with kNNG

  • Feng QianEmail author
  • Kevin Chiew
  • Qinming He
  • Hao Huang
Article

Abstract

Spatial co-location pattern mining discovers the subsets of features of which the events are frequently located together in geographic space. The current research on this topic adopts a distance threshold that has limitations in spatial data sets with various magnitudes of neighborhood distances, especially for mining of regional co-location patterns. In this paper, we propose a hierarchical co-location mining framework accounting for both variety of neighborhood distances and spatial heterogeneity. By adopting k-nearest neighbor graph (kNNG) instead of distance threshold, we propose “distance variation coefficient” as a new measure to drive the mining operations and determine an individual neighborhood relationship graph for each region. The proposed mining algorithm outputs a set of regions with each of them an individual set of regional co-location patterns. The experimental results on both synthetic and real world data sets show that our framework is effective to discover these regional co-location patterns.

Keywords

Regional co-location pattern mining kNNG Variation coefficient 

Notes

Acknowledgements

This work is partly supported by National Key Technologies R&D Program of China under Grant No. 2011BAD21B02, in which Chiew’s work is partly supported by National Natural Science Foundation of China under Grant No. 61272303.

References

  1. Agrawal, R., & Srikant, R. (1994). Fast algorithms for mining association rules. In Proceedings of the 20th international conference on very large data bases, 12–15 Sept. (pp. 487–499). Santiago de Chile, Chile.Google Scholar
  2. Arge, L., Procopiuc, O., Ramaswamy, S., Suel, T., Vitter, J.S. (1998). Scalable sweeping-based spatial join. In Proceedings of the 24th international conference on very large data bases, 24–27 Aug. (pp. 570–581). New York, USA.Google Scholar
  3. Banerjee, A., & Ghosh, J. (2006). Scalable clustering algorithms with balancing constraints. Data Mining and Knowledge Discovery, 13(3), 365–395.CrossRefMathSciNetGoogle Scholar
  4. Brito, M.R., Chavez, E.L., Quiroz, A.J., Yukich, J.E. (1997). Connectivity of the mutual k-nearest-neighbor graph in clustering and outlier detection. Statistics and Probability Letters, 35(1), 33–42.CrossRefzbMATHMathSciNetGoogle Scholar
  5. Celik, M., Kang, J.M., Shekhar, S. (2007). Zonal co-location pattern discovery with dynamic parameters. In Proceedings of IEEE international conference on data mining, 28–31 Oct. (pp. 433–438). Omaha, USA.Google Scholar
  6. Celik, M., Shekhar, S., Rogers, J.P., Shine, J.A., Yoo, J.S. (2006). Mixed-drove spatio-temporal co-occurrence pattern mining: A summary of results. In Proceedings of the 6th international conference on data mining, 18–22 Dec. (pp. 119–128). Hong Kong, China.Google Scholar
  7. Digital Chart of the World. (2013). http://www.princeton.edu/~geolib/gis/dcw.html. Accessed 05 Oct 2013.
  8. Ding, C., & He, X. (2004). K-nearest-neighbor consistency in data clustering: Incorporating local information into global optimization. In Proceedings of the ACM symposium on applied computing, 14–17 March (pp. 584–589). Nicosia, Cyprus.Google Scholar
  9. Eick, C.F., Parmar, R., Ding, W., Stepinski, T.F., Nicot, J.P. (2008). Finding regional co-location patterns for sets of continuous variables in spatial datasets. In Proceedings of the 16th ACM SIGSPATIAL international conference on advances in geographic information systems, 5–7 Nov. (pp. 1–110). Irvine, USA.Google Scholar
  10. Friedman, J.H., Bentley, J.L., Finkel, R.A. (1977). An algorithm for finding best matches in logarithmic expected time. ACM Transactions on Mathematical Software, 3, 290–226.Google Scholar
  11. Ge, R., Ester, M., Jin, W., Davidson, I. (2007). Constraint-driven clustering. In Proceedings of the 13th ACM SIGKDD international conference on knowledge discovery and data mining, 12–15 Aug. (pp. 320–329). San Jose, USA.Google Scholar
  12. Han, E.H., Karypis, G., Kumar, V. (1999). Chameleon: hierarchical clustering using dynamic modeling. IEEE Computer, 32(8), 68–75.CrossRefGoogle Scholar
  13. Huang, Y., Pei, J., Xiong, H. (2006). Mining co-location patterns with rare events from spatial data sets. GeoInformatica, 10(3), 239–260.CrossRefGoogle Scholar
  14. Huang, Y., Shekhar, S., Xiong, H. (2004). Discovering colocation patterns from spatial datasets: a general approach. IEEE Transactions on Knowledge and Data Engineering, 16(12), 1472–1485.CrossRefGoogle Scholar
  15. Huang, Y., Zhang, P., Zhang, C. (2008). On the relationships between clustering and spatial co-location pattern mining. International Journal on Artificial Intelligence Tools, 17(1), 55–70.CrossRefGoogle Scholar
  16. Jin, R., Goswami, A., Agrawal, G. (2006). Fast and exact out-of-core and distributed k-means clustering. Knowledge and Information Systems, 10(1), 17–40.CrossRefGoogle Scholar
  17. Lin, Z., & Lim, S.J. (2009). Optimal candidate generation in spatial co-location mining. In Proceedings of the 2009 ACM symposium on applied computing, 9–12 March (pp. 1441–1445). Hawaii, USA.Google Scholar
  18. Miller, H.J., & Han, J. (2009). Geographic data mining and knowledge discovery. New York: CRC Press.Google Scholar
  19. Müller, E., Günnemann, S., Färber, I., Seidl, T. (2012). Discovering multiple clustering solutions: Grouping objects in different views of the data. In Proceeds of the 28th international conference on data engineering, 1–5 Apr. (pp. 1207–1210). Washington DC, USA.Google Scholar
  20. Munro, R., Chawla, S., Sun, P. (2003). Complex spatial relationships. In Proceedings of the 3rd IEEE international conference on data mining, 19–22 Dec. (pp. 227–234). Melbourne, USA.Google Scholar
  21. Park, K., Shen, C., Hao, Z., Kim, J. (2011). Efficiently learning a distance metric for large margin nearest neighbor classification. In Proceedings of the 25th AAAI conference on artificial intelligence, 7–11 Aug. (pp. 453–458). San Francisco, USA.Google Scholar
  22. Qian, F., He, Q., Chiew, K., He, J. (2012). Spatial co-location pattern discovery without thresholds. Knowledge and Information Systems, 33(2), 419–445.CrossRefGoogle Scholar
  23. Qian, F., He, Q., He, J. (2009a). Mining spread patterns of spatio-temporal co-occurrences over zones. In Proceedings of the international conference on computational science and its applications, 29 June–2 July (pp. 686–701). Seoul, Korea.Google Scholar
  24. Qian, F., Yin, L., He, Q., He, J. (2009b). Mining spatio-temporal co-location patterns with weighted sliding window. In Proceedings of IEEE international conference on intelligent computing and intelligent systems, 20–22 Nov. (pp. 181–185). Shanghai, China.Google Scholar
  25. Samet, H. (1990). The design and analysis of spatial data structures. Boston: Addison Wesley.Google Scholar
  26. Shaw, B., Huang, B.C., Jebara, T. (2011). Learning a distance metric from a network. In Proceeds of the 25th conference on neural information processing systems, 12–14 Dec. (pp. 1899–1907). Granada, Spain.Google Scholar
  27. Shekhar, S., & Huang, Y. (2001). Discovering spatial co-location patterns: A summary of results. In Proceedings of the 7th international symposium on spatial and temporal databases, 12–15 July (pp. 236–256). Redondo Beach, USA.Google Scholar
  28. Sheng, C., Hsu, W., Lee, M.L., Tung, A.K.H. (2008). Discovering spatial interaction patterns. In Proceedings of the 13th international conference on database systems for advanced applications, 19–21 March (pp. 95–109). New Delhi, India.Google Scholar
  29. Slonim, N., Atwal, G.S., Tkačik, G., Bialek, W. (2005). Information-based clustering. Proceedings of the National Academy of Sciences of the United States of America, 102(51), 18297–18302.CrossRefzbMATHMathSciNetGoogle Scholar
  30. Wang, J. (2006). Spatial analysis. Beijing: Science Press.Google Scholar
  31. Wang, L., Zhou, L., Lu, J., Yip, J. (2009). An order-clique-based approach for mining maximal co-locations. Information Sciences, 179(19), 3370–3382.CrossRefzbMATHGoogle Scholar
  32. Wu, X., Kumar, V., Quinlan, J.R., Ghosh, J., Yang, Q., Motoda, H., McLachlan, G., Ng, A., Liu, B., Yu, P., Zhou, Z., Steinbach, M., Hand, D., Steinberg, D. (2008). Top 10 algorithms in data mining. Knowledge and Information Systems, 14(1), 1–37.CrossRefGoogle Scholar
  33. Xiao, X., Xie, X., Luo, Q., Ma, W.Y. (2008). Density based co-location pattern discovery. In Proceedings of the 16th ACM SIGSPATIAL international conference on advances in geographic information systems, 5–7 Nov. (pp. 1–10). Irvine, USA.Google Scholar
  34. Xiong, H., Shekhar, S., Huang, Y., Kumar, V., Ma, X., Yoo, J.S. (2004). A framework for discovering co-location patterns in data sets with extended spatial objects. In Proceedings of the fourth SIAM international conference on data mining, 22–24 Apr. (vol. 89, p. 78). Lake Buena, USA.Google Scholar
  35. Yoo, J.S., & Shekhar, S. (2006). A joinless approach for mining spatial colocation patterns. IEEE Transactions on Knowledge and Data Engineering, 18(10), 1323–1337.CrossRefGoogle Scholar
  36. Yoo, J.S., Shekhar, S., Kim, S., Celik, M. (2006). Discovery of co-evolving spatial event sets. In Proceedings of the 6th SIAM international conference on data mining, 20–22 Nov. (pp. 306–315). Bethesda, USA.Google Scholar
  37. Yoo, J.S., & Bow, M. (2012). Mining spatial colocation patterns: a different framework. Data Mining and Knowledge Discovery, 24(1), 1384–5810.CrossRefMathSciNetGoogle Scholar
  38. Zhang, X., Mamoulis, N., Cheung, D.W., Shou, Y. (2004). Fast mining of spatial collocations. In Proceedings of the tenth ACM SIGKDD international conference on knowledge discovery and data mining, 22–25 Aug. (pp. 384–393). Seattle, USA.Google Scholar

Copyright information

© Springer Science+Business Media New York 2013

Authors and Affiliations

  1. 1.Hangzhou R&D CenterNetEase Inc.HangzhouPeople’s Republic of China
  2. 2.School of EngineeringTan Tao UniversityDuc Hoa DistrictVietnam
  3. 3.College of Computer Science and TechnologyZhejiang UniversityHangzhouPeople’s Republic of China
  4. 4.School of ComputingNational University of SingaporeSingaporeSingapore

Personalised recommendations