Advertisement

Spatial data analysis using association rule mining in distributed environments: a privacy prospect

  • Raghvendra Kumar
  • Le Hoang Son
  • Sudan Jha
  • Mamta Mittal
  • Lalit Mohan Goyal
Article
  • 9 Downloads

Abstract

In this paper, we investigate the privacy issue that is remained in the spatial association rule mining (SARM). The main aim of SARM is to calculate the relationship among attributes according to their geographic locations. However, the major problem of the distributed data mining is the privacy and security issues, which require executing global results without disclosing private information by third parties. The problem of privacy preservation for SARM in distributed environments is controlled by the proposed algorithm, which is able to extract association among different numbers of attributes in geo-graphical distributed database with high privacy. The proposed algorithm is validated in term of data utility rate, efficiency and privacy preservation against the existing algorithms. It has been revealed that this algorithm decreases the execution time, memory requirements, and privacy failure rate when the size of database increases within the geographically distributed database environment.

Keywords

Spatial association rule mining Frequent patterns Knowledge discovery Spatial database Spatial analysis 

Notes

Acknowledgements

This work was supported by the department of computer science and engineering, LNCT College, Jabalpur, M.P., India and Vietnam National University, Vietnam.

References

  1. 1.
    Tjioe, H. C., & Taniar, D. (2008). Mining association rules in data warehouses. International Journal of Data Warehousing and Mining, 1(3), 54–63.Google Scholar
  2. 2.
    Djenouri, Y., Djenouri, D., Habbas, Z., & Belhadi, A. (2018). How to exploit high performance computing in population-based metaheuristics for solving association rule mining problem. Distributed and Parallel Databases, 36, 1–29.CrossRefGoogle Scholar
  3. 3.
    Kalia, H., Dehuri, S., & Ghosh, A. (2013). A survey on fuzzy association rule mining. International Journal of Data Warehousing and Mining (IJDWM), 9(1), 1–27.CrossRefGoogle Scholar
  4. 4.
    Li, L., & Singham, M. (2015). The applicability of the perturbation model-based privacy preserving data mining for real-world data. In International conference on data mining workshops (pp. 98–105), IEEE.Google Scholar
  5. 5.
    Li, L., et al. (2016). Privacy-preserving-outsourced association rule mining on vertically partitioned databases. IEEE Transactions on Information Forensics and Security, 11(8), 1847–1861.CrossRefGoogle Scholar
  6. 6.
    Bogorny, V., Engel, P. M., & Alvares, L. O. (2005). Reuse-based spatial data preparation framework for data mining. In: International conference on software engineering and knowledge engineering, Seke, Taipei, Taiwan, 17 (pp. 649–652), IEEE.Google Scholar
  7. 7.
    Bogorny, V., Engel, P. M., & Alvares, L. O. (2005). Towards the reduction of spatial join for knowledge discovery in geographic databases using geo-ontologies and spatial integrity constraints. In Workshop on knowledge discovery and ontologies of TheEcml/Pkdd,Kdo, Porto, 2 (pp. 51–58), IEEE.Google Scholar
  8. 8.
    Bogorny, V., Engel, P. M., & Alvares, L. O. (2006). An interoperable framework to improve geographic data pre-processing and spatial association rule mining. In International conference on software engineering and knowledge engineering, Seke, San Francisco, 18 (pp. 70–84), ACM.Google Scholar
  9. 9.
    Yang, J., Sun, J., Zhao, H., et al. (2016). Spatio-temporal differentiation of residential land for coastal town: A case study of Dalian Jinshitan. Chinese Geographical Science, 26(4), 566–576.CrossRefGoogle Scholar
  10. 10.
    Bogorny, V., Camargo, S., Engel, P., & Alvares, L. O. (2006). Mining Frequent Geographic Patterns with Knowledge Constraints. In International symposium on advances in geographic information systems, 14 (pp. 64–71), ACM.Google Scholar
  11. 11.
    Bogorny, V., Camargo, S., Engel, P. M., &Alvares, L. O. (2006). Towards Elimination of Well Known Geographic Domain Patterns in Spatial Association Rule Mining. International Conference on Intelligent Systems, London, 3, (pp.532-537), IEEE.Google Scholar
  12. 12.
    Bogorny, V., Engel, P. M., & Alavares, L. O. (2008). Enhancing the process of knowledge discovery in geographic databases using geo-ontologies (pp. 57–68). New York: IGI Global.Google Scholar
  13. 13.
    Bogorny, V., Valiati, J., Camargo, S., Engel, P., & Alvares, L. O. (2006). Mining maximal generalized frequent geographic patterns with knowledge constraints. In: International conference on data mining, Hong-Kong (pp. 6–12), IEEE.Google Scholar
  14. 14.
    Vaidya J. & Clifton, C. (2002). Privacy preserving association rule mining in vertically partitioned data. In Proceedings of the SIGKDD (pp. 639–644) ACM.Google Scholar
  15. 15.
    Vaidya, J., Clifton, C., Kantarcioglu, M., & Scott Patterson, A. (2008). Privacy-preserving decision trees over vertically partitioned data. ACM Transactions on Knowledge Discovery from Data, 2(3), 85–96.CrossRefGoogle Scholar
  16. 16.
    Mishra D. K., & Chandwani M. (2008). SMC advertising and minimizing their effects for privacy preservation data mining. In Indiacom-2008 (pp. 98–104), IEEE.Google Scholar
  17. 17.
    Bonchi, F. et al. (2003). Examiner: Optimized level-wise frequent pattern mining with monotone constraints. In: International conference on data mining, Melbourne (pp. 11–18), IEEE.Google Scholar
  18. 18.
    Appice, A., et al. (2005). Mining and filtering multi-level spatial association rules with ares. International Symposium on Methodologies for Intelligent Systems, 15, 342–353.Google Scholar
  19. 19.
    Wang, S., Lai, T., Hong, T., & Wu, Y. (2010). Hiding collaborative recommendation association rules on horizontally partitioned data. Intelligent Data Analysis, 14(1), 47–67.Google Scholar
  20. 20.
    Demetriou, D., Stillwell, J., & See, L. (2013). A new methodology for measuring land fragmentation. Computers Environment and Urban Systems, 39, 71–80.CrossRefGoogle Scholar
  21. 21.
    Kantarcioglu, M., & Clifton, C. (2004). Privacy-preserving distributed mining of association rules on horizontally partitioned data. IEEE Transactions on Knowledge and Data Engineering, 16(9), 1026–1037.CrossRefGoogle Scholar
  22. 22.
    Kardan, A. A., & Ebrahimi, M. (2013). A novel approach to hybrid recommendation systems based on association rules mining for content recommendation in asynchronous discussion groups. Information Sciences, 219, 93–110.CrossRefGoogle Scholar
  23. 23.
    Mao, K., Ma, Y., Xia, L., et al. (2014). A neural network method for monitoring snowstorm: a case study in southern China. Chinese Geographical Science, 24(5), 599–606.CrossRefGoogle Scholar
  24. 24.
    Martín, D., Rosete, A., Alcalá-Fdez, J., et al. (2014). QAR-CIP-NSGAII: a new multi-objective evolutionary algorithm to mine quantitative association rules. Information Sciences, 258, 1–28.CrossRefGoogle Scholar
  25. 25.
    Martínez, B. M., Bacardit, J., Troncoso, A., et al. (2015). Enhancing the scalability of a genetic algorithm to discover quantitative association rules in large-scale datasets. Integrated Computer Aided Engineering, 22(1), 21–39.CrossRefGoogle Scholar
  26. 26.
    Lai, J., et al. (2014). Towards semantically secure outsourcing of association rule mining on categorical data. Information Sciences, 267, 267–286.CrossRefGoogle Scholar
  27. 27.
    Zhang, C., & Zhang, S. (2002). Association rule mining: Models and algorithms. New York: Springer.CrossRefGoogle Scholar
  28. 28.
    Zhong, S. (2007). Privacy-preserving algorithms for distributed mining of frequent itemsets. Information Sciences, 177(2), 490–503.CrossRefGoogle Scholar
  29. 29.
    Gayathiri, P., & Poorna, B. (2015). Association rule hiding techniques for privacy preserving data mining: A study. International Journal of Advanced Computer Science and Applications, 6(12), 237–242.Google Scholar
  30. 30.
    Giannotti, F., et al. (2013). Privacy-preserving mining of association rules from out-sourced transaction databases. IEEE Systems Journal, 7(3), 385–395.CrossRefGoogle Scholar
  31. 31.
    Rossi, G., Ferrarini, A., Dowgiallo, G., et al. (2014). Detecting complex relations among vegetation, soil and geomorphology. An indepth method applied to a case study in the Apennines (Italy). Ecological Complexity, 17, 87–98.CrossRefGoogle Scholar
  32. 32.
    Rozenberg, B., & Gudes, E. (2006). Association rules mining in vertically partitioned databases. Data & Knowledge Engineering, 59(2), 378–396.CrossRefGoogle Scholar
  33. 33.
    Kim, M. Y. (2017). Modelling information curation platform to improve efficiencies of smart public services. Spatial Information Research, 25(6), 735–747.CrossRefGoogle Scholar
  34. 34.
    Mal-Sarkar, S., Sikder, I. U., & Konangi, V. K. (2016). Spatio-temporal pattern discovery in sensor data: A multivalued decision systems approach. Knowledge-Based Systems, 109, 137–146.CrossRefGoogle Scholar
  35. 35.
    Muthu Lakshmi, N., & Sandhya Rani, N. (2012). Privacy preserving association rule mining in vertically partitioned databases. IJCSA, 39(13), 29–35.Google Scholar
  36. 36.
    Nahar, J., Imam, T., Tickle, K. S., et al. (2013). Association rule mining to detect factors which contribute to heart disease in males and females. Expert Systems with Applications, 40(4), 1086–1093.CrossRefGoogle Scholar
  37. 37.
    Nigro, H. O. (2007). Data mining with ontologies: Implementations, findings, and frameworks: Implementations, findings, and frameworks (pp. 14–52). Hershey: IGI Global.Google Scholar
  38. 38.
    Tassa, T. (2014). Secure mining of association rules in horizontally distributed databases scalable algorithms for association mining. IEEE Transactions on Knowledge and Data Engineering, 26(4), 78–85.CrossRefGoogle Scholar
  39. 39.
    Xue, C. J., Song, W. J., Qin, L. J., et al. (2015). A spatiotemporal mining framework for abnormal association patterns in marine environments with a time series of remote sensing images. International Journal of Applied Earth Observation and Geoinformation, 38, 105–114.CrossRefGoogle Scholar
  40. 40.
    Yang, J., Sun, J., Zhao, H., et al. (2016). Spatio-temporal differentiation of residential land for coastal town: A case study of Dalian Jinshitan. Chinese Geographical Science, 26(4), 566–576.CrossRefGoogle Scholar
  41. 41.
    Yang, J., Xie, P., Xi, J. C., et al. (2013). Spatiotemporal simulation of tourist town growth based on the cellular automata model: the case of Sanpo Town in Hebei Province. Abstract and Applied Analysis, 4, 1–7.Google Scholar
  42. 42.
    You, D. M., Zhou, J. G., Wang, J. H., et al. (2011). Analysis of relations of heavy metal accumulation with land utilization using the positive and negative association rule method. Mathematical and Computer Modelling, 54(3), 1005–1009.CrossRefGoogle Scholar
  43. 43.
    Son, L. H., Chiclana, F., Kumar, R., Mittal, M., Khari, M., Chatterjee, J. M., et al. (2018). ARM–AMO: An efficient association rule mining algorithm based on animal migration optimization. Knowledge-Based Systems, 154, 68–80.CrossRefGoogle Scholar
  44. 44.
    Ngan, R. T., Son, L. H., Cuong, B. C., & Ali, M. (2018). H-max distance measure of intuitionistic fuzzy sets in decision making. Applied Soft Computing, 69, 393–425.CrossRefGoogle Scholar
  45. 45.
    Hemanth, D. J., Anitha, J., & Son, L. H. (2018). Brain signal based human emotion analysis by circular back propagation and Deep Kohonen Neural Networks. Computers & Electrical Engineering, 68, 170–180.CrossRefGoogle Scholar
  46. 46.
    Giap, C. N., Son, L. H., & Chiclana, F. (2018). Dynamic structural neural network. Journal of Intelligent and Fuzzy Systems, 1–12 (Preprint).Google Scholar
  47. 47.
    Ali, M., Son, L. H., Thanh, N. D., & Van Minh, N. (2018). A neutrosophic recommender system for medical diagnosis based on algebraic neutrosophic measures. Applied Soft Computing.  https://doi.org/10.1016/j.asoc.2017.10.012.Google Scholar
  48. 48.
    Ngan, R. T., Ali, M., & Son, L. H. (2018). δ-equality of intuitionistic fuzzy sets: A new proximity measure and applications in medical diagnosis. Applied Intelligence, 48(2), 499–525.CrossRefGoogle Scholar
  49. 49.
    Son, L. H., & Hai, P. B. (2016). A novel multiple fuzzy clustering method based on internal clustering validation measures with gradient descent. International Journal of Fuzzy Systems, 18(5), 894–903.CrossRefGoogle Scholar
  50. 50.
    Son, L. H., & Tuan, T. M. (2017). Dental segmentation from X-ray images using semi-supervised fuzzy clustering with spatial constraints. Engineering Applications of Artificial Intelligence, 59, 186–195.CrossRefGoogle Scholar
  51. 51.
    Kapoor, R., Gupta, R., Son, L. H., Jha, S., & Kumar, R. (2018). Boosting performance of power quality event identification with KL Divergence measure and standard deviation. Measurement, 126, 134–142.CrossRefGoogle Scholar
  52. 52.
    Pham, B. T., Son, L. H., Hoang, T. A., Nguyen, D. M., & Bui, D. T. (2018). Prediction of shear strength of soft soil using machine learning methods. CATENA, 166, 181–191.CrossRefGoogle Scholar
  53. 53.
    Louati, A., Son, L. H., & Chabchoub, H. (2018). Smart routing for municipal solid waste collection: A heuristic approach. Journal of Ambient Intelligence and Humanized Computing.  https://doi.org/10.1007/s12652-018-0778-3.Google Scholar
  54. 54.
    Chuan, P. M., Son, L. H., Ali, M., Khang, T. D., & Dey, N. (2018). Link prediction in co-authorship networks based on hybrid content similarity metric. Applied Intelligence.  https://doi.org/10.1007/s10489-017-1086-x.Google Scholar
  55. 55.
    Hemanth, D. J., Anitha, J., Popescu, D. E., & Son, L. H. (2018). A modified genetic algorithm for performance improvement of transform based image steganography systems. Journal of Intelligent & Fuzzy Systems.  https://doi.org/10.3233/jifs-169580.Google Scholar
  56. 56.
    Ali, M., Son, L. H., Khan, M., & Tung, N. T. (2018). Segmentation of dental X-ray images in medical imaging using neutrosophic orthogonal matrices. Expert Systems with Applications, 91, 434–441.CrossRefGoogle Scholar
  57. 57.
    Ali, M., Dat, L. Q., Son, L. H., & Smarandache, F. (2018). Interval complex neutrosophic set: formulation and applications in decision-making. International Journal of Fuzzy Systems, 20(3), 986–999.CrossRefGoogle Scholar
  58. 58.
    Ngan, R. T., Ali, M., & Son, L. H. (2018). δ-equality of intuitionistic fuzzy sets: a new proximity measure and applications in medical diagnosis. Applied Intelligence, 48(2), 499–525.CrossRefGoogle Scholar
  59. 59.
    Ali, M., Son, L. H., Deli, I., & Tien, N. D. (2017). Bipolar neutrosophic soft sets and applications in decision making. Journal of Intelligent & Fuzzy Systems, 33(6), 4077–4087.CrossRefGoogle Scholar
  60. 60.
    Son, L. H., Viet, P. V., & Hai, P. V. (2017). Picture inference system: a new fuzzy inference system on picture fuzzy set. Applied Intelligence, 46(3), 652–669.CrossRefGoogle Scholar
  61. 61.
    Son, L. H., & Tien, N. D. (2017). Tune up fuzzy C-means for big data: some novel hybrid clustering algorithms based on initial selection and incremental clustering. International Journal of Fuzzy Systems, 19(5), 1585–1602.CrossRefGoogle Scholar
  62. 62.
    Thong, P. H., & Son, L. H. (2016). A novel automatic picture fuzzy clustering method based on particle swarm optimization and picture composite cardinality. Knowledge-Based Systems, 109, 48–60.CrossRefGoogle Scholar
  63. 63.
    Thong, P. H., & Son, L. H. (2016). Picture fuzzy clustering: a new computational intelligence method. Soft Computing, 20(9), 3549–3562.CrossRefGoogle Scholar

Copyright information

© Korean Spatial Information Society 2018

Authors and Affiliations

  1. 1.Computer Science and Engineering DepartmentLNCT CollegeBhopalIndia
  2. 2.VNU Information Technology InstituteVietnam National UniversityHanoiVietnam
  3. 3.School of Computer EngineeringKalinga Institute of Industrial TechnologyBhubaneswarIndia
  4. 4.Department of Computer Science and EngineeringG. B. Pant Engineering CollegeDelhiIndia
  5. 5.Department of Computer Science and EngineeringBharti VidyaPeeth College of EngineeringDelhiIndia

Personalised recommendations