Skip to main content

An AI-Based Causal Strategy for Securing Statistical Databases Using Micro-aggregation

  • Conference paper
AI 2008: Advances in Artificial Intelligence (AI 2008)

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 5360))

Included in the following conference series:

  • 1809 Accesses

Abstract

Although Artificial Intelligent (AI) techniques have been used in various applications, their use in maintaining security in Statistical DataBases (SDBs) has not been reported. This paper presents results, that to the best of our knowledge is pioneering, by which concepts from causal networks are used to secure SDBs. We consider the Micro-Aggregation Problem (MAP) in secure SDBs which involves partitioning a set of individual records in a micro-data file into a number of mutually exclusive and exhaustive groups. This problem, which seeks for the best partition of the micro-data file, is known to be NP-hard, and has been tackled using many heuristic solutions. In this paper, we would like to demonstrate that in the process of developing Micro-Aggregation Techniques (MATs), it is expedient to incorporate AI-based causal information about the dependence between the random variables in the micro-data file. This can be achieved by pre-processing the micro-data before invoking any MAT, in order to extract the useful dependence information from the joint probability distribution of the variables in the micro-data file, and then accomplishing the micro-aggregation on the “maximally independent” variables. Our results, on artificial life data sets, show that including such information will enhance the process of determining how many variables are to be used, and which of them should be used in the micro-aggregation process.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 84.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 109.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Adam, N., Wortmann, J.: Security-Control Methods for Statistical Databases: A Comparative Study. ACM Computing Surveys 21(4), 515–556 (1989)

    Article  Google Scholar 

  2. Chow, C., Liu, C.: Approximating Discrete Probability Distributions with Dependence Trees. IEEE Trans. Information Theory 14(11), 462–467 (1968)

    Article  MATH  Google Scholar 

  3. Domingo-Ferrer, J., Mateo-Sanz, J.: Practical Data-Oriented Microaggregation for Statistical Disclosure Control. IEEE Transactions on Knowledge and Data Engineering 14(1), 189–201 (2002)

    Article  Google Scholar 

  4. Domingo-Ferrer, J., Torra, V.: Aggregation Techniques for Statistical confidentiality. In: Aggregation operators: new trends and applications, pp. 260–271. Physica-Verlag, Germany (2002)

    Chapter  Google Scholar 

  5. Fayyoumi, E., Oommen, B.: A Fixed Structure Learning Automaton Micro-Aggregation Technique for Secure Statistical Databases. In: Privacy Statistical Databases, Rome, Italy, pp. 114–128 (2006)

    Google Scholar 

  6. Hansen, S., Mukherjee, S.: A Polynomial Algorithm for Univariate Optimal Microaggregation. IEEE Transactions on Knowledge and Data Engineering 15(4), 1043–1044 (2003)

    Article  Google Scholar 

  7. Laszlo, M., Mukherjee, S.: Minimum Spanning Tree Partitioning Algorithm for Microaggregation. IEEE Transactions on Knowledge and Data Engineering 17(7), 902–911 (2005)

    Article  Google Scholar 

  8. Oganian, A., Domingo-Ferrer, J.: On The Complexity of Optimal Microaggregation for Statistical Disclosure Control. Statistical Journal of the United Nations Economic Comission for Europe 18(4), 345–354 (2001)

    Google Scholar 

  9. Oommen, B., Fayyoumi, E.: A Novel Method for Micro-Aggregation in Secure Statistical Databases Using Association and Interaction. In: Qing, S., Imai, H., Wang, G. (eds.) ICICS 2007. LNCS, vol. 4861, pp. 126–140. Springer, Heidelberg (2007)

    Chapter  Google Scholar 

  10. Oommen, B.J., Fayyoumi, E.: On Utilizing Dependence-Based Information to Enhance Micro-Aggregation for Secure Statistical Databases. Unabridged Version of This Paper (to be submitted)

    Google Scholar 

  11. Sanchez, J., Urrutia, J., Ripoll, E.: Trade-Off between Disclosure Risk and Information Loss Using Multivariate Microaggregation: A Case Study on Business Data. In: Domingo-Ferrer, J., Torra, V. (eds.) PSD 2004. LNCS, vol. 3050, pp. 307–322. Springer, Heidelberg (2004)

    Chapter  Google Scholar 

  12. Solanas, A., Martínez-Ballesté, A.: V-MDAV: A Multivariate Microaggregation With Variable Group Size. In: 17th COMPSTAT Symposium of the IASC, Rome (2006)

    Google Scholar 

  13. Solanas, A., Martinez-Balleste, A., Mateo-Sanz, J., Domingo-Ferrer, J.: Multivariate Microaggregation Based Genetic Algorithms. In: 3rd International IEEE Conference on Intelligent Systems, pp. 65–70 (2006)

    Google Scholar 

  14. Torra, V., Domingo-Ferrer, J.: Towards Fuzzy C-Means Based Microaggregation. In: Grzegorzewski, P., Hryniewicz, O., Gil, M. (eds.) Advances in Soft Computing: Soft Methods in Probability, Statistics and Data Analysis, pp. 289–294. Physica-Verlag, Germany (2002)

    Google Scholar 

  15. Valiveti, R., Oommen, B.: On Using the Chi-Squared Metric for Determining Stochastic Dependence. Pattern Recognition 25(11), 1389–1400 (1992)

    Article  Google Scholar 

  16. Valiveti, R., Oommen, B.: Determining Stochastic Dependence for Normally Distributed Vectors Using the Chi-squared Metric. Pattern Recognition 26(6), 975–987 (1993)

    Article  MathSciNet  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2008 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Oommen, B.J., Fayyoumi, E. (2008). An AI-Based Causal Strategy for Securing Statistical Databases Using Micro-aggregation. In: Wobcke, W., Zhang, M. (eds) AI 2008: Advances in Artificial Intelligence. AI 2008. Lecture Notes in Computer Science(), vol 5360. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-89378-3_43

Download citation

  • DOI: https://doi.org/10.1007/978-3-540-89378-3_43

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-89377-6

  • Online ISBN: 978-3-540-89378-3

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics