Abstract
Although Artificial Intelligent (AI) techniques have been used in various applications, their use in maintaining security in Statistical DataBases (SDBs) has not been reported. This paper presents results, that to the best of our knowledge is pioneering, by which concepts from causal networks are used to secure SDBs. We consider the Micro-Aggregation Problem (MAP) in secure SDBs which involves partitioning a set of individual records in a micro-data file into a number of mutually exclusive and exhaustive groups. This problem, which seeks for the best partition of the micro-data file, is known to be NP-hard, and has been tackled using many heuristic solutions. In this paper, we would like to demonstrate that in the process of developing Micro-Aggregation Techniques (MATs), it is expedient to incorporate AI-based causal information about the dependence between the random variables in the micro-data file. This can be achieved by pre-processing the micro-data before invoking any MAT, in order to extract the useful dependence information from the joint probability distribution of the variables in the micro-data file, and then accomplishing the micro-aggregation on the “maximally independent” variables. Our results, on artificial life data sets, show that including such information will enhance the process of determining how many variables are to be used, and which of them should be used in the micro-aggregation process.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Adam, N., Wortmann, J.: Security-Control Methods for Statistical Databases: A Comparative Study. ACM Computing Surveys 21(4), 515–556 (1989)
Chow, C., Liu, C.: Approximating Discrete Probability Distributions with Dependence Trees. IEEE Trans. Information Theory 14(11), 462–467 (1968)
Domingo-Ferrer, J., Mateo-Sanz, J.: Practical Data-Oriented Microaggregation for Statistical Disclosure Control. IEEE Transactions on Knowledge and Data Engineering 14(1), 189–201 (2002)
Domingo-Ferrer, J., Torra, V.: Aggregation Techniques for Statistical confidentiality. In: Aggregation operators: new trends and applications, pp. 260–271. Physica-Verlag, Germany (2002)
Fayyoumi, E., Oommen, B.: A Fixed Structure Learning Automaton Micro-Aggregation Technique for Secure Statistical Databases. In: Privacy Statistical Databases, Rome, Italy, pp. 114–128 (2006)
Hansen, S., Mukherjee, S.: A Polynomial Algorithm for Univariate Optimal Microaggregation. IEEE Transactions on Knowledge and Data Engineering 15(4), 1043–1044 (2003)
Laszlo, M., Mukherjee, S.: Minimum Spanning Tree Partitioning Algorithm for Microaggregation. IEEE Transactions on Knowledge and Data Engineering 17(7), 902–911 (2005)
Oganian, A., Domingo-Ferrer, J.: On The Complexity of Optimal Microaggregation for Statistical Disclosure Control. Statistical Journal of the United Nations Economic Comission for Europe 18(4), 345–354 (2001)
Oommen, B., Fayyoumi, E.: A Novel Method for Micro-Aggregation in Secure Statistical Databases Using Association and Interaction. In: Qing, S., Imai, H., Wang, G. (eds.) ICICS 2007. LNCS, vol. 4861, pp. 126–140. Springer, Heidelberg (2007)
Oommen, B.J., Fayyoumi, E.: On Utilizing Dependence-Based Information to Enhance Micro-Aggregation for Secure Statistical Databases. Unabridged Version of This Paper (to be submitted)
Sanchez, J., Urrutia, J., Ripoll, E.: Trade-Off between Disclosure Risk and Information Loss Using Multivariate Microaggregation: A Case Study on Business Data. In: Domingo-Ferrer, J., Torra, V. (eds.) PSD 2004. LNCS, vol. 3050, pp. 307–322. Springer, Heidelberg (2004)
Solanas, A., Martínez-Ballesté, A.: V-MDAV: A Multivariate Microaggregation With Variable Group Size. In: 17th COMPSTAT Symposium of the IASC, Rome (2006)
Solanas, A., Martinez-Balleste, A., Mateo-Sanz, J., Domingo-Ferrer, J.: Multivariate Microaggregation Based Genetic Algorithms. In: 3rd International IEEE Conference on Intelligent Systems, pp. 65–70 (2006)
Torra, V., Domingo-Ferrer, J.: Towards Fuzzy C-Means Based Microaggregation. In: Grzegorzewski, P., Hryniewicz, O., Gil, M. (eds.) Advances in Soft Computing: Soft Methods in Probability, Statistics and Data Analysis, pp. 289–294. Physica-Verlag, Germany (2002)
Valiveti, R., Oommen, B.: On Using the Chi-Squared Metric for Determining Stochastic Dependence. Pattern Recognition 25(11), 1389–1400 (1992)
Valiveti, R., Oommen, B.: Determining Stochastic Dependence for Normally Distributed Vectors Using the Chi-squared Metric. Pattern Recognition 26(6), 975–987 (1993)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2008 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Oommen, B.J., Fayyoumi, E. (2008). An AI-Based Causal Strategy for Securing Statistical Databases Using Micro-aggregation. In: Wobcke, W., Zhang, M. (eds) AI 2008: Advances in Artificial Intelligence. AI 2008. Lecture Notes in Computer Science(), vol 5360. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-89378-3_43
Download citation
DOI: https://doi.org/10.1007/978-3-540-89378-3_43
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-89377-6
Online ISBN: 978-3-540-89378-3
eBook Packages: Computer ScienceComputer Science (R0)