Abstract
In today’s information age there is a large availability of repositories storing various types of information about individuals. Data mining technology has emerged as a means of identifying patterns and trends from large quantities of data. The application of data mining technology to identify interesting patterns from these repositories leads to serious privacy concerns. Despite its benefit in a wide range of applications, data mining techniques also have raised a number of ethical issues. Some such issues are privacy, data security, intellectual property rights and many others. In this paper, we address the privacy problem against unauthorized secondary use of information. We focus primarily on privacy preserving data clustering on categorical data. In the proposed method, the categorical data is converted into binary data and it is transformed using geometric data transformation method. Then, clustering using conventional clustering algorithm is done on the transformed data to ensure privacy.
Keywords
- Data Transformation
- Noise Term
- Multiplicative Noise
- Categorical Attribute
- Privacy Preservation
These keywords were added by machine and not by the authors. This process is experimental and the keywords may be updated as the learning algorithm improves.
This is a preview of subscription content, access via your institution.
Buying options
Tax calculation will be finalised at checkout
Purchases are for personal use only
Learn about institutional subscriptionsPreview
Unable to display preview. Download preview PDF.
References
Elisa Bertino, Ravi Sandhu,” Database Security - Concepts, Approaches, and Challenges”, IEEE transactions on dependable and secure computing, vol. 2, no. 1, January-March 2005
J.Han and M. Kamber. Data Mining: Concepts and Techniques. Morgan Kaufmann Publishers, San Francisco, CA,2001.
Stanley R. M. Oliveira, osmar R. Zaiyane ,”Privacy Preserving Clustering by Data Transformation”. In proceedings of 18th Brazilian Conference on Databases,2003
V.S. Verykios, E. Bertino, I.N. Fovino, L.P. Provenza, Y. Saygin, and Y. Theodoridis, “State-of-the-Art in Privacy Preserving Data Mining,” ACM SIGMOD Record, vol. 3, no. 1, Mar. 2004.
K. Chen and L. Liu. Privacy preserving data classification with rotation perturbation. In Proceedings of the Fifth IEEE International Conference on Data Mining (ICDM’05), Houston, TX, November 2005
R. Agrawal and R. Srikant, “Privacy Preserving Data Mining,” Proc. ACM SIGMOD Conf. Management of Data, May 2000
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2007 Springer
About this paper
Cite this paper
Natarajan, D.A., Rajalaxmi, R., Uma, N., Kirubhakar, G. (2007). A Hybrid Data Transformation Approach for Privacy Preserving Clustering of Categorical Data. In: Sobh, T. (eds) Innovations and Advanced Techniques in Computer and Information Sciences and Engineering. Springer, Dordrecht. https://doi.org/10.1007/978-1-4020-6268-1_72
Download citation
DOI: https://doi.org/10.1007/978-1-4020-6268-1_72
Publisher Name: Springer, Dordrecht
Print ISBN: 978-1-4020-6267-4
Online ISBN: 978-1-4020-6268-1
eBook Packages: EngineeringEngineering (R0)
