Skip to main content

A Hybrid Data Transformation Approach for Privacy Preserving Clustering of Categorical Data

  • Conference paper

Abstract

In today’s information age there is a large availability of repositories storing various types of information about individuals. Data mining technology has emerged as a means of identifying patterns and trends from large quantities of data. The application of data mining technology to identify interesting patterns from these repositories leads to serious privacy concerns. Despite its benefit in a wide range of applications, data mining techniques also have raised a number of ethical issues. Some such issues are privacy, data security, intellectual property rights and many others. In this paper, we address the privacy problem against unauthorized secondary use of information. We focus primarily on privacy preserving data clustering on categorical data. In the proposed method, the categorical data is converted into binary data and it is transformed using geometric data transformation method. Then, clustering using conventional clustering algorithm is done on the transformed data to ensure privacy.

Keywords

  • Data Transformation
  • Noise Term
  • Multiplicative Noise
  • Categorical Attribute
  • Privacy Preservation

These keywords were added by machine and not by the authors. This process is experimental and the keywords may be updated as the learning algorithm improves.

This is a preview of subscription content, access via your institution.

Buying options

Chapter
USD   29.95
Price excludes VAT (Canada)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD   129.00
Price excludes VAT (Canada)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD   169.99
Price excludes VAT (Canada)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info
Hardcover Book
USD   169.99
Price excludes VAT (Canada)
  • Durable hardcover edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Learn about institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Elisa Bertino, Ravi Sandhu,” Database Security - Concepts, Approaches, and Challenges”, IEEE transactions on dependable and secure computing, vol. 2, no. 1, January-March 2005

    CrossRef  Google Scholar 

  2. J.Han and M. Kamber. Data Mining: Concepts and Techniques. Morgan Kaufmann Publishers, San Francisco, CA,2001.

    Google Scholar 

  3. Stanley R. M. Oliveira, osmar R. Zaiyane ,”Privacy Preserving Clustering by Data Transformation”. In proceedings of 18th Brazilian Conference on Databases,2003

    Google Scholar 

  4. V.S. Verykios, E. Bertino, I.N. Fovino, L.P. Provenza, Y. Saygin, and Y. Theodoridis, “State-of-the-Art in Privacy Preserving Data Mining,” ACM SIGMOD Record, vol. 3, no. 1, Mar. 2004.

    Google Scholar 

  5. K. Chen and L. Liu. Privacy preserving data classification with rotation perturbation. In Proceedings of the Fifth IEEE International Conference on Data Mining (ICDM’05), Houston, TX, November 2005

    Google Scholar 

  6. R. Agrawal and R. Srikant, “Privacy Preserving Data Mining,” Proc. ACM SIGMOD Conf. Management of Data, May 2000

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and Permissions

Copyright information

© 2007 Springer

About this paper

Cite this paper

Natarajan, D.A., Rajalaxmi, R., Uma, N., Kirubhakar, G. (2007). A Hybrid Data Transformation Approach for Privacy Preserving Clustering of Categorical Data. In: Sobh, T. (eds) Innovations and Advanced Techniques in Computer and Information Sciences and Engineering. Springer, Dordrecht. https://doi.org/10.1007/978-1-4020-6268-1_72

Download citation

  • DOI: https://doi.org/10.1007/978-1-4020-6268-1_72

  • Publisher Name: Springer, Dordrecht

  • Print ISBN: 978-1-4020-6267-4

  • Online ISBN: 978-1-4020-6268-1

  • eBook Packages: EngineeringEngineering (R0)

Publish with us

Policies and ethics