Skip to main content

Data Anonymization: Techniques and Models

  • Conference paper
  • First Online:
Marketing and Smart Technologies (ICMarkTech 2022)

Abstract

Data growth is exponential and nearly immeasurable. We used to talk about megabytes when we spoke about data, but now we talk about petabytes with BigData. This data growth makes sensitive data and identifiers increasingly exposed. To address this issue, there is anonymization data, which attempts to “mask” the data so that it is nearly difficult to identify and correlate persons with them; yet, the data remains usable for statistical reasons, among other things. To avoid falling behind in these technical difficulties, many businesses employ free, open-source software. However, this software is not always secure or meets the user’s expectations. The goal of OSSpal is to normalize these concerns.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 229.00
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Hardcover Book
USD 299.99
Price excludes VAT (USA)
  • Durable hardcover edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Similar content being viewed by others

References

  1. Bayardo, R., & Agrawal, R. (2005). Data privacy through optimal k-anonymization. In 21st International Conference on Data Engineering (ICDE’05) (pp. 217–228)

    Google Scholar 

  2. Bingchun, L., & Guohua, L. (2011). The classification of k-anonymity data. In 2011 Seventh International Conference on Computational Intelligence and Security (pp. 1374–1378)

    Google Scholar 

  3. Carvalho, G., Mykolyshyn, S., Cabral, B., Bernardino, J., & Pereira, V. (2022). Comparative analysis of data modeling design tools. IEEE Access, 10, 3351–3365.

    Article  Google Scholar 

  4. Enam, M. A., Sakib, S., & Rahman, M. S. (2019). An algorithm for l-diversity clustering of a point-set. In 2019 International Conference on Electrical, Computer and Communication Engineering (ECCE) (pp. 1–6)

    Google Scholar 

  5. Karle, T., & Vora, D. (2017). Privacy preservation in big data using anonymization techniques. In 2017 International Conference on Data Management, Analytics and Innovation (ICDMAI) (pp. 340–343)

    Google Scholar 

  6. Kohlmayer, F., Prasser, F., Eckert, C., Kemper, A., & Kuhn, K. A. (2012). Flash: Efficient, stable and optimal k-anonymity. In 2012 International Conference on Privacy, Security, Risk and Trust and 2012 International Conference on Social Computing (pp. 708–717)

    Google Scholar 

  7. Li, T., Li, N., & Zhang, J. (2009). Modeling and integrating background knowledge in data anonymization. In 2009 IEEE 25th International Conference on Data Engineering (pp. 6–17)

    Google Scholar 

  8. Majeed, A., & Lee, S. (2021). Anonymization techniques for privacy preserving data publishing: A comprehensive survey. IEEE Access, 9, 8512–8545.

    Article  Google Scholar 

  9. Murthy, S., Abu Bakar, A., Abdul Rahim, F., & Ramli, R. (2019). A comparative study of data anonymization techniques. In 2019 IEEE 5th Intl Conference on Big Data Security on Cloud (BigDataSecurity), IEEE Intl Conference on High Performance and Smart Computing, (HPSC) and IEEE Intl Conference on Intelligent Data and Security (IDS) (pp. 306–309)

    Google Scholar 

  10. F. Prasser, J. Eicher, R. Bild, H. Spengler, and K. A. Kuhn. A tool for optimizing de-identified health data for use in statistical classification. In 2017 IEEE 30th International Symposium on Computer-Based Medical Systems (CBMS), pages 169– 174, 2017.

    Google Scholar 

  11. Raghunathan, B. (2013). The complete book of data anonymization: From planning to implementation. CRC Press

    Google Scholar 

  12. Susan, V. S., & Christopher, T. (2022). Anatomisation with slicing: a new privacy preservation approach for multiple sensitive attributes. Springerplus, 5(1), 964

    Google Scholar 

  13. Zhang, X., Qi, L., Dou, W., He, Q., Leckie, C., Kotagiri, R., & Salcic, Z. (2022). Mrmondrian: Scalable multidimensional anonymisation for big data privacy preservation. IEEE Transactions on Big Data, 8(1), 125–139.

    Article  Google Scholar 

Download references

Acknowledgements

“This work is funded by National Funds through the FCT—Foundation for Science and Technology, IP, within the scope of the project Ref UIDB/05583/2020. Furthermore, we would like to thank the Research Centre in Digital Services (CISeD), the Polytechnic of Viseu, for their support.”

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Pedro Martins .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2024 The Author(s), under exclusive license to Springer Nature Singapore Pte Ltd.

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

Monteiro, S. et al. (2024). Data Anonymization: Techniques and Models. In: Reis, J.L., Del Rio Araujo, M., Reis, L.P., dos Santos, J.P.M. (eds) Marketing and Smart Technologies. ICMarkTech 2022. Smart Innovation, Systems and Technologies, vol 344. Springer, Singapore. https://doi.org/10.1007/978-981-99-0333-7_6

Download citation

  • DOI: https://doi.org/10.1007/978-981-99-0333-7_6

  • Published:

  • Publisher Name: Springer, Singapore

  • Print ISBN: 978-981-99-0332-0

  • Online ISBN: 978-981-99-0333-7

  • eBook Packages: EngineeringEngineering (R0)

Publish with us

Policies and ethics