Skip to main content
Log in

On analysis of time-series data with preserved privacy

  • S.I. : ICACNI 2014
  • Published:
Innovations in Systems and Software Engineering Aims and scope Submit manuscript

Abstract

Time-series data analysis with privacy preservation is an open and challenging issue. To name a few are like analyzing company’s confidential financial data, individual’s health-related data, electricity consumption of individual’s households and so on. Due to the complex nature of time-series data, analyzing such data without any revelation of sensitive information to adversaries is a pervasive task. Here, we have addressed the issue of analyzing numerical time-series of equal length with preserved privacy. Considering the Discrete Wavelet Transform as a suitable technique for transforming time-series in frequency–time representation, we have applied the concept in privacy-preserving analysis of such data. Experimental results show that our proposed method is superior to the existing methods in preserving the trade-off between data utility and privacy. The privacy models developed using the proposed method are also evaluated in terms of clustering and classification accuracies obtained from perturbed time-series data.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig. 3
Fig. 4
Fig. 5
Fig. 6
Fig. 7
Fig. 8
Fig. 9

Similar content being viewed by others

References

  1. Aggarwal CC, Pei J, Zhang B (2006) On privacy preservation against adversarial data mining. In: Proceedings of the 12th ACM SIGKDD international conference on knowledge discovery and data mining—KDD ’06. ACM Press, New York, USA, pp 510–516

  2. Agrawal R, Srikant R (2000) Privacy-preserving data mining. ACM Sigmod Rec 29(2):439–450

    Article  Google Scholar 

  3. Chaovalit P, Gangopadhyay A, Karabatis G, Chen Z (2011) Discrete wavelet transform-based time series analysis and mining. ACM Comput Surv 43(2):6:1–6:37

    Article  Google Scholar 

  4. Chettri SK, Borah B (2013) An efficient microaggregation method for protecting mixed data. In: Chaki N, Meghanathan N, Nagamalai D (eds) Computer networks & communications (NetCom), Lecture notes in electrical engineering, vol 131. Springer, New York, pp 551–561

  5. Ciaccia P, Patella M, Zezula P (1997) M-tree: an efficient access method for similarity search in metric spaces. In: Proceedings of the international conference on very large data bases, vol 23. Morgan Kaufmann Pub, pp 426–435

  6. Domingo-Ferrer J, Torra V (2005) Ordinal, continuous and heterogeneous \(k\)-anonymity through microaggregation. Data Min Knowl Discov 11(2):195–212

    Article  MathSciNet  MATH  Google Scholar 

  7. Donoho DL (1995) De-noising by soft-thresholding. Inf Theory IEEE Trans Inf Theory 41(3):613–627

    Article  MathSciNet  MATH  Google Scholar 

  8. Frank A, Asuncion A (2010) UCI machine learning repository. http://archive.ics.uci.edu/ml. Accessed 20 Sept 2013

  9. Fu T (2011) A review on time series data mining. Eng Appl Artif Intell 24(1):164–181

    Article  Google Scholar 

  10. Hea-Suk K, Yang-Sae M (2010) Fourier magnitude-based privacy-preserving clustering on time-series data. IEICE Trans Inf Syst 93(6):1648–1651

    Google Scholar 

  11. Inan A, Kantarcioglu M, Bertino E (2009) Using anonymized data for classification. In: IEEE 25th international conference on data engineering (ICDE’09). IEEE, pp 429–440

  12. Keogh E, Folias T (2002) The ucr time series data mining archive. In: Computer science & engineering department. University of California, Riverside. http://www.cs.ucr.edu/eamonn/TSDMA/index.html. Accessed 15 Sept 2013

  13. Kim Hea-Suk, Choi M-Jung, Moon Yang-Sae (2012) Publishing sensitive time-series data under preservation of privacy and distance orders. Int J Innov Comput Inf Control 8(5(B)):3619–3638

    Google Scholar 

  14. Liao T Warren (2005) Clustering of time series data survey. Pattern Recogn 38(11):1857–1874

    Article  MATH  Google Scholar 

  15. Moller Levet CS, Klawonn F, Cho KH, Wolkenhauer O (2003) Fuzzy clustering of short time series and unevenly distributed sampling points. In: Proceedings of the fifth international symposium on intelligent data analysis, Berlin, Germany, pp 330–340

  16. Mukherjee Shibnath, Chen Zhiyuan, Gangopadhyay Aryya (2006) A privacy-preserving technique for euclidean distance-based mining algorithms using fourier-related transforms. VLDB J 15(4):293–315

    Article  MATH  Google Scholar 

  17. Nin J, Torra V (2006) Extending microaggregation procedures for time-series data. In: Greco S, Hata Y, Hirano S, Inuiguchi M, Miyamoto S, Nguyen HS, Slowinski R (eds) Rough sets and current trends in computing 2006, LNCS, vol 4259. Springer, Heidelberg, pp 899–908

  18. NSE (2015) National stock exchange of india limited. (nse, india). http://www.nseindia.com/. Accessed 21 Nov 2012

  19. Papadimitriou S, Li F, Kollios G, Yu PS (2007) Time series compressibility and privacy. In: Proceedings of the 33rd international conference on very large data bases. VLDB Endowment, pp 459–470

  20. Singhal Ashish, Seborg Dale E (2005) Clustering multivariate time series data. J Chem 19(8):427–438

    Article  Google Scholar 

  21. Sweeney Latanya (2002) \(k\)-Anonymity: a model for protecting privacy. Int J Uncertain Fuzzy Knowl-based Syst 10(5):1–14

    Google Scholar 

  22. Wang X, Smith KA, Hyndman R, Alahakoon D (2004) A scalable method for time series clustering. In: Tech report, Department of econometrics and business systems. Monash University, Victoria

  23. Zhu Y, Fu Y, Fu H (2008) On privacy in time series data mining. In: Advances in knowledge discovery and data mining. Springer, Berlin, pp 479–493

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Sarat Kumar Chettri.

Rights and permissions

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Chettri, S.K., Borah, B. On analysis of time-series data with preserved privacy. Innovations Syst Softw Eng 11, 155–165 (2015). https://doi.org/10.1007/s11334-015-0249-3

Download citation

  • Received:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s11334-015-0249-3

Keywords

Navigation