Abstract
Time-series data analysis with privacy preservation is an open and challenging issue. To name a few are like analyzing company’s confidential financial data, individual’s health-related data, electricity consumption of individual’s households and so on. Due to the complex nature of time-series data, analyzing such data without any revelation of sensitive information to adversaries is a pervasive task. Here, we have addressed the issue of analyzing numerical time-series of equal length with preserved privacy. Considering the Discrete Wavelet Transform as a suitable technique for transforming time-series in frequency–time representation, we have applied the concept in privacy-preserving analysis of such data. Experimental results show that our proposed method is superior to the existing methods in preserving the trade-off between data utility and privacy. The privacy models developed using the proposed method are also evaluated in terms of clustering and classification accuracies obtained from perturbed time-series data.
Similar content being viewed by others
References
Aggarwal CC, Pei J, Zhang B (2006) On privacy preservation against adversarial data mining. In: Proceedings of the 12th ACM SIGKDD international conference on knowledge discovery and data mining—KDD ’06. ACM Press, New York, USA, pp 510–516
Agrawal R, Srikant R (2000) Privacy-preserving data mining. ACM Sigmod Rec 29(2):439–450
Chaovalit P, Gangopadhyay A, Karabatis G, Chen Z (2011) Discrete wavelet transform-based time series analysis and mining. ACM Comput Surv 43(2):6:1–6:37
Chettri SK, Borah B (2013) An efficient microaggregation method for protecting mixed data. In: Chaki N, Meghanathan N, Nagamalai D (eds) Computer networks & communications (NetCom), Lecture notes in electrical engineering, vol 131. Springer, New York, pp 551–561
Ciaccia P, Patella M, Zezula P (1997) M-tree: an efficient access method for similarity search in metric spaces. In: Proceedings of the international conference on very large data bases, vol 23. Morgan Kaufmann Pub, pp 426–435
Domingo-Ferrer J, Torra V (2005) Ordinal, continuous and heterogeneous \(k\)-anonymity through microaggregation. Data Min Knowl Discov 11(2):195–212
Donoho DL (1995) De-noising by soft-thresholding. Inf Theory IEEE Trans Inf Theory 41(3):613–627
Frank A, Asuncion A (2010) UCI machine learning repository. http://archive.ics.uci.edu/ml. Accessed 20 Sept 2013
Fu T (2011) A review on time series data mining. Eng Appl Artif Intell 24(1):164–181
Hea-Suk K, Yang-Sae M (2010) Fourier magnitude-based privacy-preserving clustering on time-series data. IEICE Trans Inf Syst 93(6):1648–1651
Inan A, Kantarcioglu M, Bertino E (2009) Using anonymized data for classification. In: IEEE 25th international conference on data engineering (ICDE’09). IEEE, pp 429–440
Keogh E, Folias T (2002) The ucr time series data mining archive. In: Computer science & engineering department. University of California, Riverside. http://www.cs.ucr.edu/eamonn/TSDMA/index.html. Accessed 15 Sept 2013
Kim Hea-Suk, Choi M-Jung, Moon Yang-Sae (2012) Publishing sensitive time-series data under preservation of privacy and distance orders. Int J Innov Comput Inf Control 8(5(B)):3619–3638
Liao T Warren (2005) Clustering of time series data survey. Pattern Recogn 38(11):1857–1874
Moller Levet CS, Klawonn F, Cho KH, Wolkenhauer O (2003) Fuzzy clustering of short time series and unevenly distributed sampling points. In: Proceedings of the fifth international symposium on intelligent data analysis, Berlin, Germany, pp 330–340
Mukherjee Shibnath, Chen Zhiyuan, Gangopadhyay Aryya (2006) A privacy-preserving technique for euclidean distance-based mining algorithms using fourier-related transforms. VLDB J 15(4):293–315
Nin J, Torra V (2006) Extending microaggregation procedures for time-series data. In: Greco S, Hata Y, Hirano S, Inuiguchi M, Miyamoto S, Nguyen HS, Slowinski R (eds) Rough sets and current trends in computing 2006, LNCS, vol 4259. Springer, Heidelberg, pp 899–908
NSE (2015) National stock exchange of india limited. (nse, india). http://www.nseindia.com/. Accessed 21 Nov 2012
Papadimitriou S, Li F, Kollios G, Yu PS (2007) Time series compressibility and privacy. In: Proceedings of the 33rd international conference on very large data bases. VLDB Endowment, pp 459–470
Singhal Ashish, Seborg Dale E (2005) Clustering multivariate time series data. J Chem 19(8):427–438
Sweeney Latanya (2002) \(k\)-Anonymity: a model for protecting privacy. Int J Uncertain Fuzzy Knowl-based Syst 10(5):1–14
Wang X, Smith KA, Hyndman R, Alahakoon D (2004) A scalable method for time series clustering. In: Tech report, Department of econometrics and business systems. Monash University, Victoria
Zhu Y, Fu Y, Fu H (2008) On privacy in time series data mining. In: Advances in knowledge discovery and data mining. Springer, Berlin, pp 479–493
Author information
Authors and Affiliations
Corresponding author
Rights and permissions
About this article
Cite this article
Chettri, S.K., Borah, B. On analysis of time-series data with preserved privacy. Innovations Syst Softw Eng 11, 155–165 (2015). https://doi.org/10.1007/s11334-015-0249-3
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s11334-015-0249-3