Abstract
This paper complements the privacy preserving distributed OLAP framework proposed by us in a previous work by introducing four major theoretical properties that extend models and algorithms presented in the previous work, where the experimental validation of the framework has also been reported. Particularly, our framework makes use of the CUR matrix decomposition technique as the elementary component for computing privacy preserving two-dimensional OLAP views effectively and efficiently. Here, we investigate theoretical properties of the CUR decomposition method, and identify four theoretical extensions of this method, which, according to our vision, may result in benefits for a wide spectrum of aspects in the context of privacy preserving distributed OLAP, such as privacy preserving knowledge fruition schemes and query optimization. In addition to this, we also provide a widespread experimental analysis of the framework, which fully confirms to us the major practical achievements, in terms of both efficacy and efficiency, due to our framework.
Keywords
These keywords were added by machine and not by the authors. This process is experimental and the keywords may be updated as the learning algorithm improves.
This is a preview of subscription content, log in via an institution.
Buying options
Tax calculation will be finalised at checkout
Purchases are for personal use only
Learn about institutional subscriptionsPreview
Unable to display preview. Download preview PDF.
References
Agrawal, S., Haritsa, J.R., Prakash, B.A.: FRAPP: A Framework for High-Accuracy Privacy-Preserving Mining. Data Mining and Knowledge Discovery 18(1), 101–139 (2009)
Agrawal, R., Srikant, R., Thomas, D.: Privacy-Preserving OLAP. Proc. of SIGMOD, 251–262 (2005)
Barak, B., Chaudhuri, K., Dwork, C., Kale, S., McSherry, F., Talwar, K.: Privacy, Accuracy, and Consistency Too: A Holistic Solution to Contingency Table Release. In: Proc. of PODS, pp. 273–282 (2007)
Chan, A.C.-F., Castelluccia, C.: A Security Framework for Privacy-Preserving Data Aggregation in Wireless Sensor Networks. ACM Transactions on Sensor Networks 7(4), art. 29 (2011)
Clifton, C., Kantarcioglu, M., Lin, X., Vaidya, J., Zhu, M.: Tools for Privacy Preserving Distributed Data Mining. SIGKDD Explorations 4(2), 28–34 (2002)
Colliat, G.: OLAP, Relational, and Multidimensional Database Systems. SIGMOD Record 25(3), 64–69 (1996)
Cuzzocrea, A.: Accuracy Control in Compressed Multidimensional Data Cubes for Quality of Answer-based OLAP Tools. In: Proc. of SSDBM, pp. 301–310 (2006)
Cuzzocrea, A.: Privacy Preserving OLAP: Models, Issues, Algorithms. In: Proc. of MIPRO, pp. 1538–1543 (2011)
Cuzzocrea, A., Bertino, E.: A Secure Multiparty Computation Privacy Preserving OLAP Framework over Distributed XML Data. In: Proc. of SAC, pp. 1666–1673 (2010)
Cuzzocrea, A., Russo, V.: Privacy Preserving OLAP and OLAP Security. In: Wang, J. (ed.) Encyclopedia of Data Warehousing and Mining, 2nd edn., pp. 1575–1581. IGI Global (2009)
Cuzzocrea, A., Russo, V., Saccà, D.: A robust sampling-based framework for privacy preserving OLAP. In: Song, I.-Y., Eder, J., Nguyen, T.M. (eds.) DaWaK 2008. LNCS, vol. 5182, pp. 97–114. Springer, Heidelberg (2008)
Cuzzocrea, A., Saccà, D.: Balancing Accuracy and Privacy of OLAP Aggregations on Data Cubes. In: Proc. of DOLAP, pp. 93–98 (2010)
Drineas, P., Kannan, R., Mahoney, M.W.: Computing Sketches of Matrices Efficiently and Privacy Preserving Data Mining. In: Proc. of DIMACS PPDM (2004), http://dimacs.rutgers.edu/Workshops/Privacy/
Drineas, P., Kannan, R., Mahoney, M.W.: Fast Monte Carlo algorithms for Matrices III: Computing a Compressed Approximate Matrix Decomposition. SIAM Journal on Computing 36(1), 184–206 (2006)
Dwork, C.: Differential privacy: A survey of results. In: Agrawal, M., Du, D.-Z., Duan, Z., Li, A. (eds.) TAMC 2008. LNCS, vol. 4978, pp. 1–19. Springer, Heidelberg (2008)
Gray, J., Chaudhuri, S., Bosworth, A., Layman, A., Reichart, D., Venkatrao, M., Pellow, F., Pirahesh, H.: Data Cube: A Relational Aggregation Operator Generalizing Group-By, Cross-Tab, and Sub-Totals. Data Mining and Knowledge Discovery 1(1), 29–53 (1997)
Golub, G.H., Van Loan, C.F.: Matrix Computations. Johns Hopkins University Press (1989)
Han, J., Pei, J., Dong, G., Wang, K.: Efficient Computation of Iceberg Cubes with Complex Measures. Proc. of SIGMOD, 1–12 (2001)
He, W., Liu, X., Nguyen, H., Nahrstedt, K., Abdelzaher, T.: PDA: Privacy-Preserving Data Aggregation for Information Collection. ACM Transactions on Sensor Networks 8(1), art. 6 (2011)
Hua, M., Zhang, S., Wang, W., Zhou, H., Shi, B.-L.: FMC: An approach for privacy preserving OLAP. In: Tjoa, A.M., Trujillo, J. (eds.) DaWaK 2005. LNCS, vol. 3589, pp. 408–417. Springer, Heidelberg (2005)
Jiang, W., Clifton, C.: A Secure Distributed Framework for Achieving k-Anonymity. Very Large Data Bases Journal 15(4), 316–333 (2006)
Jurczyk, P., Xiong, L.: Distributed anonymization: Achieving privacy for both data subjects and data providers. In: Gudes, E., Vaidya, J. (eds.) Data and Applications Security XXIII. LNCS, vol. 5645, pp. 191–207. Springer, Heidelberg (2009)
Li, F., Luo, B., Liu, P.: Secure and Privacy-Preserving Information Aggregation for Smart Grids. International Journal of Security and Networks 6(1), 28–39 (2011)
Lin, X., Lu, R., Shen, X.: MDPA: Multidimensional Privacy-Preserving Aggregation Scheme for Wireless Sensor Networks. Wireless Communications and Mobile Computing 10(6), 843–856 (2010)
Liu, Y., Sung, S.Y., Xiong, H.: A Cubic-Wise Balance Approach for Privacy Preservation in Data Cubes. Information Sciences 176(9), 1215–1240 (2006)
Mohammed, N., Fung, B.C.M., Hung, P.C.K., Lee, C.-K.: Centralized and Distributed Anonymization for High-Dimensional Healthcare Data. ACM Transactions on Knowledge Discovery from Data 4(4), art. 18 (2010)
Papoulis, A.: Probability, Random Variables, and Stochastic Processes. McGraw-Hill (1984)
Pinkas, B.: Cryptographic Techniques for Privacy-Preserving Data Mining. SIGKDD Explorations 4(2), 12–19 (2002)
Sung, S.Y., Liu, Y., Xiong, H., Ng, P.A.: Privacy Preservation for Data Cubes. Knowledge and Information Systems 9(1), 38–61 (2006)
Thompson, S.K., Seber, G.A.F.: Adaptive Sampling. John Wiley & Sons (1996)
Tong, Y., Sun, G., Zhang, P., Tang, S.: Privacy-Preserving OLAP based on Output Perturbation Across Multiple Sites. In: Proc. of PST, p. 46 (2006)
Wang, L., Jajodia, S., Wijesekera, D.: Securing OLAP Data Cubes against Privacy Breaches. In: Proc. of SP, pp. 161–175 (2004)
Wang, L., Wijesekera, D., Jajodia, S.: Cardinality-based Inference Control in Data Cubes. Journal of Computer Security 12(5), 655–692 (2004)
Zhang, N., Zhao, W., Chen, J.: Cardinality-based Inference Control in OLAP Systems: An Information Theoretic Approach. In: Proc. of DOLAP, pp. 59–64 (2004)
Zipf, G.K.: Human Behaviour and the Principle of Least Effort: an Introduction to Human Ecology. Addison-Wesley (1949)
Schmidt, A., Waas, F., Kersten, M.L., Carey, M.J., Manolescu, I., Busse, R.: XMark: A Benchmark for XML Data Management. In: Proceedings of the 28th International Conference on Very Large Data Bases, pp. 974–985 (2002)
Transaction Processing Performance Council (2004), http://www.tpc.org/tpch/default.asp
Yao, B.B., Özsu, M.T., Khandelwal, N.: XBench Benchmark and Performance Testing of XML DBMSs. In: Proceedings of the 20th IEEE International Conference on Data Engineering, pp. 621–632 (2004)
University of Pennsylvania, The Penn Treebank Project (2002), http://www.cis.upenn.edu/~treebank/
Swiss Institute of Bioinformatics, Swiss-Prot Protein Knowledgebase (2005), http://www.expasy.ch/sprot/
GSFC/NASA XML Project, NASA (2003), http://xml.gsfc.nasa.gov
Cuzzocrea, A., Bertino, E., Saccà, D.: Towards A Theory for Privacy Preserving Distributed OLAP. In: Proceedings of the EDBT/ICDT Workshops, pp. 221–226 (2012)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2014 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Cuzzocrea, A., Bertino, E. (2014). A Comprehensive Theoretical Framework for Privacy Preserving Distributed OLAP. In: Meersman, R., et al. On the Move to Meaningful Internet Systems: OTM 2014 Workshops. OTM 2014. Lecture Notes in Computer Science, vol 8842. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-662-45550-0_16
Download citation
DOI: https://doi.org/10.1007/978-3-662-45550-0_16
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-662-45549-4
Online ISBN: 978-3-662-45550-0
eBook Packages: Computer ScienceComputer Science (R0)