Abstract
In this paper, we propose a framework for measuring the impact of data privacy techniques, in information theoretic and in data mining terms. The need for data privacy and anonymization is often hampered by the fact that the privacy functions alter the data in non-measurable amounts and details. We propose here to use Mutual Information over non-Euclidean spaces as a means of measuring this distortion. In addition, and following the same principle, we also propose to use Machine Learning techniques in order to quantify the impact of the data obfuscation in terms of further data mining goals.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Nissenbaum, H.: A contextual approach to privacy online. Daedalus, 140(4):32–48 (Fall 2011)
Gürses, S., Troncoso, C.G., Diaz, C.: Engineering privacy by design. In: Computers, Privacy and Data Protection (2011)
Oliver, I.: Privacy Engineering: A Data Flow and Ontological Approach. CreateSpace (2014)
Ciriani, V., Capitani di Vimercati, S., Foresti, S., Samarati, P.: \(\kappa \)-anonymity. In: Ting, Y., Jajodia, S. (eds.) Secure Data Management in Decentralized Systems. Advances in Information Security, vol. 33, pp. 323–353. Springer, New York (2007)
Machanavajjhala, A., Gehrke, J., Kifer, D., Venkitasubramaniam, M.: \(\ell \)-diversity: Privacy beyond \(\kappa \)-anonymity. In: 2013 IEEE 29th International Conference on Data Engineering (ICDE), 0:24 (2006)
Dwork, C.: Differential privacy: a survey of results. In: Theory and Applications of Models of Computation. Lecture Notes in Computer Science, vol. 4978, pp. 1–19. Springer, Berlin (2008)
The UK Cabinet Office. Security policy framework (April 2013)
Huang, G., Chen, L., Siew, C.-K., Huang, G.-B., Chen, L., Siew, C.-K.: Universal approximation using incremental constructive feedforward neural networks with random hidden nodes. IEEE Trans. Neural Networks 17(4), 879–892 (2006)
Huang, G.-B., Zhu, Q.-Y., Siew, C.-K.: Extreme learning machine: theory and applications. Neurocomputing 70(1), 489–501 (2006)
Cybenko, G.: Approximations by superpositions of sigmoidal functions. Math. Control Signals Syst. 2(4), 303–314 (1989)
Miche, Y., Sorjamaa, A., Bas, P., Simula, O., Jutten, C., Lendasse, A.: OP-ELM: optimally-pruned extreme learning machine. IEEE Trans. Neural Networks 21(1), 158–162 (2010)
Miche, Y., van Heeswijk, M., Bas, P., Simula, O., Lendasse, A.: TROP-ELM: a double-regularized ELM using LARS and Tikhonov regularization. Neurocomputing 74(16), 2413–2421 (2011)
Van Heeswijk, M., Miche, Y., Oja, E., Lendasse, A.: GPU-accelerated and parallelized ELM ensembles for large-scale regression. Neurocomputing 74(16), 2430–2437 (2011)
Cambria, E., Huang, G.-B., Kasun, L.L.C., Zhou, H., Vong, C.M., Lin, J., Yin, J., Cai, Z., Liu, Q., Li, K., Leung, V.C.M., Feng, L., Ong, Y.-S., Lim, M.-H., Akusok, A., Lendasse, A., Corona, F., Nian, R., Miche, Y., Gastaldo, P., Zunino, R., Decherchi, S., Yang, X., Mao, K., Oh, B.-S., Jeon, J., Toh, K.-A., Teoh, A.B.J., Kim, J., Yu, H., Chen, Y., Liu, J.: Extreme learning machines (trends and controversies). IEEE Intell. Syst. 28(6), 30–59 (2013)
Radhakrishna Rao, C., Mitra, S.K.: Generalized Inverse of Matrices and Its Applications. Wiley, New York (1972)
Kraskov, A., Stögbauer, H., Grassberger, P.: Estimating mutual information. Phys. Rev. E 69, 066138 (2004)
Pál, D., Póczos, B., Szepesvári, C.: Estimation of Rényi entropy and mutual information based on generalized nearest-neighbor graphs. ArXiv e-prints (2010)
Pál, D., Póczos, B., Szepesvári, C.: Estimation of rényi entropy and mutual information based on generalized nearest-neighbor graphs. In: Lafferty, J.D., Williams, C.K.I., Shawe-Taylor, J., Zemel, R.S., Culotta, A. (eds.) Advances in Neural Information Processing Systems 23, pp. 1849–1857. Curran Associates, Inc. (2010)
Kullback, S., Leibler, R.A.: On information and sufficiency. Ann. Math. Stat. 22(1), 79–86 (1951)
Bogachev, V.I., Kolesnikov, A.V.: The Monge-Kantorovich problem: achievements, connections, and perspectives. Russ. Math. Surv. 67, 785–890 (2012)
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2016 Springer International Publishing Switzerland
About this paper
Cite this paper
Miche, Y., Oliver, I., Holtmanns, S., Akusok, A., Lendasse, A., Björk, KM. (2016). On Mutual Information over Non-Euclidean Spaces, Data Mining and Data Privacy Levels. In: Cao, J., Mao, K., Wu, J., Lendasse, A. (eds) Proceedings of ELM-2015 Volume 2. Proceedings in Adaptation, Learning and Optimization, vol 7. Springer, Cham. https://doi.org/10.1007/978-3-319-28373-9_32
Download citation
DOI: https://doi.org/10.1007/978-3-319-28373-9_32
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-28372-2
Online ISBN: 978-3-319-28373-9
eBook Packages: EngineeringEngineering (R0)