On Mutual Information over Non-Euclidean Spaces, Data Mining and Data Privacy Levels

Miche, Yoan; Oliver, Ian; Holtmanns, Silke; Akusok, Anton; Lendasse, Amaury; Björk, Kaj-Mikael

doi:10.1007/978-3-319-28373-9_32

Yoan Miche⁷,
Ian Oliver⁷,
Silke Holtmanns⁷,
Anton Akusok⁸,
Amaury Lendasse⁸ &
…
Kaj-Mikael Björk⁹

Part of the book series: Proceedings in Adaptation, Learning and Optimization ((PALO,volume 7))

1141 Accesses

Abstract

In this paper, we propose a framework for measuring the impact of data privacy techniques, in information theoretic and in data mining terms. The need for data privacy and anonymization is often hampered by the fact that the privacy functions alter the data in non-measurable amounts and details. We propose here to use Mutual Information over non-Euclidean spaces as a means of measuring this distortion. In addition, and following the same principle, we also propose to use Machine Learning techniques in order to quantify the impact of the data obfuscation in terms of further data mining goals.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 169.00; Price excludes VAT (USA)

Softcover Book: USD 219.99; Price excludes VAT (USA)

Hardcover Book: USD 219.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

Nissenbaum, H.: A contextual approach to privacy online. Daedalus, 140(4):32–48 (Fall 2011)
Google Scholar
Gürses, S., Troncoso, C.G., Diaz, C.: Engineering privacy by design. In: Computers, Privacy and Data Protection (2011)
Google Scholar
Oliver, I.: Privacy Engineering: A Data Flow and Ontological Approach. CreateSpace (2014)
Google Scholar
Ciriani, V., Capitani di Vimercati, S., Foresti, S., Samarati, P.: \(\kappa \)-anonymity. In: Ting, Y., Jajodia, S. (eds.) Secure Data Management in Decentralized Systems. Advances in Information Security, vol. 33, pp. 323–353. Springer, New York (2007)
Google Scholar
Machanavajjhala, A., Gehrke, J., Kifer, D., Venkitasubramaniam, M.: \(\ell \)-diversity: Privacy beyond \(\kappa \)-anonymity. In: 2013 IEEE 29th International Conference on Data Engineering (ICDE), 0:24 (2006)
Google Scholar
Dwork, C.: Differential privacy: a survey of results. In: Theory and Applications of Models of Computation. Lecture Notes in Computer Science, vol. 4978, pp. 1–19. Springer, Berlin (2008)
Google Scholar
The UK Cabinet Office. Security policy framework (April 2013)
Google Scholar
Huang, G., Chen, L., Siew, C.-K., Huang, G.-B., Chen, L., Siew, C.-K.: Universal approximation using incremental constructive feedforward neural networks with random hidden nodes. IEEE Trans. Neural Networks 17(4), 879–892 (2006)
Google Scholar
Huang, G.-B., Zhu, Q.-Y., Siew, C.-K.: Extreme learning machine: theory and applications. Neurocomputing 70(1), 489–501 (2006)
Google Scholar
Cybenko, G.: Approximations by superpositions of sigmoidal functions. Math. Control Signals Syst. 2(4), 303–314 (1989)
Article MathSciNet MATH Google Scholar
Miche, Y., Sorjamaa, A., Bas, P., Simula, O., Jutten, C., Lendasse, A.: OP-ELM: optimally-pruned extreme learning machine. IEEE Trans. Neural Networks 21(1), 158–162 (2010)
Google Scholar
Miche, Y., van Heeswijk, M., Bas, P., Simula, O., Lendasse, A.: TROP-ELM: a double-regularized ELM using LARS and Tikhonov regularization. Neurocomputing 74(16), 2413–2421 (2011)
Google Scholar
Van Heeswijk, M., Miche, Y., Oja, E., Lendasse, A.: GPU-accelerated and parallelized ELM ensembles for large-scale regression. Neurocomputing 74(16), 2430–2437 (2011)
Google Scholar
Cambria, E., Huang, G.-B., Kasun, L.L.C., Zhou, H., Vong, C.M., Lin, J., Yin, J., Cai, Z., Liu, Q., Li, K., Leung, V.C.M., Feng, L., Ong, Y.-S., Lim, M.-H., Akusok, A., Lendasse, A., Corona, F., Nian, R., Miche, Y., Gastaldo, P., Zunino, R., Decherchi, S., Yang, X., Mao, K., Oh, B.-S., Jeon, J., Toh, K.-A., Teoh, A.B.J., Kim, J., Yu, H., Chen, Y., Liu, J.: Extreme learning machines (trends and controversies). IEEE Intell. Syst. 28(6), 30–59 (2013)
Google Scholar
Radhakrishna Rao, C., Mitra, S.K.: Generalized Inverse of Matrices and Its Applications. Wiley, New York (1972)
Google Scholar
Kraskov, A., Stögbauer, H., Grassberger, P.: Estimating mutual information. Phys. Rev. E 69, 066138 (2004)
Google Scholar
Pál, D., Póczos, B., Szepesvári, C.: Estimation of Rényi entropy and mutual information based on generalized nearest-neighbor graphs. ArXiv e-prints (2010)
Google Scholar
Pál, D., Póczos, B., Szepesvári, C.: Estimation of rényi entropy and mutual information based on generalized nearest-neighbor graphs. In: Lafferty, J.D., Williams, C.K.I., Shawe-Taylor, J., Zemel, R.S., Culotta, A. (eds.) Advances in Neural Information Processing Systems 23, pp. 1849–1857. Curran Associates, Inc. (2010)
Google Scholar
Kullback, S., Leibler, R.A.: On information and sufficiency. Ann. Math. Stat. 22(1), 79–86 (1951)
Article MathSciNet MATH Google Scholar
Bogachev, V.I., Kolesnikov, A.V.: The Monge-Kantorovich problem: achievements, connections, and perspectives. Russ. Math. Surv. 67, 785–890 (2012)
Article MathSciNet MATH Google Scholar

Download references

Author information

Authors and Affiliations

Nokia Solutions and Networks, Espoo, Finland
Yoan Miche, Ian Oliver & Silke Holtmanns
Department of Mechanical and Industrial Engineering and The Iowa Informatics Initiative, The University of Iowa, Iowa City, USA
Anton Akusok & Amaury Lendasse
Arcada University of Applied Sciences, Helsinki, Finland
Kaj-Mikael Björk

Authors

Yoan Miche
View author publications
You can also search for this author in PubMed Google Scholar
Ian Oliver
View author publications
You can also search for this author in PubMed Google Scholar
Silke Holtmanns
View author publications
You can also search for this author in PubMed Google Scholar
Anton Akusok
View author publications
You can also search for this author in PubMed Google Scholar
Amaury Lendasse
View author publications
You can also search for this author in PubMed Google Scholar
Kaj-Mikael Björk
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Yoan Miche .

Editor information

Editors and Affiliations

Institute of Information and Contro, Hangzhou Dianzi University, Zhejiang, China
Jiuwen Cao
Nanyang Technological University, Singapore, Singapore
Kezhi Mao
ECE, U of Windsor, WINDSOR, Ontario, Canada
Jonathan Wu
Dept of Mechanical and Industrial Engg, University of Iowa, Iowa City, Iowa, USA
Amaury Lendasse

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Miche, Y., Oliver, I., Holtmanns, S., Akusok, A., Lendasse, A., Björk, KM. (2016). On Mutual Information over Non-Euclidean Spaces, Data Mining and Data Privacy Levels. In: Cao, J., Mao, K., Wu, J., Lendasse, A. (eds) Proceedings of ELM-2015 Volume 2. Proceedings in Adaptation, Learning and Optimization, vol 7. Springer, Cham. https://doi.org/10.1007/978-3-319-28373-9_32

Download citation

DOI: https://doi.org/10.1007/978-3-319-28373-9_32
Published: 03 January 2016
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-28372-2
Online ISBN: 978-3-319-28373-9
eBook Packages: EngineeringEngineering (R0)

Publish with us

Policies and ethics