Privacy-Preserving SVM Classification on Vertically Partitioned Data

Yu, Hwanjo; Vaidya, Jaideep; Jiang, Xiaoqian

doi:10.1007/11731139_74

Hwanjo Yu²²,
Jaideep Vaidya²³ &
Xiaoqian Jiang²²

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 3918))

Included in the following conference series:

Pacific-Asia Conference on Knowledge Discovery and Data Mining

3394 Accesses
106 Citations

Abstract

Classical data mining algorithms implicitly assume complete access to all data, either in centralized or federated form. However, privacy and security concerns often prevent sharing of data, thus derailing data mining projects. Recently, there has been growing focus on finding solutions to this problem. Several algorithms have been proposed that do distributed knowledge discovery, while providing guarantees on the non-disclosure of data. Classification is an important data mining problem applicable in many diverse domains. The goal of classification is to build a model which can predict an attribute (binary attribute in this work) based on the rest of attributes. We propose an efficient and secure privacy-preserving algorithm for support vector machine (SVM) classification over vertically partitioned data.

This research was supported in part by a Faculty Research Grant from Rutgers Business School – Newark and New Brunswick.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 84.99; Price excludes VAT (USA)

Softcover Book: USD 109.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

References

Standard for privacy of individually identifiable health information. Federal Register 66(40) (Febraury 28 2001)
Google Scholar
Vapnik, V.N.: Statistical Learning Theory. John Wiley and Sons, Chichester (1998)
MATH Google Scholar
Fung, G., Mangasarian, O.L.: Proximal support vector machine classifiers. In: Proc. ACM SIGKDD Int. Conf. Knowledge Discovery and Data Mining, KDD 2001 (2001)
Google Scholar
Sweeney, L., Shamos, M.: A multiparty computation for randomly ordering players and making random selections. Tech. Rep. CMU-ISRI-04-126, Carnegie Mellon University (2004)
Google Scholar
Yu, H., Vaidya, J.: Secure matrix addition. Tech. Rep., UIOWA Technical Report UIOWA-CS-04-04 (2004), http://hwanjoyu.org/paper/techreport04-04.pdf
Yao, A.C.: How to generate and exchange secrets. In: Proceedings of the 27th IEEE Symposium on Foundations of Computer Science, pp. 162–167. IEEE, Los Alamitos (1986)
Google Scholar
Goldreich, O., Micali, S., Wigderson, A.: How to play any mental game - a completeness theorem for protocols with honest majority. In: ACM Symp. on the Theory of Computing (1987)
Google Scholar
Agrawal, R., Srikant, R.: Privacy-preserving data mining. In: Proceedings of the 2000 ACM SIGMOD Conference on Management of Data (2000)
Google Scholar
Kargupta, H., Datta, S., Wang, Q., Sivakumar, K.: On the privacy preserving properties of random data perturbation techniques. In: Proceedings of the Third IEEE International Conference on Data Mining, ICDM 2003 (2003)
Google Scholar
Huang, Z., Du, W., Chen, B.: Deriving private information from randomized data. In: Proc. of ACM SIGMOD Int. Conf. Management of data (2005)
Google Scholar
Lindell, Y., Pinkas, B.: Privacy preserving data mining. Journal of Cryptology 15(3), 177–206 (2002)
Article MathSciNet MATH Google Scholar
Verykios, V.S., Bertino, E., Fovino, I.N., Provenza, L.P., Saygin, Y.: State-of-the-art in privacy preserving data mining. SIGMOD Record 33(1), 50–57 (2004)
Article Google Scholar
Aggarwal, C.C., Yu, P.S.: A condensation approach to privacy preserving data mining. In: Bertino, E., Christodoulakis, S., Plexousakis, D., Christophides, V., Koubarakis, M., Böhm, K., Ferrari, E. (eds.) EDBT 2004. LNCS, vol. 2992, pp. 183–199. Springer, Heidelberg (2004)
Chapter Google Scholar
Oliveira, S.R.M., Zaiane, O.R.: Privacy preserving clustering by data transformation. In: SBBD 2004 (2004)
Google Scholar
Vaidya, J., Clifton, C.: Secure set intersection cardinality with application to association rule mining. Journal of Computer Security (to appear)
Google Scholar
Lin, X., Clifton, C., Zhu, M.: Privacy preserving clustering with distributed EM mixture modeling. Knowledge and Information Systems (to appear 2004)
Google Scholar
Vaidya, J., Clifton, C.: Privacy-preserving k-means clustering over vertically partitioned data. In: ACM SIGKDD Int. Conf. on Knowledge Discovery and Data Mining (2003)
Google Scholar
Vaidya, J., Clifton, C.: Privacy preserving naıve bayes classifier for vertically partitioned data. In: 2004 SIAM International Conference on Data Mining (2004)
Google Scholar
Karr, A.F., Lin, X., Sanil, A.P., Reiter, J.P.: Secure regressions on distributed databases. Journal of Computational and Graphical Statistics (2005)
Google Scholar
Sanil, A.P., Karr, A.F., Lin, X., Reiter, J.P.: Privacy preserving regression modeling via distributed computation. In: ACM SIGKDD Int. Conf. Knowledge discovery and data mining (2004)
Google Scholar
Yu, H., Jiang, X., Vaidya, J.: Privacy-preserving SVM using nonlinear kernels on horizontally partitioned data. In: Proc. ACM SAC Conf. Data Mining Track (2006)
Google Scholar
Poulet, F.: Multi-way distributed SVM. In: Proc. European Conf. Machine Learning, ECML 2003 (2003)
Google Scholar

Download references

Author information

Authors and Affiliations

University of Iowa, Iowa City, IA, 08544, USA
Hwanjo Yu & Xiaoqian Jiang
Rutgers University, Newark, NJ, 07102, USA
Jaideep Vaidya

Authors

Hwanjo Yu
View author publications
You can also search for this author in PubMed Google Scholar
Jaideep Vaidya
View author publications
You can also search for this author in PubMed Google Scholar
Xiaoqian Jiang
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

Nanyang Technological University, Singapore
Wee-Keong Ng
Institute of Industrial Science, The University of Tokyo, 4-6-1 Komaba, Meguro-ku, 153-8505, Tokyo, Japan
Masaru Kitsuregawa
School of Computer Science and Technology, Heilongjiang University, China
Jianzhong Li
School of Computer Engineering, Nanyang Technological University, 639798, Singapore, Singapore
Kuiyu Chang

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Yu, H., Vaidya, J., Jiang, X. (2006). Privacy-Preserving SVM Classification on Vertically Partitioned Data. In: Ng, WK., Kitsuregawa, M., Li, J., Chang, K. (eds) Advances in Knowledge Discovery and Data Mining. PAKDD 2006. Lecture Notes in Computer Science(), vol 3918. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11731139_74

Download citation

DOI: https://doi.org/10.1007/11731139_74
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-33206-0
Online ISBN: 978-3-540-33207-7
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics