Secure Joins with MapReduce

  • Xavier Bultel
  • Radu Ciucanu
  • Matthieu GiraudEmail author
  • Pascal Lafourcade
  • Lihua Ye
Conference paper
Part of the Lecture Notes in Computer Science book series (LNCS, volume 11358)


MapReduce is one of the most popular programming paradigms that allows a user to process Big data sets. Our goal is to add privacy guarantees to the two standard algorithms of join computation for MapReduce: the cascade algorithm and the hypercube algorithm. We assume that the data is externalized in an honest-but-curious server and a user is allowed to query the join result. We design, implement, and prove the security of two approaches: (i) Secure-Private, assuming that the public cloud and the user do not collude, (ii) Collision-Resistant-Secure-Private, which resists to collusions between the public cloud and the user i.e., when the public cloud knows the secret key of the user.


Database query MapReduce Security Natural joins 



This research was conducted with the support of the FEDER program of 2014–2020, the region council of Auvergne-Rhône-Alpes, the support of the “Digital Trust” Chair from the University of Auvergne Foundation, the Indo-French Centre for the Promotion of Advanced Research (IFCPAR) and the Center Franco-Indien Pour La Promotion De La Recherche Avancée (CEFIPRA) through the project DST/CNRS 2015-03 under DST-INRIA-CNRS Targeted Programme.


  1. 1.
  2. 2.
  3. 3.
    Secure Joins with MapReduce - Technical report.
  4. 4.
    Afrati, F.N., Ullman, J.D.: Optimizing joins in a MapReduce environment. In: EDBT (2010)Google Scholar
  5. 5.
    Bellare, M., Rogaway, P.: Optimal asymmetric encryption. In: De Santis, A. (ed.) EUROCRYPT 1994. LNCS, vol. 950, pp. 92–111. Springer, Heidelberg (1995). Scholar
  6. 6.
    Blass, E.-O., Di Pietro, R., Molva, R., Önen, M.: PRISM – privacy-preserving search in MapReduce. In: Fischer-Hübner, S., Wright, M. (eds.) PETS 2012. LNCS, vol. 7384, pp. 180–200. Springer, Heidelberg (2012). Scholar
  7. 7.
    Bultel, X., Ciucanu, R., Giraud, M., Lafourcade, P.: Secure matrix multiplication with MapReduce. In: ARES (2017)Google Scholar
  8. 8.
    Chow, S.S.M., Lee, J., Subramanian, L.: Two-party computation model for privacy-preserving queries over distributed databases. In: NDSS (2009)Google Scholar
  9. 9.
    Chu, S., Balazinska, M., Suciu, D.: From theory to practice: efficient join query evaluation in a parallel database system. In: SIGMOD Conference (2015)Google Scholar
  10. 10.
    Daemen, J., Rijmen, V.: The Design of Rijndael: AES - The Advanced Encryption Standard. ISC. Springer, Heidelberg (2002). Scholar
  11. 11.
    Dean, J., Ghemawat, S.: MapReduce: simplified data processing on large clusters. In: OSDI (2004)Google Scholar
  12. 12.
    Derbeko, P., Dolev, S., Gudes, E., Sharma, S.: Security and privacy aspects in MapReduce on clouds: a survey. Comput. Sci. Rev. 20, 1–28 (2016)MathSciNetCrossRefGoogle Scholar
  13. 13.
    Dolev, S., Gilboa, N., Li, X.: Accumulating automata and cascaded equations automata for communicationless information theoretically secure multi-party computation: extended abstract. In: ASIACCS (2015)Google Scholar
  14. 14.
    Dolev, S., Li, Y., Sharma, S.: Private and secure secret shared MapReduce. In: DBSec (2016)Google Scholar
  15. 15.
    ElGamal, T.: A public key cryptosystem and a signature scheme based on discrete logarithms. In: Blakley, G.R., Chaum, D. (eds.) CRYPTO 1984. LNCS, vol. 196, pp. 10–18. Springer, Heidelberg (1985). Scholar
  16. 16.
    Emekçi, F., Agrawal, D., El Abbadi, A., Gulbeden, A.: Privacy preserving query processing using third parties. In: ICDE (2006)Google Scholar
  17. 17.
    Laur, S., Talviste, R., Willemson, J.: From oblivious AES to efficient and secure database join in the multiparty setting. In: ACNS (2013)Google Scholar
  18. 18.
    Leskovec, J., Rajaraman, A., Ullman, J.D.: Mining of Massive Datasets, 2nd edn. Cambridge University Press, Cambridge (2014)CrossRefGoogle Scholar
  19. 19.
    Lindell, Y. (ed.): Tutorials on the Foundations of Cryptography. Springer, Cham (2017). Scholar
  20. 20.
    Mayberry, T., Blass, E.-O., Chan, A.H.: PIRMAP: efficient private information retrieval for MapReduce. In: Sadeghi, A.-R. (ed.) FC 2013. LNCS, vol. 7859, pp. 371–385. Springer, Heidelberg (2013). Scholar
  21. 21.
    Popa, R.A., Redfield, C.M.S., Zeldovich, N., Balakrishnan, H.: CryptDB: protecting confidentiality with encrypted query processing. In: SOSP (2011)Google Scholar
  22. 22.
    Popa, R.A., Zeldovich, N.: Cryptographic treatment of CryptDB’s adjustable join (2012)Google Scholar
  23. 23.
    Shamir, A.: How to share a secret. Commun. ACM 22(11), 612–613 (1979)MathSciNetCrossRefGoogle Scholar
  24. 24.
    Vo-Huu, T.D., Blass, E.-O., Noubir, G.: EPiC: efficient privacy-preserving counting for MapReduce. In: Bouajjani, A., Fauconnier, H. (eds.) NETYS 2015. LNCS, vol. 9466, pp. 426–443. Springer, Cham (2015). Scholar

Copyright information

© Springer Nature Switzerland AG 2019

Authors and Affiliations

  • Xavier Bultel
    • 1
  • Radu Ciucanu
    • 2
  • Matthieu Giraud
    • 3
    Email author
  • Pascal Lafourcade
    • 3
  • Lihua Ye
    • 4
  1. 1.IRISA, Université de Rennes 1RennesFrance
  2. 2.INSA Centre Val de Loire, Univ. Orléans, LIFO EA 4022BourgesFrance
  3. 3.LIMOS, Université Clermont AuvergneClermont-FerrandFrance
  4. 4.Harbin Institute of TechnologyHarbinChina

Personalised recommendations