Skip to main content

On the Computational Complexity of MapReduce

  • Conference paper
  • First Online:
Distributed Computing (DISC 2015)

Part of the book series: Lecture Notes in Computer Science ((LNTCS,volume 9363))

Included in the following conference series:

Abstract

In this paper we study the MapReduce Class (MRC) defined by Karloff et al., which is a formal complexity-theoretic model of MapReduce. We show that constant-round MRC computations can decide regular languages and simulate sublogarithmic space-bounded Turing machines. In addition, we prove hierarchy theorems for MRC under certain complexity-theoretic assumptions. These theorems show that sufficiently increasing the number of rounds or the amount of time per processor strictly increases the computational power of MRC. Our work lays the foundation for further analysis relating MapReduce to established complexity classes. Our results also hold for Valiant’s BSP model of parallel computation and the MPC model of Beame et al.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 39.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Andoni, A., Nikolov, A., Onak, K., Yaroslavtsev, G.: Parallel algorithms for geometric graph problems. In: STOC, pp. 574–583 (2014)

    Google Scholar 

  2. Beame, P., Koutris, P., Suciu, D.: Communication steps for parallel query processing. In: PODS, pp. 273–284 (2013)

    Google Scholar 

  3. Chu, C.-T., Kim, S.K., Lin, Y.-A., Yu, Y., Bradski, G.R., Ng, A.Y., Olukotun, K.: Map-reduce for machine learning on multicore. In: NIPS, pp. 281–288 (2006)

    Google Scholar 

  4. Dean, J., Ghemawat, S.: Mapreduce: simplified data processing on large clusters. Commun. ACM 51(1), 107–113 (2008)

    Article  Google Scholar 

  5. Farahat, A.K., Elgohary, A., Ghodsi, A., Kamel, M.S.: Distributed column subset selection on mapreduce. In: ICDM, pp. 171–180 (2013)

    Google Scholar 

  6. Feldman, J., Muthukrishnan, S., Sidiropoulos, A., Stein, C., Svitkina, Z.: On distributing symmetric streaming computations. ACM Transactions on Algorithms, 6(4) (2010)

    Google Scholar 

  7. Fortnow, L.: Time-space tradeoffs for satisfiability. J. Comput. Syst. Sci. 60(2), 337–353 (2000)

    Article  MATH  MathSciNet  Google Scholar 

  8. Goodrich, M.T., Sitchinava, N., Zhang, Q.: Sorting, searching, and simulation in the mapreduce framework. In: Asano, T., Nakano, S., Okamoto, Y., Watanabe, O. (eds.) ISAAC 2011. LNCS, vol. 7074, pp. 374–383. Springer, Heidelberg (2011)

    Chapter  Google Scholar 

  9. Impagliazzo, R., Paturi, R.: The complexity of k-sat. In: 2012 IEEE 27th Conference on Computational Complexity, p. 237 (1999)

    Google Scholar 

  10. Impagliazzo, R., Paturi, R., Zane, F.: Which problems have strongly exponential complexity? J. Comput. Syst. Sci. 63(4), 512–530 (2001)

    Article  MATH  MathSciNet  Google Scholar 

  11. Kamara, S., Raykova, M.: Parallel homomorphic encryption. In: Financial Cryptography Workshops, pp. 213–225 (2013)

    Google Scholar 

  12. Karloff, H., Suri, S., Vassilvitskii, S.: A model of computation for mapreduce. In: SODA 2010, pp. 938–948. Society for Industrial and Applied Mathematics, Philadelphia (2010)

    Google Scholar 

  13. Kumar, R., Moseley, B., Vassilvitskii, S., Vattani, A.: Fast greedy algorithms in mapreduce and streaming. In: SPAA 2013, pp. 1–10. ACM, New York (2013)

    Google Scholar 

  14. Lokshtanov, D., Marx, D., Saurabh, S.: Lower bounds based on the exponential time hypothesis. Bulletin of the EATCS 105, 41–72 (2011)

    MATH  MathSciNet  Google Scholar 

  15. Pace, M.F.: BSP vs mapreduce. In: Proceedings of the International Conference on Computational Science, ICCS 2012, Omaha, Nebraska, USA, June 4–6, 2012, pp. 246–255 (2012)

    Google Scholar 

  16. Sarma, A.D., Afrati, F.N., Salihoglu, S., Ullman, J.D.: Upper and lower bounds on the cost of a map-reduce computation. In: PVLDB 2013, pp. 277–288. VLDB Endowment (2013)

    Google Scholar 

  17. Shepherdson, J.C.: The reduction of two-way automata to one-way automata. IBM J. Res. Dev. 3(2), 198–200 (1959)

    Article  MathSciNet  Google Scholar 

  18. Shvachko, K., Kuang, H., Radia, S., Chansler, R.: The hadoop distributed file system. In: Khatib, M.G., He, X., Factor, M. (eds.) MSST, pp. 1–10. IEEE Computer Society (2010)

    Google Scholar 

  19. Szepietowski, A.: Turing machines with sublogarithmic space. Ernst Schering Research Foundation Workshops. Springer (1994)

    Google Scholar 

  20. Valiant, L.G.: A bridging model for parallel computation. Commun. ACM 33(8), 103–111 (1990)

    Article  Google Scholar 

  21. Wagner, K., Wechsung, G.: Computational Complexity. Mathematics and itsApplications. Springer (1986)

    Google Scholar 

  22. Williams, R.: Time-space tradeoffs for counting NP solutions modulo integers. Computational Complexity 17(2), 179–219 (2008)

    Article  MATH  MathSciNet  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Jeremy Kun .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2015 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Fish, B., Kun, J., Lelkes, Á.D., Reyzin, L., Turán, G. (2015). On the Computational Complexity of MapReduce. In: Moses, Y. (eds) Distributed Computing. DISC 2015. Lecture Notes in Computer Science(), vol 9363. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-662-48653-5_1

Download citation

  • DOI: https://doi.org/10.1007/978-3-662-48653-5_1

  • Published:

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-662-48652-8

  • Online ISBN: 978-3-662-48653-5

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics