Encyclopedia of Big Data Technologies

Living Edition
| Editors: Sherif Sakr, Albert Zomaya


  • Renata Ghisloti Duarte de Souza Granha
Living reference work entry
DOI: https://doi.org/10.1007/978-3-319-63962-8_36-1



Apache Hadoop is an open-source platform for storage and efficient processing of large datasets on a cluster of computers. The framework provides fault tolerance, high availability, and scalability, being able to process petabytes of data. Its principal components are MapReduce and HDFS.



Apache Hadoop is a distributed framework used to tackle Big Data. It is a software platform in a master/worker architecture with three main components: HDFS, YARN, and MapReduce. The HDFS (Hadoop Distributed File System) is an abstraction layer responsible for the storage of data. MapReduce is the data processing framework designed specifically to scale and run distributed. YARN (Yet Another Resource Negotiator) is a management platform responsible for handling resources in the cluster. Hadoops open-source software was written in Java and distributed under Apache license 2.0.

The Hadoop framework can be...

This is a preview of subscription content, log in to check access.


  1. Cutting D (2016) https://www.youtube.com/watch?v=Phjif53vAhM . Accessed 20 Oct 2017
  2. Dean J, Ghemawat S (2008) MapReduce: simplified data processing on large clusters. Commun ACM 51(1):107113.  https://doi.org/10.1145/1327452.1327492 CrossRefGoogle Scholar
  3. Ghemawat S, Gobioff H, Leung S-T (2003) The Google file system. SIGOPS Oper Syst Rev 37(5):2943.  https://doi.org/10.1145/1165389.945450 CrossRefGoogle Scholar
  4. Rajaraman A, Ullman JD (2011) Mining of massive datasets. Cambridge University Press, New York CrossRefGoogle Scholar
  5. White T (2015) Hadoop: the definitive guide, 4th edn. O’Reilly Media, Hadoop Google Scholar

Copyright information

© Springer International Publishing AG 2018

Authors and Affiliations

  • Renata Ghisloti Duarte de Souza Granha
    • 1
  1. 1.BoschChicagoUSA

Section editors and affiliations

  • Rodrigo N. Calheiros
    • 1
  • Marcos Dias de Assuncao
    • 2
  1. 1.School of Computing, Engineering and MathematicsWestern Sydney UniversityPenrithAustralia
  2. 2.Inria, LIP, ENS LyonLyonFrance