Advertisement

Big Data, Hadoop, and HDInsight

  • Vinit Yadav
Chapter

Abstract

Azure HDInsight is a managed Hadoop distribution, developed in partnership with Hortonworks and Microsoft. It uses the Hortonworks Data Platform (HDP) Hadoop distribution, which means that HDInsight is entirely Apache Hadoop on Azure. It deploys and provisions managed Apache Hadoop clusters in the cloud on Windows or Linux machines, which is a unique capability. It provides the Hadoop Distributed File System (HDFS) for reliable data storage. It uses the MapReduce programming model to process, analyze, and report on data stored in distributed file systems. Because it is a managed offering, within a few hours an enterprise can be up and running with a fully configured Hadoop cluster and other Hadoop ecosystem components, such HBase, Apache Spark, and Apache Storm.

Keywords

Data Block Client Application Data Node HDFSHadoop Distribute File System MapReduce Framework 
These keywords were added by machine and not by the authors. This process is experimental and the keywords may be updated as the learning algorithm improves.

Copyright information

©  Vinit Yadav 2017

Authors and Affiliations

  • Vinit Yadav
    • 1
  1. 1.AhmedabadIndia

Personalised recommendations