Working with Data in HDInsight

  • Vinit Yadav


Azure Blob storage is the default and preferred way to store data in HDInsight. HDInsight supports the Hadoop distributed file system (HDFS) as well as Azure Blob storage for storing data. This chapter covers uploading data to Blob storage and executing MapReduce jobs on it. It starts with different command-line utilities to upload data and looks at a couple of graphical clients. You’ll create your first MapReduce job and execute it using PowerShell. Also, you’ll look at .NET SDK to create and execute job on HDInsight. And finally, you’ll learn about Avro serialization.


Hadoop Distribute File System Storage Account Cluster Credential Azure Storage Console Application 
These keywords were added by machine and not by the authors. This process is experimental and the keywords may be updated as the learning algorithm improves.

Copyright information

©  Vinit Yadav 2017

Authors and Affiliations

  • Vinit Yadav
    • 1
  1. 1.AhmedabadIndia

Personalised recommendations