Querying Data with Hive

  • Vinit Yadav


Hive is probably the most used tool in the Hadoop ecosystem. To work with Hadoop data, you need to write MapReduce jobs that are not convenient for ad hoc queries. Hive comes to the rescue by providing a SQL-like query language, which internally transforms the query to MapReduce jobs. In HDInsight, Hive sits on top of Azure Blob storage data and provides interactive queries to work with data. Hive can work with structured and semi-structured data. Hive resides on top of a YARN layer and makes use of all the resource negotiations that YARN does. Internally, it uses MapReduce, Tez, or the Spark execution engine.


Query Performance Visual Studio Execution Engine User Define Function Movie Data 
These keywords were added by machine and not by the authors. This process is experimental and the keywords may be updated as the learning algorithm improves.

Copyright information

©  Vinit Yadav 2017

Authors and Affiliations

  • Vinit Yadav
    • 1
  1. 1.AhmedabadIndia

Personalised recommendations