Advertisement

Loading Data into Hive

  • Scott Shaw
  • Andreas François Vermeulen
  • Ankur Gupta
  • David Kjerrumgaard
Chapter

Abstract

Let’s say you have built a data lake in your organization and one of the lines of business has requested for a new use case to be implemented, for example, a 360 view of the customer. When you consider the details of the use case, you find that analytics needs to occur on all the customer data residing in the existing operational systems, data warehouse, and on all new data getting generated from social media, customer service, and call centers, to get a complete picture of the customer. Hadoop, being a general-purpose, large-scale distributed processing platform, is quite suitable for this.

Copyright information

© Scott Shaw, Andreas Francois Vermeulen, Ankur Gupta, David Kjerrumgaard 2016

Authors and Affiliations

  • Scott Shaw
    • 1
  • Andreas François Vermeulen
    • 2
  • Ankur Gupta
    • 3
  • David Kjerrumgaard
    • 4
  1. 1.Saint LouisUSA
  2. 2.West Kilbride North AyrshireUK
  3. 3.UxbridgeUK
  4. 4.HendersonUSA

Personalised recommendations