Advertisement

Big Data Ingestion and Streaming Patterns

  • Nitin Sawant
  • Himanshu Shah
Chapter

Abstract

Traditional business intelligence (BI) and data warehouse (DW) solutions use structured data extensively. Database platforms such as Oracle, Informatica, and others had limited capabilities to handle and manage unstructured data such as text, media, video, and so forth, although they had a data type called CLOB and BLOB; which were used to store large amounts of text, and accessing data from these platforms was a problem. With the advent of multistructured (a.k.a. unstructured) data in the form of social media and audio/video, there has to be a change in the way data is ingested, preprocessed, validated, and/or cleansed and integrated or co-related with nontextual formats. This chapter deals with the following topics:

Keywords

Data Warehouse Unstructured Data Hadoop Distribute File System Remote Method Invocation Data Mart 
These keywords were added by machine and not by the authors. This process is experimental and the keywords may be updated as the learning algorithm improves.

Copyright information

© Nitin Sawant 2013

Authors and Affiliations

  • Nitin Sawant
    • 1
  • Himanshu Shah
    • 1
  1. 1.MaharastraIndia

Personalised recommendations