Standard Toolkits for Hadoop and Analytics

  • Kerry Koitzsch


In this chapter, we take a look at the necessary ingredients for a BDA system: the standard libraries and toolkits most useful for building BDAs. We describe an example system (which we develop throughout the remainder of the book) using standard toolkits from the Hadoop and Spark ecosystems. We also use other analytical toolkits, such as R and Weka, with mainstream development components such as Ant, Maven, npm, pip, Bower, and other system building tools. "Glueware components" such as Apache Camel, Spring Framework, Spring Data, Apache Kafka, Apache Tika, and others can be used to create a Hadoop-based system appropriate for a variety of applications.


Hadoop MapReduce Spring Data Large Scale Data Analysis Online Analytical Processing Functional Resource 
These keywords were added by machine and not by the authors. This process is experimental and the keywords may be updated as the learning algorithm improves.

Copyright information

© Kerry Koitzsch 2017

Authors and Affiliations

  • Kerry Koitzsch
    • 1
  1. 1.SunnyvaleUSA

Personalised recommendations