Advertisement

© 2016

Big Data SMACK

A Guide to Apache Spark, Mesos, Akka, Cassandra, and Kafka

  • The first book presenting the SMACK stack

  • A practical guide teaching how to incorporate big data

  • Covers the full stack of big data architecture, discussing the practical benefits of each technology

Book

Table of contents

  1. Front Matter
    Pages i-xxv
  2. Introduction

    1. Front Matter
      Pages 1-1
    2. Raul Estrada, Isaac Ruiz
      Pages 3-7
    3. Raul Estrada, Isaac Ruiz
      Pages 9-16
  3. Playing SMACK

    1. Front Matter
      Pages 17-17
    2. Raul Estrada, Isaac Ruiz
      Pages 19-40
    3. Raul Estrada, Isaac Ruiz
      Pages 41-66
    4. Raul Estrada, Isaac Ruiz
      Pages 67-95
    5. Raul Estrada, Isaac Ruiz
      Pages 97-130
    6. Raul Estrada, Isaac Ruiz
      Pages 131-163
    7. Raul Estrada, Isaac Ruiz
      Pages 165-203
  4. Improving SMACK

    1. Front Matter
      Pages 205-205
    2. Raul Estrada, Isaac Ruiz
      Pages 207-224
    3. Raul Estrada, Isaac Ruiz
      Pages 225-250
    4. Raul Estrada, Isaac Ruiz
      Pages 251-258
  5. Back Matter
    Pages 259-264

About this book

Introduction

Integrate full-stack open-source fast data pipeline architecture and choose the correct technology—Spark, Mesos, Akka, Cassandra, and Kafka (SMACK)—in every layer. Fast data is becoming a requirement for many enterprises. So far, however, the focus has largely been on collecting, aggregating, and crunching large data sets in a timely manner. In many cases organizations need more than one paradigm to perform efficient analyses.

Big Data SMACK explains each technology and, more importantly, how to integrate them. It provides detailed coverage of the practical benefits of these technologies and incorporates real-world examples. The book focuses on the problems and scenarios solved by the architecture, as well as the solutions provided by each technology. This book covers the five main concepts of data pipeline architecture and how to integrate, replace, and reinforce every layer:

  • The engine: Apache Spark
  • The container: Apache Mesos
  • The model: Akka<
  • The storage: Apache Cassandra
  • The broker: Apache Kafka

Keywords

Big Data Scala Akka Apache Spark Apache Mesos Docker Apache Cassandra Apache Kafka Hadoop No-SQL databases

Authors and affiliations

  1. 1.Mexico CityMexico
  2. 2.Mexico CityMexico

About the authors

Raúl Estrada is the co-founder of Treu Technologies, an enterprise for Social Data Marketing and BigData research. He is an Enterprise Architect with more than 15 years of experience in cluster management and Enterprise Software. Prior to founding Treu Technologies, Estrada worked as an Enterprise Architect in Application Servers & evangelist for Oracle Inc. He loves functional languages like Elixir and Scala, and also has a Master of Computer Science degree.

Isaac Ruiz has been a Java programmer since 2001, and a consultant and architect since 2003. He has participated in projects of different areas and varied scopes (education, communications, retail, and others). Ruiz specializes in systems integration and has participated in projects mainly related to the financial sector. He is a supporter of free software. Ruiz likes to experiment with new technologies (frameworks, languages, methods).

Bibliographic information