Practical Data Science

A Guide to Building the Technology Stack for Turning Data Lakes into Business Assets

  • Andreas François Vermeulen

About this book


Learn how to build a data science technology stack and perform good data science with repeatable methods. You will learn how to turn data lakes into business assets.

The data science technology stack demonstrated in Practical Data Science is built from components in general use in the industry. Data scientist Andreas Vermeulen demonstrates in detail how to build and provision a technology stack that yields repeatable results. He shows you how to apply practical methods to extract actionable business knowledge from data lakes that hold polyglot data of many types and dimensions.

What You'll Learn:
  • Become fluent in the essential concepts and terminology of data science and data engineering 
  • Build and use a technology stack that meets industry criteria
  • Master the methods for retrieving actionable business knowledge
  • Coordinate the handling of polyglot data types in a data lake for repeatable results

Keywords

data science; polyglot data science; data engineering; data lake; data vault and data mart; data warehouse bus matrix; data scrubbing techniques; data science technology stack; actionable business knowledge; Spark, Mesos, Akka, Cassandra, Kafka, Elasticsearch, R; machine-to-machine; machine learning; IoT and embedded systems; fog computing; MQTT; graph database; super steps of the functional layer; grids and clusters; torus network

Authors and affiliations

  • Andreas François Vermeulen (1)
  1. West Kilbride, North Ayrshire, United Kingdom

Bibliographic information

  • DOI
  • Copyright Information: Andreas François Vermeulen 2018
  • Publisher Name: Apress, Berkeley, CA
  • eBook Packages: Professional and Applied Computing
  • Print ISBN: 978-1-4842-3053-4
  • Online ISBN: 978-1-4842-3054-1