Overview
- Contains extensive coverage of machine-learning algorithms with real-time code implementation using Spark MLlib
- Explains the SparkR real-time module with code implementation
- Covers Spark Streaming and Spark Integration examples with other big data components such as Kafka
Table of contents (10 chapters)
About this book
On completion, you’ll have knowledge of the functional programming aspects of Scala, and hands-on expertise in various Spark components. You’ll also become familiar with machine learning algorithms with real-time usage.
What You Will Learn
- Discover the functional programming features of Scala
- Understand the complete architecture of Spark and its components
- Integrate Apache Spark with Hive and Kafka
- Use Spark SQL, DataFrames, and Datasets to process data using traditional SQL queries
- Work with different machine learning concepts and libraries using Spark's MLlib packages
Who This Book Is For
Developers and professionals who deal with batch and stream data processing.
About the authors
Subhashini Chellappan is a technology enthusiast with expertise in the big data and cloud space. She has rich experience in both academia and the software industry. Her areas of interest and expertise are centered on business intelligence, big data analytics and cloud computing.
Dharanitharan Ganesan is a senior analyst with five years of experience in IT. He has a high level of exposure and experience in big data – Apache Hadoop, Apache Spark and various Hadoop ecosystem components. He has a proven track record of improving efficiency and productivity through the automation of various routine and administrative functions in business intelligence and big data technologies. His areas of interest and expertise are centered on machine learning algorithms, statistical modelling and predictive analysis.
Bibliographic Information
Book Title: Practical Apache Spark
Book Subtitle: Using the Scala API
Authors: Subhashini Chellappan, Dharanitharan Ganesan
DOI: https://doi.org/10.1007/978-1-4842-3652-9
Publisher: Apress Berkeley, CA
eBook Packages: Professional and Applied Computing, Apress Access Books, Professional and Applied Computing (R0)
Copyright Information: Subhashini Chellappan, Dharanitharan Ganesan 2018
Softcover ISBN: 978-1-4842-3651-2, Published: 13 December 2018
eBook ISBN: 978-1-4842-3652-9, Published: 12 December 2018
Edition Number: 1
Number of Pages: XVI, 280
Number of Illustrations: 303 b/w illustrations
Topics: Big Data, Open Source, Programming Languages, Compilers, Interpreters