
Machine Learning Models and Algorithms for Big Data Classification

Thinking with Examples for Effective Learning

  • Book
  • © 2016

Overview

  • Addresses a new and hot field of Big Data Science and Engineering
  • Offers new Machine Learning techniques and solutions
  • Provides solutions to overcome Big Data classification problems that industries, government agencies and organizations struggle to manage and analyze
  • Includes supplementary material: sn.pub/extras

Part of the book series: Integrated Series in Information Systems (ISIS, volume 36)

Table of contents (14 chapters)

  1. Understanding Big Data

  2. Understanding Big Data Systems

  3. Understanding Machine Learning

  4. Understanding Scaling-Up Machine Learning

About this book

This book presents machine learning models and algorithms to address big data classification problems. Existing machine learning techniques such as the decision tree (a hierarchical approach), the random forest (an ensemble hierarchical approach), and deep learning (a layered approach) are well suited to systems that can handle such problems. The book helps readers, especially students and newcomers to big data and machine learning, gain a quick understanding of the techniques and technologies. To that end, the theory, examples, and programs (Matlab and R) presented in the book have been simplified, hardcoded, repeated, or spaced for improvement. They provide vehicles to test and understand the complicated concepts of various topics in the field. Readers are expected to adopt these programs to experiment with the examples, and then to modify them or write their own programs, advancing their knowledge toward solving more complex and challenging problems.

The presentation format of this book focuses on simplicity, readability, and dependability, so that undergraduate and graduate students as well as new researchers, developers, and practitioners in this field can easily trust and grasp the concepts and learn them effectively. It has been written to reduce the mathematical complexity and to help the vast majority of readers understand the topics and become interested in the field. The book consists of four parts, with a total of 14 chapters. The first part focuses on the topics needed to analyze and understand data and big data. The second part covers the topics that explain the systems required for processing big data. The third part presents the topics required to understand and select machine learning techniques for classifying big data. Finally, the fourth part concentrates on the topics that explain scaling-up machine learning, an important solution for modern big data problems.

Reviews

“It provides a readable, technical, description of the whole area of machine learning applied on big data that can be understood and enjoyed by students and researchers from many areas of computer science, statistics, biology and chemistry who are seeking to understand how these new technologies can benefit their special areas. … Overall, this is an excellent introduction to the main ideas for using machine learning algorithms for big data classification.” (Smaranda Belciug, zbMATH 1409.68004, 2019)

“This book is a good introduction to machine learning models for big data classification … . Typical of a Springer book, this one is concise, clear, and well organized. … each chapter contains programming examples and references … . this book is useful if you want to know more about machine learning models and algorithms for big data classification.” (J. Myerson, Computing Reviews, February, 2016)

Authors and Affiliations

  • Department of Computer Science, UNC Greensboro, Greensboro, USA

    Shan Suthaharan

About the author

Shan Suthaharan is a Professor of Computer Science at the University of North Carolina at Greensboro (UNCG), North Carolina, USA. He also serves as the Director of Undergraduate Studies at the Department of Computer Science at UNCG. He has more than twenty-five years of university teaching and administrative experience, and has taught both undergraduate and graduate courses. His aspiration is to educate and train students so that they can prosper in the computer field by understanding current real-world and complex problems, and develop efficient techniques and technologies. His current teaching interests include big data analytics and machine learning, cryptography and network security, and computer networking and analysis. He earned his doctorate in Computer Science from Monash University, Australia. Since then, he has been actively working on disseminating his knowledge and experience through teaching, advising, seminars, research, and publications.

Dr. Suthaharan enjoys investigating real-world, complex problems, and developing and implementing algorithms to solve those problems using modern technologies. The main theme of his current research is signature discovery and event detection for a secure and reliable environment. The ultimate goal of his research is to build a secure and reliable environment using modern and emerging technologies. His current research primarily focuses on the characterization and detection of environmental events, the exploration of machine learning techniques, and the development of advanced statistical and computational techniques to discover key signatures and detect emerging events from structured and unstructured big data. Dr. Suthaharan has authored or co-authored more than seventy-five research papers in the areas of computer science, and published them in international journals and refereed conference proceedings.

He also invented a key management and encryption technology, which has been patented in Australia, Japan, and Singapore. He received visiting scholar awards from, and served as a visiting researcher at, the University of Sydney, Australia; the University of Melbourne, Australia; and the University of California, Berkeley, USA. He was a senior member of the Institute of Electrical and Electronics Engineers, and volunteered as an elected chair of the Central North Carolina Section twice. He is a member of Sigma Xi, the Scientific Research Society, and a Fellow of the Institution of Engineering and Technology.

Bibliographic Information

  • Book Title: Machine Learning Models and Algorithms for Big Data Classification

  • Book Subtitle: Thinking with Examples for Effective Learning

  • Authors: Shan Suthaharan

  • Series Title: Integrated Series in Information Systems

  • DOI: https://doi.org/10.1007/978-1-4899-7641-3

  • Publisher: Springer New York, NY

  • eBook Packages: Business and Management, Business and Management (R0)

  • Copyright Information: Springer Science+Business Media New York 2016

  • Hardcover ISBN: 978-1-4899-7640-6 (Published: 21 October 2015)

  • Softcover ISBN: 978-1-4899-7852-3 (Published: 23 August 2016)

  • eBook ISBN: 978-1-4899-7641-3 (Published: 20 October 2015)

  • Series ISSN: 1571-0270

  • Series E-ISSN: 2197-7968

  • Edition Number: 1

  • Number of Pages: XIX, 359

  • Number of Illustrations: 67 b/w illustrations, 82 illustrations in colour

  • Topics: Management, Database Management, Artificial Intelligence
