Overview

Authors:

Shan Suthaharan ⁰

Shan Suthaharan
1. Department of Computer Science, UNC Greensboro, Greensboro, USA
View author publications

You can also search for this author in PubMed Google Scholar

Addresses a new and hot field of Big Data Science and Engineering
Offers new Machine Learning techniques and solutions
Provides solutions to overcome Big Data classification problems that industries, government agencies and organizations struggle to manage and analyze
Includes supplementary material: sn.pub/extras

Part of the book series: Integrated Series in Information Systems (ISIS, volume 36)

202k Accesses
436 Citations

This is a preview of subscription content, log in via an institution to check access.

Access this book

eBook USD 129.00

Price excludes VAT (USA)

Softcover Book USD 169.99

Price excludes VAT (USA)

Hardcover Book USD 169.99

Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Other ways to access

Licence this eBook for your library

Institutional subscriptions

Table of contents (14 chapters)

Front Matter

Pages i-xix

Download chapter PDF
Science of Information
- Shan Suthaharan
Pages 1-13
Understanding Big Data
1. Front Matter
  
  Pages 15-15
  
  Download chapter PDF
2. Big Data Essentials
  
  Shan Suthaharan
  
  Pages 17-29
3. Big Data Analytics
  
  Shan Suthaharan
  
  Pages 31-75
Understanding Big Data Systems
1. Front Matter
  
  Pages 77-77
  
  Download chapter PDF
2. Distributed File System
  
  Shan Suthaharan
  
  Pages 79-97
3. MapReduce Programming Platform
  
  Shan Suthaharan
  
  Pages 99-119
Understanding Machine Learning
1. Front Matter
  
  Pages 121-121
  
  Download chapter PDF
2. Modeling and Algorithms
  
  Shan Suthaharan
  
  Pages 123-143
3. Supervised Learning Models
  
  Shan Suthaharan
  
  Pages 145-181
4. Supervised Learning Algorithms
  
  Shan Suthaharan
  
  Pages 183-206
5. Support Vector Machine
  
  Shan Suthaharan
  
  Pages 207-235
6. Decision Tree Learning
  
  Shan Suthaharan
  
  Pages 237-269
Understanding Scaling-Up Machine Learning
1. Front Matter
  
  Pages 271-271
  
  Download chapter PDF
2. Random Forest Learning
  
  Shan Suthaharan
  
  Pages 273-288
3. Deep Learning Models
  
  Shan Suthaharan
  
  Pages 289-307
4. Chandelier Decision Tree
  
  Shan Suthaharan
  
  Pages 309-328
5. Dimensionality Reduction
  
  Shan Suthaharan
  
  Pages 329-355
Back Matter

Pages 357-359

Download chapter PDF

Keywords

About this book

This book presents machine learning models and algorithms to address big data classification problems. Existing machine learning techniques like the decision tree (a hierarchical approach), random forest (an ensemble hierarchical approach), and deep learning (a layered approach) are highly suitable for the system that can handle such problems. This book helps readers, especially students and newcomers to the field of big data and machine learning, to gain a quick understanding of the techniques and technologies; therefore, the theory, examples, and programs (Matlab and R) presented in this book have been simplified, hardcoded, repeated, or spaced for improvements. They provide vehicles to test and understand the complicated concepts of various topics in the field. It is expected that the readers adopt these programs to experiment with the examples, and then modify or write their own programs toward advancing their knowledge for solving more complex and challenging problems.

The presentation format of this book focuses on simplicity, readability, and dependability so that both undergraduate and graduate students as well as new researchers, developers, and practitioners in this field can easily trust and grasp the concepts, and learn them effectively. It has been written to reduce the mathematical complexity and help the vast majority of readers to understand the topics and get interested in the field. This book consists of four parts, with the total of 14 chapters. The first part mainly focuses on the topics that are needed to help analyze and understand data and big data. The second part covers the topics that can explain the systems required for processing big data. The third part presents the topics required to understand and select machine learning techniques to classify big data. Finally, the fourth part concentrates on the topics that explain the scaling-up machine learning, an important solution for modern big data problems.

Reviews

“It provides a readable, technical, description of the whole area of machine learning applied on big data that can be understood and enjoyed by students and researchers from many areas of computer science, statistics, biology and chemistry who are seeking to understand how these new technologies can benefit their special areas. … Overall, this is an excellent introduction to the main ideas for using machine learning algorithms for big data classification.” (Smaranda Belciug, zbMATH 1409.68004, 2019)

“This book is a good introduction to machine learning models for big data classification … . Typical of a Springer book, this one is concise,clear, and well organized. … each chapter contains programming examples and references … . this book is useful if you want to know more about machine learning models and algorithms for big data classification.” (J. Myerson, Computing Reviews, February, 2016)

Authors and Affiliations

Department of Computer Science, UNC Greensboro, Greensboro, USA

Shan Suthaharan

About the author

Shan Suthaharan is a Professor of Computer Science at the University of North Carolina at Greensboro (UNCG), North Carolina, USA. He also serves as the Director of Undergraduate Studies at the Department of Computer Science at UNCG. He has more than twenty-five years of university teaching and administrative experience, and has taught both undergraduate and graduate courses. His aspiration is to educate and train students so that they can prosper in the computer field by understanding current real-world and complex problems, and develop efficient techniques and technologies. His current teaching interests include big data analytics and machine learning, cryptography and network security, and computer networking and analysis. He earned his doctorate in Computer Science from Monash University, Australia. Since then, he has been actively working on disseminating his knowledge and experience through teaching, advising, seminars, research, and publications. Dr. Suthaharan enjoys investigating real-world, complex problems, and developing and implementing algorithms to solve those problems using modern technologies. The main theme of his current research is the signature discovery and event detection for a secure and reliable environment. The ultimate goal of his research is to build a secure and reliable environment using modern and emerging technologies. His current research primarily focuses on the characterization and detection of environmental events, the exploration of machine learning techniques, and the development of advanced statistical and computational techniques to discover key signatures and detect emerging events from structured and unstructured big data. Dr. Suthaharan has authored or co-authored more than seventy-five research papers in the areas of computer science, and published them in international journals and referred conference proceedings. He also invented a key management and encryption technology, which has been patented in Australia, Japan, and Singapore. He also received visiting scholar awards from and served as a visiting researcher at the University of Sydney, Australia; the University of Melbourne, Australia; and the University of California, Berkeley, USA. He was a senior member of the Institute of Electrical and Electronics Engineers, and volunteered as an elected chair of the Central North Carolina Section twice. He is a member of Sigma Xi, the Scientific Research Society, and a Fellow of the Institution of Engineering and Technology.

Bibliographic Information

Book Title: Machine Learning Models and Algorithms for Big Data Classification
Book Subtitle: Thinking with Examples for Effective Learning
Authors: Shan Suthaharan
Series Title: Integrated Series in Information Systems
DOI: https://doi.org/10.1007/978-1-4899-7641-3
Publisher: Springer New York, NY
eBook Packages: Business and Management, Business and Management (R0)
Copyright Information: Springer Science+Business Media New York 2016
Hardcover ISBN: 978-1-4899-7640-6Published: 21 October 2015
Softcover ISBN: 978-1-4899-7852-3Published: 23 August 2016
eBook ISBN: 978-1-4899-7641-3Published: 20 October 2015
Series ISSN: 1571-0270
Series E-ISSN: 2197-7968
Edition Number: 1
Number of Pages: XIX, 359
Number of Illustrations: 67 b/w illustrations, 82 illustrations in colour
Topics: Management, Database Management, Artificial Intelligence

Publish with us

Policies and ethics

Machine Learning Models and Algorithms for Big Data Classification

Overview

Access this book

Other ways to access

Table of contents (14 chapters)

Front Matter

Understanding Big Data

Front Matter

Understanding Big Data Systems

Front Matter

Understanding Machine Learning

Front Matter

Understanding Scaling-Up Machine Learning

Front Matter

Back Matter

Keywords

About this book

Reviews

Authors and Affiliations

Department of Computer Science, UNC Greensboro, Greensboro, USA

About the author

Bibliographic Information

Publish with us

Search

Navigation