Skip to main content
  • Textbook
  • © 2018

Neural Networks and Deep Learning

A Textbook

  • This book covers the theory and algorithms of deep learning and it provides detailed discussions of the relationships of neural networks with traditional machine learning algorithms.

  • The mathematical aspects are concretely presented without losing accessibility.

  • The book is written in a textbook style, and it includes exercises, a solution manual, and instructor slides. The depth and breadth of coverage are unique to the book.

  • Includes supplementary material: sn.pub/extras

  • Request lecturer material: sn.pub/lecturer-material

Buying options

eBook USD 59.99
Price excludes VAT (USA)
  • ISBN: 978-3-319-94463-0
  • Instant PDF download
  • Readable on all devices
  • Own it forever
  • Exclusive offer for individuals only
  • Tax calculation will be finalised during checkout
Softcover Book USD 79.99
Price excludes VAT (USA)
Hardcover Book USD 69.99
Price excludes VAT (USA)

This is a preview of subscription content, access via your institution.

Table of contents (10 chapters)

  1. Front Matter

    Pages I-XXIII
  2. An Introduction to Neural Networks

    • Charu C. Aggarwal
    Pages 1-52
  3. Machine Learning with Shallow Neural Networks

    • Charu C. Aggarwal
    Pages 53-104
  4. Training Deep Neural Networks

    • Charu C. Aggarwal
    Pages 105-167
  5. Teaching Deep Learners to Generalize

    • Charu C. Aggarwal
    Pages 169-216
  6. Radial Basis Function Networks

    • Charu C. Aggarwal
    Pages 217-233
  7. Restricted Boltzmann Machines

    • Charu C. Aggarwal
    Pages 235-270
  8. Recurrent Neural Networks

    • Charu C. Aggarwal
    Pages 271-313
  9. Convolutional Neural Networks

    • Charu C. Aggarwal
    Pages 315-371
  10. Deep Reinforcement Learning

    • Charu C. Aggarwal
    Pages 373-417
  11. Advanced Topics in Deep Learning

    • Charu C. Aggarwal
    Pages 419-458
  12. Back Matter

    Pages 459-497

About this book

This book covers both classical and modern models in deep learning. The chapters of this book span three categories:

The basics of neural networks:  Many traditional machine learning models can be understood as special cases of neural networks.  An emphasis is placed in the first two chapters on understanding the relationship between traditional machine learning and neural networks. Support vector machines, linear/logistic regression, singular value decomposition, matrix factorization, and recommender systems are shown to be special cases of neural networks. These methods are studied together with recent feature engineering methods like word2vec.

Fundamentals of neural networks: A detailed discussion of training and regularization is provided in Chapters 3 and 4. Chapters 5 and 6 present radial-basis function (RBF) networks and restricted Boltzmann machines.

Advanced topics in neural networks: Chapters 7 and 8 discuss recurrent neural networks and convolutional neural networks. Several advanced topics like deep reinforcement learning, neural Turing machines, Kohonen self-organizing maps, and generative adversarial networks are introduced in Chapters 9 and 10.

The book is written for graduate students, researchers, and practitioners.   Numerous exercises are available along with a solution manual to aid in classroom teaching. Where possible, an application-centric view is highlighted in order to provide an understanding of the practical uses of each class of techniques.

Keywords

  • Deep Learning
  • Machine Learning
  • Radial Basis Function Networks
  • Restricted Boltzmann Machines
  • Recurrent Neural Networks
  • Convolutional Neural Networks
  • Neural networks
  • perceptron
  • deep reinforcement learning
  • word2vec
  • autoencoder
  • logistic regression
  • dropout
  • pretraining
  • backpropagation
  • conjugate gradient-descent
  • Adam
  • RMSProp
  • Kohonean self-organizaing map
  • generative adversarial networks

Reviews

“The book recommends itself as a stepping-stone of the research-intensive area of deep learning and a worthy continuation of the previous textbooks written by the author … . Thanks to its systematic and thorough approach complemented with the variety of resources (bibliographic and software references, exercises) neatly presented after each chapter, it is suitable for audiences of varied expertise or background.” (Irina Ioana Mohorianu, zbMATH 1402.68001, 2019)

Authors and Affiliations

  • IBM T. J. Watson Research Center, International Business Machines, Yorktown Heights, USA

    Charu C. Aggarwal

About the author

Charu C. Aggarwal is a Distinguished Research Staff Member (DRSM) at the IBM T. J. Watson Research Center in Yorktown Heights, New York. He completed his undergraduate degree in Computer Science from the Indian Institute of Technology at Kanpur in 1993 and his Ph.D. in Operations Research from the Massachusetts Institute of Technology in 1996. He has published more than 350 papers in refereed conferences and journals, and has applied for or been granted more than 80 patents. He is author or editor of 18 books, including textbooks on data mining, machine learning (for text), recommender systems, and outlier analy-sis. Because of the commercial value of his patents, he has thrice been designated a Master Inventor at IBM. He has received several inter-nal and external awards, including the EDBT Test-of-Time Award (2014) and the IEEE ICDM Research Contributions Award (2015). Aside from serving as program or general chair of many major conferences in data mining, he is an editor-in-chief of the ACM SIGKDD Explorations and also of the ACM Transactions on Knowledge Discovery from Data. He is a fellow of the SIAM, ACM, and the IEEE, for “contributions to knowledge discovery and data mining algorithms.”

Bibliographic Information

  • Book Title: Neural Networks and Deep Learning

  • Book Subtitle: A Textbook

  • Authors: Charu C. Aggarwal

  • DOI: https://doi.org/10.1007/978-3-319-94463-0

  • Publisher: Springer Cham

  • eBook Packages: Computer Science, Computer Science (R0)

  • Copyright Information: Springer International Publishing AG, part of Springer Nature 2018

  • Hardcover ISBN: 978-3-319-94462-3

  • Softcover ISBN: 978-3-030-06856-1

  • eBook ISBN: 978-3-319-94463-0

  • Edition Number: 1

  • Number of Pages: XXIII, 497

  • Number of Illustrations: 128 b/w illustrations, 11 illustrations in colour

  • Topics: Artificial Intelligence, Computer Communication Networks, Processor Architectures

Buying options

eBook USD 59.99
Price excludes VAT (USA)
  • ISBN: 978-3-319-94463-0
  • Instant PDF download
  • Readable on all devices
  • Own it forever
  • Exclusive offer for individuals only
  • Tax calculation will be finalised during checkout
Softcover Book USD 79.99
Price excludes VAT (USA)
Hardcover Book USD 69.99
Price excludes VAT (USA)