Shows how the expressiveness of OCaml allows for fast and safe development of data science applications
Exemplifies use cases drawn from many areas of Data Science, Machine Learning, and AI
Explains step by step how to deploy at scale, using parallel, distributed, and accelerated cloud computing frameworks
Part of the book series: Undergraduate Topics in Computer Science (UTICS)
This is a preview of subscription content, access via your institution.
Table of contents (16 chapters)
Advanced Data Analysis Techniques
About this book
This book is about the harmonious synthesis of functional programming and numerical computation. It shows how the expressiveness of OCaml allows for fast and safe development of data science applications. Step by step, the authors build up to use cases drawn from many areas of Data Science, Machine Learning, and AI, and then delve into how to deploy at scale, using parallel, distributed, and accelerated frameworks to gain all the advantages of cloud computing environments.
To this end, the book is divided into three parts, each focusing on a different area. Part I begins by introducing how basic numerical techniques are performed in OCaml, including classical mathematical topics (interpolation and quadrature), statistics, and linear algebra. It moves on from using only scalar values to multi-dimensional arrays, introducing the tensor and Ndarray, core data types in any numerical computing system. It concludes with two more classical numerical computing topics, the solution of Ordinary Differential Equations (ODEs) and Signal Processing, as well as introducing the visualization module we use throughout this book. Part II is dedicated to advanced optimization techniques that are core to most current popular data science fields. We do not focus only on applications but also on the basic building blocks, starting with Algorithmic Differentiation, the most crucial building block that in turn enables Deep Neural Networks. We follow this with chapters on Optimization and Regression, also used in building Deep Neural Networks. We then introduce Deep Neural Networks as well as topic modelling in Natural Language Processing (NLP), two advanced and currently very active fields in both industry and academia. Part III collects a range of case studies demonstrating how you can build a complete numerical application quickly from scratch using Owl. The cases presented include computer vision and recommender systems.
This book aims at anyone with a basic knowledge of functional programming and a desire to explore the world of scientific computing, whether to generally explore the field in the round, to build applications for particular topics, or to deep-dive into how numerical systems are constructed. It does not assume strict ordering in reading – readers can simply jump to the topic that interests them most.
- Scientific Computing
- Functional Programming
- Machine Learning
- Numerical Analysis
- Programming Languages
Authors and Affiliations
Computer Science and Technology, University of Cambridge, Cambridge, UK
Liang Wang, Jianxin Zhao, Richard Mortier
About the authors
Liang Wang is a Principal AI Architect at Nokia, a Senior Researcher at the University of Cambridge, an Intel Software Innovator, and the Chief Scientific Officer at IKVA. He has a broad research interest in artificial intelligence, machine learning, operating systems, computer networks, optimization theory, and graph theory.
Jianxin Zhao is a PhD graduate from the University of Cambridge. His research interests include numerical computation, artificial intelligence, decentralized systems, and their application in the real world.
Richard Mortier is a Professor in Computing and Human-Data Interaction at the University of Cambridge. He works at the intersection of systems and networking with human-computer interaction, and he currently focuses on building user-centric systems infrastructure.
Book Title: OCaml Scientific Computing
Book Subtitle: Functional Programming in Data Science and Artificial Intelligence
Authors: Liang Wang, Jianxin Zhao, Richard Mortier
Series Title: Undergraduate Topics in Computer Science
Publisher: Springer Cham
eBook Packages: Computer Science, Computer Science (R0)
Copyright Information: The Editor(s) (if applicable) and The Author(s), under exclusive license to Springer Nature Switzerland AG 2022
Softcover ISBN: 978-3-030-97644-6Published: 27 May 2022
eBook ISBN: 978-3-030-97645-3Published: 26 May 2022
Series ISSN: 1863-7310
Series E-ISSN: 2197-1781
Edition Number: 1
Number of Pages: XXII, 359
Number of Illustrations: 32 b/w illustrations, 73 illustrations in colour
Topics: Programming Language, Mathematics of Computing, Special Purpose and Application-Based Systems, Data Science