Skip to main content

Introduction to Probability, Statistics & R

Foundations for Data-Based Sciences

  • Textbook
  • © 2024


  • Covers the most fundamental topics in probability and statistics, both theoretical and applied
  • Includes a dedicated R package to learn, practise and use the theory
  • Request lecturer material:
  • 6560 Accesses

This is a preview of subscription content, log in via an institution to check access.

Access this book

eBook USD 16.99 USD 54.99
Discount applied Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Hardcover Book USD 69.99
Price excludes VAT (USA)
  • Durable hardcover edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Other ways to access

Licence this eBook for your library

Institutional subscriptions

Table of contents (20 chapters)

  1. Introduction to Basic Statistics and R

  2. Introduction to Probability

  3. Introduction to Statistical Inference

  4. Advanced Distribution Theory and Probability


About this book

A strong grasp of elementary statistics and probability, along with basic skills in using R, is essential for various scientific disciplines reliant on data analysis. This book serves as a gateway to learning statistical methods from scratch, assuming a solid background in high school mathematics. Readers gradually progress from basic concepts to advanced statistical modelling, with examples from actuarial, biological, ecological, engineering, environmental, medicine, and social sciences highlighting the real-world relevance of the subject. An accompanying R package enables seamless practice and immediate application, making it ideal for beginners.
The book comprises 19 chapters divided into five parts. Part I introduces basic statistics and the R software package, teaching readers to calculate simple statistics and create basic data graphs. Part II delves into probability concepts, including rules and conditional probability, and introduces widelyused discrete and continuous probability distributions (e.g., binomial, Poisson, normal, log-normal). It concludes with the central limit theorem and joint distributions for multiple random variables. Part III explores statistical inference, covering point and interval estimation, hypothesis testing, and Bayesian inference. This part is intentionally less technical, making it accessible to readers without an extensive mathematical background. Part IV addresses advanced probability and statistical distribution theory, assuming some familiarity with (or concurrent study of) mathematical methods like advanced calculus and linear algebra. Finally, Part V focuses on advanced statistical modelling using simple and multiple regression and analysis of variance, laying the foundation for further studies in machine learning and data science applicable to various data and decision analytics contexts.
Based on years of teaching experience, this textbook includes numerousexercises and makes extensive use of R, making it ideal for year-long data science modules and courses. In addition to university courses, the book amply covers the syllabus for the Actuarial Statistics 1 examination of the Institute and Faculty of Actuaries in London. It also provides a solid foundation for postgraduate studies in statistics and probability, or a reliable reference for statistics.

Authors and Affiliations

  • School of Mathematical Sciences, University of Southampton, Southampton, UK

    Sujit K. Sahu

About the author

Sujit Sahu is a Professor of Statistics at the University of Southampton. He is the author of the book Bayesian Modeling of Spatio-Temporal Data with R published by Chapman and Hall/CRC Press. He has published more than 60 research papers on statistical methods and modelling.

Bibliographic Information

Publish with us