# Introductory Statistics with R

Textbook

Part of the Statistics and Computing book series (SCO)

1. Front Matter
Pages i-xvi
2. Peter Dalgaard
Pages 1-29
3. Peter Dalgaard
Pages 31-53
4. Peter Dalgaard
Pages 55-65
5. Peter Dalgaard
Pages 67-94
6. Peter Dalgaard
Pages 95-107
7. Peter Dalgaard
Pages 109-125
8. Peter Dalgaard
Pages 127-143
9. Peter Dalgaard
Pages 145-154
10. Peter Dalgaard
Pages 155-162
11. Peter Dalgaard
Pages 163-184
12. Peter Dalgaard
Pages 185-194
13. Peter Dalgaard
Pages 195-225
14. Peter Dalgaard
Pages 227-248
15. Peter Dalgaard
Pages 249-258
16. Peter Dalgaard
Pages 259-274
17. Peter Dalgaard
Pages 275-288
18. Back Matter
Pages 289-364

### Introduction

R is an Open Source implementation of the S language. It works on multiple computing platforms and can be freely downloaded. R is now in widespread use for teaching at many levels as well as for practical data analysis and methodological development.

This book provides an elementary-level introduction to R, targeting both non-statistician scientists in various fields and students of statistics. The main mode of presentation is via code examples with liberal commenting of the code and the output, from the computational as well as the statistical viewpoint. A supplementary R package can be downloaded and contains the data sets.

The statistical methodology includes statistical standard distributions, one- and two-sample tests with continuous data, regression analysis, one- and two-way analysis of variance, regression analysis, analysis of tabular data, and sample size calculations. In addition, the last six chapters contain introductions to multiple linear regression analysis, linear models in general, logistic regression, survival analysis, Poisson regression, and nonlinear regression.

In the second edition, the text and code have been updated to R version 2.6.2. The last two methodological chapters are new, as is a chapter on advanced data handling. The introductory chapter has been extended and reorganized as two chapters. Exercises have been revised and answers are now provided in an Appendix.

Peter Dalgaard is associate professor at the Department of Biostatistics at the University of Copenhagen and has extensive experience in teaching within the PhD curriculum at the Faculty of Health Sciences. He has been a member of the R Core Team since 1997.

### Keywords

ANOVA Analysis Analysis of variance Curve fitting Descriptive statistics Fitting Open Source Regression analysis Survival analysis data analysis linear regression statistics

#### Authors and affiliations

1. 1.Dept. BiostatisticsUniversity of CopenhagenKoebenhavn KDenmark

Peter Dalgaard is associate professor at the Biostatistical Department at the University of Copenhagen and has extensive experience in teaching within the PhD curriculum at the Faculty of Health Sciences. He was chairman of the Danish Society for Theoretical Statistics from 1996 to 2000. Peter Dalgaard has been a key member of the R Core Team since August 1997 and is well known among R users.

### Bibliographic information

• Book Title Introductory Statistics with R
• Authors Peter Dalgaard
• Series Title Statistics and Computing
• DOI https://doi.org/10.1007/978-0-387-79054-1
• Publisher Name Springer, New York, NY
• eBook Packages Mathematics and Statistics Mathematics and Statistics (R0)
• Softcover ISBN 978-0-387-79053-4
• eBook ISBN 978-0-387-79054-1
• Series ISSN 1431-8784
• Edition Number 2
• Number of Pages XVI, 364
• Number of Illustrations 0 b/w illustrations, 0 illustrations in colour
• Topics
• Buy this book on publisher's site

## Reviews

From the reviews:

TECHNOMETRICS

"…extensive, well organized, and well documented…The book is an elegant R companion, suitable for the statistically initiated who want to program their own analyses. For experienced statisticians and data analysts, the book provides a good overview of the basic statistical analysis capabilities of R and presumably prepares readers for later migration to S…The format of this compact book is attractive…The book makes excellent use of fonts and intersperses graphics near the codes that produced them. Output from each procedure is dissected line by line to link R code with the computed result…I can recommend [this book] to its target audience. The author provides an excellent overview of R. I found the wealth of clear examples educational and a practical way to preview both R and S."

"The scope of the book, introductory statistics, is a very useful set of methods in parametric and non-parametric statistics up to logistic regression and survival analysis. … Where many constructs in R are very attractive for mathematical oriented users, e.g. matrices, Dalgaard succeeded in convincing me that with little extra effort they can be made very useful to less mathematically oriented people, e.g. by specifying row and column names, and proposing quite attractive ways to specify for example ‘subsets’ of rows and columns." (Dr. H. W. M. Hendriks, Kwantitatieve Methoden, Vol. 72B8, 2003)

"R is an Open Source implementation of the well-known S language. It works on multiple computing platforms and can be freely downloaded. R is thus ideally suited for teaching at many levels as well as for practical data analysis and methodological development. This book provides an elementary-level introduction to R, targeting both non-statistician scientists in various fields and students of statistics. … Brief sections introduce the statistical methods before they are used. A supplementary R package can be downloaded and contains the data sets." (Zentralblatt für Didaktik der Mathematik, August, 2004)

"This is a nice book on statistical methods and statistical computing in R, a language and environment for statistical computing and graphs: this dialect of the S language is available as free software through internet. … Explanation of statistical methods, together with an interpretation of statistical concepts, is the prevailing style of the text. They are illustrated by plenty of practical examples, all computed using R. This book will be useful for novices in applied statistics or in computing in R." (European Mathematical Society Newsletter, September, 2003)

"The book is an elegant R companion, suitable for the statistically initiated who want to program their own analyses. For experienced statisticians and data analysts, the book provides a good overview of the basic statistical analysis capabilities of R … prepares readers for later migration to S. … I can recommend Introductory Statistics With R to its target audience. The author provides an excellent overview of R. I found the wealth of clear examples educational and a practical way to preview both R and S." (Thomas D. Sandry, Technometrics, Vol. 45 (3), 2003)

"R is both a statistical computer environment and a programming language designed to perform statistical analysis and to produce adequate corresponding graphics. … The present book is … a very useful guide for introducing a number of basic concepts and techniques necessary to practical statistics, covering both elementary statistics and actual programming in the R language. The book is organized in 12 chapters and three appendices, each chapter ending with a beneficial section of proposed exercises." (Silvia Curteanu, Zentralblatt MATH, Vol. 1006, 2003)

From the reviews of the second edition:

“This review … roughly cover the introductory topics of a first year statistics course. The Introductory Statistics with R (ISwR) book is targeted for a biometric/medical audience. It covers more topics … like multiple regression and survival analysis and expects the reader to know about basic statistics. … include examples and graphs together with the R code to construct them. … The ISwR book is good for an academic and biometric audience.” (Wolfgang Polasek, Statistical Papers, Vol. 52, 2011)

“This is a welcome addition to the new edition that will be appreciated by its users. … The new edition is well written, and the new materials are well incorporated. Like the first edition, this edition will continue to be useful to the target audience, and I can safely recommend it to them.” (Technometrics, Vol. 51 (2), May, 2009)