Principles and Theory for Data Mining and Machine Learning

  • Bertrand Clarke
  • Ernest Fokoue
  • Hao Helen Zhang

Part of the Springer Series in Statistics book series (SSS)

Table of contents

  1. Front Matter
    Pages i-xiv
  2. Bertrand Clarke, Ernest Fokoué, Hao Helen Zhang
    Pages 1-52
  3. Bertrand Clarke, Ernest Fokoué, Hao Helen Zhang
    Pages 53-116
  4. Bertrand Clarke, Ernest Fokoué, Hao Helen Zhang
    Pages 117-170
  5. Bertrand Clarke, Ernest Fokoué, Hao Helen Zhang
    Pages 171-230
  6. Bertrand Clarke, Ernest Fokoué, Hao Helen Zhang
    Pages 231-306
  7. Bertrand Clarke, Ernest Fokoué, Hao Helen Zhang
    Pages 307-363
  8. Bertrand Clarke, Ernest Fokoué, Hao Helen Zhang
    Pages 365-404
  9. Bertrand Clarke, Ernest Fokoué, Hao Helen Zhang
    Pages 405-491
  10. Bertrand Clarke, Ernest Fokoué, Hao Helen Zhang
    Pages 493-568
  11. Bertrand Clarke, Ernest Fokoué, Hao Helen Zhang
    Pages 569-678
  12. Bertrand Clarke, Ernest Fokoué, Hao Helen Zhang
    Pages 679-742
  13. Back Matter
    Pages 1-38

About this book

Introduction

This book is a thorough introduction to the most important topics in data mining and machine learning. It begins with a detailed review of classical function estimation and proceeds with chapters on nonlinear regression, classification, and ensemble methods. The final chapters focus on clustering, dimension reduction, variable selection, and multiple comparisons. All these topics have undergone extraordinarily rapid development in recent years and this treatment offers a modern perspective emphasizing the most recent contributions. The presentation of foundational results is detailed and includes many accessible proofs not readily available outside original sources. While the orientation is conceptual and theoretical, the main points are regularly reinforced by computational comparisons.

Intended primarily as a graduate level textbook for statistics, computer science, and electrical engineering students, this book assumes only a strong foundation in undergraduate statistics and mathematics, and facility with using R packages. The text has a wide variety of problems, many of an exploratory nature. There are numerous computed examples, complete with code, so that further computations can be carried out readily. The book also serves as a handbook for researchers who want a conceptual overview of the central topics in data mining and machine learning.

Bertrand Clarke is a Professor of Statistics in the Department of Medicine, Department of Epidemiology and Public Health, and the Center for Computational Sciences at the University of Miami. He has been on the Editorial Board of the Journal of the American Statistical Association, the Journal of Statistical Planning and Inference, and Statistical Papers. He is co-winner, with Andrew Barron, of the 1990 Browder J. Thompson Prize from the Institute of Electrical and Electronic Engineers.

Ernest Fokoue is an Assistant Professor of Statistics at Kettering University. He has also taught at Ohio State University and been a long term visitor at the Statistical and Mathematical Sciences Institute where he was a Post-doctoral Research Fellow in the Data Mining and Machine Learning Program. In 2000, he was the winner of the Young Researcher Award from the International Association for Statistical Computing.

Hao Helen Zhang is an Associate Professor of Statistics in the Department of Statistics at North Carolina State University. For 2003-2004, she was a Research Fellow at SAMSI and in 2007, she won a Faculty Early Career Development Award from the National Science Foundation. She is on the Editorial Board of the Journal of the American Statistical Association and Biometrics.

Keywords

Clustering classification data mining linear regression machine learning supervised learning unsupervised learning

Authors and affiliations

  • Bertrand Clarke
    • 1
  • Ernest Fokoue
    • 2
  • Hao Helen Zhang
    • 3
  1. 1.Dept. StatisticsUniversity of British ColumbiaVancouverCanada
  2. 2.Dept. Science & MathematicsKettering UniversityFlintU.S.A.
  3. 3.Dept. StatisticsNorth Carolina State UniversityRaleighU.S.A.

Bibliographic information

  • DOI https://doi.org/10.1007/978-0-387-98135-2
  • Copyright Information Springer-Verlag New York 2009
  • Publisher Name Springer, New York, NY
  • eBook Packages Mathematics and Statistics
  • Print ISBN 978-0-387-98134-5
  • Online ISBN 978-0-387-98135-2
  • Series Print ISSN 0172-7397
  • About this book