Skip to main content

Random Forests

  • Chapter
  • First Online:
  • 3857 Accesses

Abstract

This chapter focuses on building Random Forests (RF) with PySpark for classification. We will learn about various aspects of them and how the predictions take place; but before knowing more about random forests, we have to learn the building block of RF that is a decision tree (DT). A decision tree is also used for Classification/Regression. but in terms of accuracy, random forests beat DT classifiers due to various reasons that we will cover later in the chapter. Let’s learn more about decision trees.

This is a preview of subscription content, log in via an institution.

Buying options

eBook
USD   24.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever

Tax calculation will be finalised at checkout

Purchases are for personal use only

Learn about institutional subscriptions

Author information

Authors and Affiliations

Authors

Rights and permissions

Reprints and permissions

Copyright information

© 2019 Pramod Singh

About this chapter

Check for updates. Verify currency and authenticity via CrossMark

Cite this chapter

Singh, P. (2019). Random Forests. In: Machine Learning with PySpark . Apress, Berkeley, CA. https://doi.org/10.1007/978-1-4842-4131-8_6

Download citation

Publish with us

Policies and ethics